Deighton, Jessica; Tymms, Peter; Vostanis, Panos; Belsky, Jay; Fonagy, Peter; Brown, Anna; Martin, Amelia; Patalay, Praveetha; Wolpert, Miranda – Journal of Psychoeducational Assessment, 2013
Early detection of child mental health problems in schools is critical for implementing strategies for prevention and intervention. The development of an effective measure of mental health and well-being for this context must be both empirically sound and practically feasible. This study reports the initial validation of a brief self-report…
Descriptors: Mental Health, Child Health, Well Being, School Health Services
Moses, Tim; Miao, Jing; Dorans, Neil – Educational Testing Service, 2010
This study compared the accuracies of four differential item functioning (DIF) estimation methods, where each method makes use of only one of the following: raw data, logistic regression, loglinear models, or kernel smoothing. The major focus was on the estimation strategies' potential for estimating score-level, conditional DIF. A secondary focus…
Descriptors: Test Bias, Statistical Analysis, Computation, Scores
Kim, Jihye – ProQuest LLC, 2010
In DIF studies, a Type I error refers to the mistake of identifying non-DIF items as DIF items, and a Type I error rate refers to the proportion of Type I errors in a simulation study. The possibility of making a Type I error in DIF studies is always present, and a high possibility of making such an error can weaken the validity of the assessment.…
Descriptors: Test Bias, Test Length, Simulation, Testing
Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
This report provides an overview of what was known about alternative assessment at the time that the article was written in 1991. Topics include beliefs about assessment reform, overview of alternative assessment including research knowledge, evidence of assessment impact, and critical features of alternative assessment. The author notes that in…
Descriptors: Alternative Assessment, Evaluation Methods, Evaluation Research, Performance Based Assessment
Santelices, Maria Veronica; Wilson, Mark – Harvard Educational Review, 2010
In 2003, the "Harvard Educational Review" published a controversial article by Roy Freedle that claimed bias against African American students in the SAT college admissions test. Freedle's work stimulated national media attention and faced an onslaught of criticism from experts at the Educational Testing Service (ETS), the agency…
Descriptors: College Entrance Examinations, Test Bias, Test Items, Difficulty Level
Barua, Rashmi; Lang, Kevin – National Bureau of Economic Research, 2009
Partly in response to increased testing and accountability, states and districts have been raising the minimum school entry age, but existing studies show mixed results regarding the effects of entry age. These studies may be severely biased because they violate the monotonicity assumption needed for LATE. We propose an instrument not subject to…
Descriptors: Educational Attainment, Age Differences, School Entrance Age, Testing
Johnson, Emily C.; Meade, Adam W.; DuVernet, Amy M. – Structural Equation Modeling: A Multidisciplinary Journal, 2009
Confirmatory factor analytic tests of measurement invariance (MI) require a referent indicator (RI) for model identification. Although the assumption that the RI is perfectly invariant across groups is acknowledged as problematic, the literature provides relatively little guidance for researchers to identify the conditions under which the practice…
Descriptors: Measurement, Validity, Factor Analysis, Models
Wells, Craig S.; Cohen, Allan S.; Patton, Jeffrey – International Journal of Testing, 2009
A primary concern with testing differential item functioning (DIF) using a traditional point-null hypothesis is that a statistically significant result does not imply that the magnitude of DIF is of practical interest. Similarly, for a given sample size, a non-significant result does not allow the researcher to conclude the item is free of DIF. To…
Descriptors: Test Bias, Test Items, Statistical Analysis, Hypothesis Testing
Unlu, Ali; Sargin, Anatol – Applied Psychological Measurement, 2009
Mondrian is state-of-the-art statistical data visualization software featuring modern interactive visualization techniques for a wide range of data types. This article reviews the capabilities, functionality, and interactive properties of this software package. Key features of Mondrian are illustrated with data from the Programme for International…
Descriptors: Statistical Data, Computer Graphics, Computer Software, Item Analysis
Peoples, Shelagh – ProQuest LLC, 2012
The purpose of this study was to determine which of three competing models will provide reliable, interpretable, and responsive measures of elementary students' understanding of the nature of science (NOS). The Nature of Science Instrument-Elementary (NOSI-E), a 28-item Rasch-based instrument, was used to assess students' NOS…
Descriptors: Scientific Principles, Science Tests, Elementary School Students, Item Response Theory
Fan, Jinyan; Meng, Hui; Zhao, Bihua; Patel, Trishna – Journal of Career Assessment, 2012
The authors report further validity evidence for the Chinese version of a U.S. adult social self-efficacy inventory, the "Perceived Social Self-Efficacy" (PSSE) scale in Chinese populations. Study 1 participants were 323 new graduate students enrolled at a large university in an east coast city of the People's Republic of China. Differential item…
Descriptors: Evidence, Self Efficacy, Measures (Individuals), Foreign Countries
Stone, Elizabeth; Davey, Tim – Educational Testing Service, 2011
There has been an increased interest in developing computer-adaptive testing (CAT) and multistage assessments for K-12 accountability assessments. The move to adaptive testing has been met with some resistance by those in the field of special education who express concern about routing of students with divergent profiles (e.g., some students with…
Descriptors: Disabilities, Adaptive Testing, Accountability, Computer Assisted Testing
Wright, Keith D. – ProQuest LLC, 2011
Standardized testing has been part of the American educational system for decades. Controversy has plagued standardized testing from the beginning, plagues it today, and will continue to do so. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…
Descriptors: Test Bias, Test Items, Simulation, Testing
Anthis, Kristine – Teaching of Psychology, 2011
Previous research on the effectiveness of clickers has found their use to be positively associated with exam scores but not without methodological issues that hinder the conclusions that can be drawn. To address these limitations, the current studies isolated the effects of clickers from the effects of questions presented with clickers. Study 1…
Descriptors: Student Reaction, Program Effectiveness, Foreign Countries, Handheld Devices
Oshima, T. C.; Morris, S. B. – Educational Measurement: Issues and Practice, 2008
Nambury S. Raju (1937-2005) developed two model-based indices for differential item functioning (DIF) during his prolific career in psychometrics. Both methods, Raju's area measures (Raju, 1988) and Raju's DFIT (Raju, van der Linden, & Fleer, 1995), are based on quantifying the gap between item characteristic functions (ICFs). This approach…
Descriptors: Test Bias, Psychometrics, Methods, Test Items

