Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2012
Statistical modeling of school effectiveness data was originally motivated by the dissatisfaction with the analysis of (school-leaving) examination results that took no account of the background of the students or regarded each school as an isolated unit of analysis. The application of multilevel analysis was generally regarded as a breakthrough,…
Descriptors: School Effectiveness, Data Analysis, Statistical Analysis, Statistical Studies
Passos, Valeria Lima; Berger, Martijn P. F.; Tan, Frans E. S. – Journal of Educational and Behavioral Statistics, 2008
During the early stage of computerized adaptive testing (CAT), item selection criteria based on Fisher's information often produce less stable latent trait estimates than the Kullback-Leibler global information criterion. Robustness against early stage instability has been reported for the D-optimality criterion in a polytomous CAT with the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Evaluation Criteria, Item Analysis
Kim, Seonghoon; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2007
Under item response theory, the characteristic curve methods (Haebara and Stocking-Lord methods) are used to link two ability scales from separate calibrations. The linking methods use their respective criterion functions that can be defined differently according to the symmetry- and distribution-related schemes. The symmetry-related scheme…
Descriptors: Measures (Individuals), Item Response Theory, Simulation, Comparative Analysis