Hong, Seong Eun; Monroe, Scott; Falk, Carl F. – Journal of Educational Measurement, 2020
In educational and psychological measurement, a person-fit statistic (PFS) is designed to identify aberrant response patterns. For parametric PFSs, valid inference depends on several assumptions, one of which is that the item response theory (IRT) model is correctly specified. Previous studies have used empirical data sets to explore the effects…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Error of Measurement
Topczewski, Anna Marie – ProQuest LLC, 2013
Developmental score scales represent the performance of students along a continuum, where, as students learn more, they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…
Descriptors: Item Response Theory, Scaling, Scores, Student Development
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods
Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006
A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…
Descriptors: Cheating, Test Items, Simulation, Statistical Analysis
Linn, Bob; McLaughlin, Don; Jiang, Tao; Gallagher, Larry – American Institutes for Research, 2004
The purpose of this simulation was to assess the improvements in estimates of standard errors that could be expected if students participating in NAEP were pre-assigned to test booklets that were adapted to their level of performance based on their state assessment scores. Students in extreme quartiles would receive one regular NAEP block and…
Descriptors: Educational Improvement, Educational Assessment, Error of Measurement, Educational Testing
Patience, Wayne M.; Reckase, Mark D. – 1979
Simulated tailored tests were used to investigate the relationships between characteristics of the item pool and the computer program, and the reliability and bias of the resulting ability estimates. The computer program was varied to provide for various step sizes (differences in difficulty between successive steps) and different acceptance…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Educational Testing