Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015
The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…
Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping
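As a rough illustration of the matching idea mentioned in the abstract above, the following sketch computes the standard Mantel-Haenszel common odds ratio with thin matching, where every distinct total score forms its own stratum. It is not the procedure from the cited article. The function names, the record layout `(group, total_score, correct)`, and the group labels `"ref"` and `"focal"` are assumptions made for this example.

```python
import math
from collections import defaultdict


def mh_odds_ratio(records):
    """Mantel-Haenszel common odds ratio for one studied item.

    records: iterable of (group, total_score, correct), where group is
    "ref" or "focal" (hypothetical labels), total_score is the matching
    variable, and correct is True/False for the studied item.
    Thin matching: each distinct total score is its own stratum.
    """
    # Per stratum: [A, B, C, D] = ref correct, ref incorrect,
    #                             focal correct, focal incorrect
    tables = defaultdict(lambda: [0, 0, 0, 0])
    for group, score, correct in records:
        idx = (0 if correct else 1) + (0 if group == "ref" else 2)
        tables[score][idx] += 1

    num = 0.0
    den = 0.0
    for a, b, c, d in tables.values():
        n = a + b + c + d
        num += a * d / n
        den += b * c / n
    if den == 0:
        raise ValueError("odds ratio undefined: no informative strata")
    return num / den


def mh_delta(alpha):
    """Convert the odds ratio to the ETS delta scale (MH D-DIF)."""
    return -2.35 * math.log(alpha)


def make_records(score, a, b, c, d):
    """Helper for the example: expand stratum counts into records."""
    return (
        [("ref", score, True)] * a
        + [("ref", score, False)] * b
        + [("focal", score, True)] * c
        + [("focal", score, False)] * d
    )


# Two score strata with made-up counts, purely for illustration.
example = make_records(1, 3, 1, 2, 2) + make_records(2, 2, 2, 2, 2)
alpha = mh_odds_ratio(example)
print(alpha, mh_delta(alpha))
```

An odds ratio above 1 (negative delta) means the item favors the reference group after matching on total score. Thick matching would pool adjacent scores into wider strata before building the tables.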
Topczewski, Anna Marie – ProQuest LLC, 2013
Developmental score scales represent student performance along a continuum on which students move higher as they learn more. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…
Descriptors: Item Response Theory, Scaling, Scores, Student Development
Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012
For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…
Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)
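For reference, the three-parameter logistic model named in the abstract above is usually written as follows. The notation is the conventional one and is assumed here rather than taken from the cited article: $\theta_j$ is the ability of examinee $j$, and $a_i$, $b_i$, $c_i$ are the discrimination, difficulty, and lower-asymptote parameters of item $i$. Some presentations also include a scaling constant $D = 1.7$ inside the exponent.

```latex
P(X_{ij} = 1 \mid \theta_j)
  = c_i + (1 - c_i)\,
    \frac{1}{1 + \exp\!\bigl(-a_i(\theta_j - b_i)\bigr)}
```

As $\theta_j \to -\infty$ the probability approaches $c_i$, which is why $c_i$ is often called the guessing parameter, the interpretation the abstract questions.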
Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010
Item calibration is an essential issue in modern psychological and educational testing based on item response theory. With the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than they were when paper-and-pencil test administration was the norm. There are many calibration…
Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement
Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006
A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…
Descriptors: Cheating, Test Items, Simulation, Statistical Analysis
Linn, Bob; McLaughlin, Don; Jiang, Tao; Gallagher, Larry – American Institutes for Research, 2004
The purpose of this simulation was to assess the improvements in estimates of standard errors that could be expected if students participating in NAEP were pre-assigned to test booklets that were adapted to their level of performance based on their state assessment scores. Students in extreme quartiles would receive one regular NAEP block and…
Descriptors: Educational Improvement, Educational Assessment, Error of Measurement, Educational Testing
Patience, Wayne M.; Reckase, Mark D. – 1979
Simulated tailored tests were used to investigate the relationships between characteristics of the item pool and the computer program, and the reliability and bias of the resulting ability estimates. The computer program was varied to provide for various step sizes (differences in difficulty between successive steps) and different acceptance…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Educational Testing