Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Raw Scores | 3 |
Equated Scores | 2 |
Item Response Theory | 2 |
Comparative Analysis | 1 |
Error of Measurement | 1 |
Estimation (Mathematics) | 1 |
Multiple Choice Tests | 1 |
Psychometrics | 1 |
Reliability | 1 |
Sample Size | 1 |
Scaling | 1 |
More ▼ |
Source
Applied Measurement in… | 3 |
Author
Feldt, Leonard S. | 1 |
Gregg, Justin L. | 1 |
Han, Tianqi | 1 |
O'Neill, Thomas R. | 1 |
Peabody, Michael R. | 1 |
Qualls, Audrey L. | 1 |
Publication Type
Journal Articles | 3 |
Reports - Evaluative | 2 |
Reports - Research | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
O'Neill, Thomas R.; Gregg, Justin L.; Peabody, Michael R. – Applied Measurement in Education, 2020
This study addresses equating issues with varying sample sizes using the Rasch model by examining how sample size affects the stability of item calibrations and person ability estimates. A resampling design was used to create 9 sample size conditions (200, 100, 50, 45, 40, 35, 30, 25, and 20), each replicated 10 times. Items were recalibrated…
Descriptors: Sample Size, Equated Scores, Item Response Theory, Raw Scores

Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1998
Two relatively simple methods for estimating the condition standard error of measurement (SEM) for nonlinearly derived score scales are proposed. Applications indicate that these two procedures produce fairly consistent estimates that tend to peak near the high end of the scale and reach a minimum in the middle of the raw score scale. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Raw Scores, Reliability

Han, Tianqi; And Others – Applied Measurement in Education, 1997
Stability among equating procedures was studied by comparing item response theory (IRT) true-score equating with IRT observed-score equating, IRT true-score equating with equipercentile equating, and IRT observed-score equating with equipercentile equating. On average, IRT true-score equating more frequently produced more stable conversions. (SLD)
Descriptors: Comparative Analysis, Equated Scores, Item Response Theory, Raw Scores