Wind, Stefanie A.; Xu, Yangmeng – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
Hong, Seong Eun; Monroe, Scott; Falk, Carl F. – Journal of Educational Measurement, 2020
In educational and psychological measurement, a person-fit statistic (PFS) is designed to identify aberrant response patterns. For parametric PFSs, valid inference depends on several assumptions, one of which is that the item response theory (IRT) model is correctly specified. Previous studies have used empirical data sets to explore the effects…
Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Error of Measurement
Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level. We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that aggregate linkages can be validated both…
Descriptors: Equated Scores, Validity, Methods, School Districts