ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3

Descriptor

Educational Testing	3
Error of Measurement	3
Goodness of Fit	2
Item Response Theory	2
Achievement Tests	1
Conflict Resolution	1
Correlation	1
Equated Scores	1
Evaluation Methods	1
Examiners	1
Grade 4	1
Grade 8	1
Grade Prediction	1
Interrater Reliability	1
Mathematics Tests	1
Methods	1
Models	1
National Competency Tests	1
Psychological Testing	1
Psychometrics	1
Reading Tests	1
Sample Size	1
School Districts	1
Scoring	1
Simulation	1

Source

Educational Assessment	1
Journal of Educational…	1
Journal of Educational and…	1

Author

Falk, Carl F.	1
Ho, Andrew D.	1
Hong, Seong Eun	1
Kalogrides, Demetra	1
Monroe, Scott	1
Reardon, Sean F.	1
Stefanie A. Wind	1
Yangmeng Xu	1

Publication Type

Journal Articles	3
Reports - Research	3

Education Level

Elementary Education	1
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Assessments and Surveys

Measures of Academic Progress	1
National Assessment of…	1

Showing all 3 results

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Performance of Person-Fit Statistics under Model Misspecification

Peer reviewed

Hong, Seong Eun; Monroe, Scott; Falk, Carl F. – Journal of Educational Measurement, 2020

In educational and psychological measurement, a person-fit statistic (PFS) is designed to identify aberrant response patterns. For parametric PFSs, valid inference depends on several assumptions, one of which is that the item response theory (IRT) model is correctly specified. Previous studies have used empirical data sets to explore the effects…

Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Error of Measurement

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale

Peer reviewed
PDF on ERIC

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level. We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that aggregate linkages can be validated both…

Descriptors: Equated Scores, Validity, Methods, School Districts