ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	2

Descriptor

Evaluation Methods	3
Interrater Reliability	3
Evaluators	2
Goodness of Fit	2
Scoring	2
Academic Achievement	1
Classification	1
Comparative Analysis	1
Cutting Scores	1
Decision Making	1
Examiners	1
Identification	1
Individual Characteristics	1
Knowledge Level	1
Mathematical Models	1
Multiple Choice Tests	1
Performance Based Assessment	1
Psychometrics	1
Sampling	1
Scores	1
Selection	1
Simulation	1
Standard Setting (Scoring)	1
Students	1
Test Interpretation	1
More ▼

Source

Educational Measurement:…

Author

Jaeger, Richard M.	1
Stefanie A. Wind	1
Walker, A. Adrienne	1
Wind, Stefanie A.	1
Yangmeng Xu	1

Publication Type

Journal Articles	3
Reports - Research	2
Opinion Papers	1
Reports - Evaluative	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 3 results Save | Export

Examining the Psychometric Impact of Targeted and Random Double-Scoring in Mixed-Format Assessments

Peer reviewed

Direct link

Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025

Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…

Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

Selection of Judges for Standard-Setting.

Peer reviewed

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991

Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)

Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners