ERIC - Search Results

Showing all 3 results

Effect of Sample Size on Common Item Equating Using the Dichotomous Rasch Model

Peer reviewed

O'Neill, Thomas R.; Gregg, Justin L.; Peabody, Michael R. – Applied Measurement in Education, 2020

This study addresses equating issues with varying sample sizes using the Rasch model by examining how sample size affects the stability of item calibrations and person ability estimates. A resampling design was used to create 9 sample size conditions (200, 100, 50, 45, 40, 35, 30, 25, and 20), each replicated 10 times. Items were recalibrated…

Descriptors: Sample Size, Equated Scores, Item Response Theory, Raw Scores
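
The resampling design described in the abstract (9 sample sizes, 10 replications each) can be sketched roughly as follows. This is only an illustration: the examinee pool, the function name `draw_resamples`, the fixed seed, and sampling without replacement within each replicate are all assumptions, since the abstract does not give those details. The Rasch recalibration step itself is not shown.

```python
import random

# Sample size conditions and replication count as listed in the abstract.
SAMPLE_SIZES = [200, 100, 50, 45, 40, 35, 30, 25, 20]
REPLICATIONS = 10


def draw_resamples(pool, sizes=SAMPLE_SIZES, replications=REPLICATIONS, seed=0):
    """Return {sample_size: [replicate_1, ..., replicate_k]}.

    Each replicate is a random subset of `pool`, drawn without
    replacement (an assumption; the abstract does not say).
    """
    rng = random.Random(seed)
    design = {}
    for n in sizes:
        design[n] = [rng.sample(pool, n) for _ in range(replications)]
    return design


# Hypothetical pool of 1,000 examinee IDs, used purely for illustration.
pool = list(range(1000))
design = draw_resamples(pool)
```

Each replicate in `design[n]` would then be calibrated separately, so that item and person estimates can be compared across replicates and across sample sizes.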

Approximating Scale Score Standard Error of Measurement from the Raw Score Standard Error.

Peer reviewed

Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1998

Two relatively simple methods for estimating the conditional standard error of measurement (SEM) for nonlinearly derived score scales are proposed. Applications indicate that these two procedures produce fairly consistent estimates that tend to peak near the high end of the scale and reach a minimum in the middle of the raw score scale. (SLD)

Descriptors: Error of Measurement, Estimation (Mathematics), Raw Scores, Reliability
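
To illustrate the general idea of carrying a raw-score SEM over to a nonlinear score scale, here is a minimal sketch of a slope-based (delta-method style) approximation: the scale-score SEM at a given raw score is taken as the local slope of the raw-to-scale conversion times the raw-score SEM. This is a generic technique and is not claimed to be either of the two methods the article proposes. The function name `scale_sem` and the conversion table are hypothetical.

```python
def scale_sem(raw_to_scale, raw_score, raw_sem):
    """Approximate the scale-score SEM at `raw_score`.

    `raw_to_scale[x]` is the scale score assigned to raw score x.
    The local slope is estimated with a central difference, falling
    back to a one-sided difference at either end of the table.
    """
    lo = max(raw_score - 1, 0)
    hi = min(raw_score + 1, len(raw_to_scale) - 1)
    slope = (raw_to_scale[hi] - raw_to_scale[lo]) / (hi - lo)
    return abs(slope) * raw_sem


# Hypothetical nonlinear conversion table for raw scores 0..4.
conversion = [100, 110, 125, 145, 170]

mid_sem = scale_sem(conversion, 2, 2.0)   # central difference at raw score 2
low_sem = scale_sem(conversion, 0, 1.5)   # one-sided difference at raw score 0
```

Because the slope of a nonlinear conversion changes along the scale, the same raw-score SEM can map to quite different scale-score SEMs at different score levels.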

A Comparison among IRT True- and Observed-Score Equatings and Traditional Equipercentile Equating.

Peer reviewed

Han, Tianqi; And Others – Applied Measurement in Education, 1997

Stability among equating procedures was studied by comparing item response theory (IRT) true-score equating with IRT observed-score equating, IRT true-score equating with equipercentile equating, and IRT observed-score equating with equipercentile equating. On average, IRT true-score equating more frequently produced more stable conversions. (SLD)

Descriptors: Comparative Analysis, Equated Scores, Item Response Theory, Raw Scores