ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Descriptor

Error of Measurement	5
Probability	5
Item Analysis	3
Item Response Theory	3
Test Items	3
Equated Scores	2
Guidelines	2
Sample Size	2
Simulation	2
Accuracy	1
Achievement Tests	1
Cutting Scores	1
Data	1
Decision Making	1
Difficulty Level	1
Evaluators	1
Foreign Countries	1
Generalization	1
Goodness of Fit	1
International Assessment	1
Matrices	1
Measurement Techniques	1
Models	1
Secondary School Students	1
Standard Setting	1
More ▼

Source

Journal of Educational…

Author

Andersson, Björn	1
Bolsinova, Maria	1
Clauser, Brian E.	1
Clauser, Jerome C.	1
Dawis, Rene V.	1
Kane, Michael	1
Liaw, Yuan-Ling	1
Rutkowski, David	1
Rutkowski, Leslie	1
Tijmstra, Jesper	1
Whitely, Susan E.	1
Yuan, Ke-Hai	1
Zu, Jiyun	1
More ▼

Publication Type

Journal Articles	4
Reports - Evaluative	2
Reports - Descriptive	1
Reports - Research	1

Education Level

Secondary Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Examining the Precision of Cut Scores within a Generalizability Theory Framework: A Closer Look at the Item Effect

Peer reviewed

Direct link

Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020

An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…

Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Direct link

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

Asymptotic Standard Errors of Observed-Score Equating with Polytomous IRT Models

Peer reviewed

Direct link

Andersson, Björn – Journal of Educational Measurement, 2016

In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…

Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests

Standard Error of Linear Observed-Score Equating for the NEAT Design with Nonnormally Distributed Data

Peer reviewed

Direct link

Zu, Jiyun; Yuan, Ke-Hai – Journal of Educational Measurement, 2012

In the nonequivalent groups with anchor test (NEAT) design, the standard error of linear observed-score equating is commonly estimated by an estimator derived assuming multivariate normality. However, real data are seldom normally distributed, causing this normal estimator to be inconsistent. A general estimator, which does not rely on the…

Descriptors: Sample Size, Equated Scores, Test Items, Error of Measurement

The Nature of Objectivity with the Rasch Model

Peer reviewed

Whitely, Susan E.; Dawis, Rene V. – Journal of Educational Measurement, 1974

Descriptors: Error of Measurement, Item Analysis, Matrices, Measurement Techniques