ERIC - Search Results

Showing all 3 results

Applying a Multiple Comparison Control to IRT Item-Fit Testing

Peer reviewed

Sauder, Derek; DeMars, Christine – Applied Measurement in Education, 2020

We used simulation techniques to assess the item-level and familywise Type I error control and power of an IRT item-fit statistic, the S-X². Previous research indicated that the S-X² has good Type I error control and decent power, but no previous research examined familywise Type I error control.…

Descriptors: Item Response Theory, Test Items, Sample Size, Test Length

Measurement Invariance in International Surveys: Categorical Indicators and Fit Measure Performance

Peer reviewed

Rutkowski, Leslie; Svetina, Dubravka – Applied Measurement in Education, 2017

In spite of the challenges inherent in making dozens of comparisons across heterogeneous populations, a relatively recent interest in scale-score equivalence for non-achievement measures in an international context has emerged. Until recently, operational procedures for establishing measurement invariance using multiple-groups analyses were…

Descriptors: International Assessment, Goodness of Fit, Statistical Analysis, Teacher Surveys

Evaluating Comparative Judgment as an Approach to Essay Scoring

Peer reviewed

Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016

As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…

Descriptors: Essays, Scoring, Comparative Analysis, Evaluators