ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	7

Descriptor

Comparative Analysis	8
Evaluation Methods	5
Teaching Methods	3
Correlation	2
Cutting Scores	2
Scores	2
Simulation	2
Test Bias	2
Academic Accommodations…	1
Academic Achievement	1
Accuracy	1
Achievement Gap	1
Achievement Tests	1
Barriers	1
Bayesian Statistics	1
College Freshmen	1
Computation	1
Computer Assisted Testing	1
Content Area Reading	1
Decision Making	1
Disabilities	1
Educational Innovation	1
Educational Testing	1
Effect Size	1
Elementary Education	1
More ▼

Source

Educational Measurement:…

Author

Wyse, Adam E.	2
Albers, Casper J.	1
Babcock, Ben	1
Beldhuis, Hans J. A.	1
Boevé, Anja J.	1
Bosker, Roel J.	1
Cho, Sun-Joo	1
Lee, Woo-yeol	1
Li, Hongli	1
Linn, Robert L.	1
Meijer, Rob R.	1
Suh, Youngsuk	1
Walker, A. Adrienne	1
Wind, Stefanie A.	1
Wright, Daniel B.	1
More ▼

Publication Type

Journal Articles	8
Reports - Research	6
Reports - Evaluative	2

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

On Natural Variation in Grades in Higher Education, and Its Implications for Assessing Effectiveness of Educational Innovations

Peer reviewed

Direct link

Boevé, Anja J.; Meijer, Rob R.; Beldhuis, Hans J. A.; Bosker, Roel J.; Albers, Casper J. – Educational Measurement: Issues and Practice, 2019

To investigate the effect of innovations in the teaching-learning environment, researchers often compare study results from different cohorts across years. However, variance in scores can be attributed to both random fluctuation and systematic changes due to the innovation, complicating cohort comparisons. In the present study, we illustrate how…

Descriptors: Grades (Scholastic), Foreign Countries, Teaching Methods, Educational Innovation

An Investigation of Undefined Cut Scores with the Hofstee Standard-Setting Method

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2017

This article provides an overview of the Hofstee standard-setting method and illustrates several situations where the Hofstee method will produce undefined cut scores. The situations where the cut scores will be undefined involve cases where the line segment derived from the Hofstee ratings does not intersect the score distribution curve based on…

Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Comparative Analysis

An NCME Instructional Module on Latent DIF Analysis Using Mixture Item Response Models

Peer reviewed

Direct link

Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016

The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…

Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models

Speed Gaps: Exploring Differences in Response Latencies among Groups

Peer reviewed

Direct link

Wright, Daniel B. – Educational Measurement: Issues and Practice, 2019

There is much discussion about and many policies to address achievement gaps in education among groups of students. The focus here is on a different gap and it is argued that it also should be of concern. Speed gaps are differences in how quickly different groups of students answer the questions on academic assessments. To investigate some speed…

Descriptors: Academic Achievement, Achievement Gap, Reaction Time, Educational Testing

Five Methods for Estimating Angoff Cut Scores with IRT

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017

This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…

Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics

The Effects of Read-Aloud Accommodations for Students with and without Disabilities: A Meta-Analysis

Peer reviewed

Direct link

Li, Hongli – Educational Measurement: Issues and Practice, 2014

Read-aloud accommodations have been proposed as a way to help remove barriers faced by students with disabilities in reading comprehension. Many empirical studies have examined the effects of read-aloud accommodations; however, the results are mixed. With a variance-known hierarchical linear modeling approach, based on 114 effect sizes from 23…

Descriptors: Reading Instruction, Reading Strategies, Reading Comprehension, Barriers

Comparing State and District Results to National Norms: The Validity of the Claims that "Everyone Is above Average".

Peer reviewed

Linn, Robert L.; And Others – Educational Measurement: Issues and Practice, 1990

Results of a 1987 report--indicating that elementary students of all 50 states were above the national average--were assessed via 2 national mail and telephone surveys. Although results of data for 35 states support the general findings of the 1987 report, it appears that more specific results are less sensational. (TJH)

Descriptors: Achievement Tests, Comparative Analysis, Elementary Education, Evaluation Methods