ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Comparative Analysis	4
Probability	4
Item Response Theory	3
Test Items	3
Test Bias	2
Algebra	1
Bayesian Statistics	1
Benchmarking	1
Case Studies	1
Citizenship	1
College Admission	1
College Entrance Examinations	1
Computation	1
Difficulty Level	1
Effect Size	1
Error of Measurement	1
Evaluation Criteria	1
Foreign Countries	1
Geometry	1
Grade 7	1
Graduate Study	1
Higher Education	1
Item Banks	1
Judges	1
Literacy	1
More ▼

Source

Applied Measurement in…

Author

Andrich, David	1
Bridgeman, Brent	1
Demars, Christine E.	1
Heldsinger, Sandra	1
Humphry, Stephen	1
Kim, Stella Yun	1
Lawless, Rene	1
Lee, Won-Chan	1
Oliveri, Maria Elena	1
Robin, Frederic	1

Publication Type

Journal Articles	4
Reports - Research	3
Reports - Evaluative	1

Education Level

Elementary Education	1
Grade 7	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Australia

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations

What Works Clearinghouse Rating

Showing all 4 results Save | Export

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

An Exploratory Analysis of Differential Item Functioning and Its Possible Sources in a Higher Education Admissions Context

Peer reviewed

Direct link

Oliveri, Maria Elena; Lawless, Rene; Robin, Frederic; Bridgeman, Brent – Applied Measurement in Education, 2018

We analyzed a pool of items from an admissions test for differential item functioning (DIF) for groups based on age, socioeconomic status, citizenship, or English language status using Mantel-Haenszel and item response theory. DIF items were systematically examined to identify its possible sources by item type, content, and wording. DIF was…

Descriptors: Test Bias, Comparative Analysis, Item Banks, Item Response Theory

Requiring a Consistent Unit of Scale between the Responses of Students and Judges in Standard Setting

Peer reviewed

Direct link

Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014

One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When scored dichotomously, judges estimate the probability that a benchmark student has of answering each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…

Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory

An Analytic Comparison of Effect Sizes for Differential Item Functioning

Peer reviewed

Direct link

Demars, Christine E. – Applied Measurement in Education, 2011

Three types of effects sizes for DIF are described in this exposition: log of the odds-ratio (differences in log-odds), differences in probability-correct, and proportion of variance accounted for. Using these indices involves conceptualizing the degree of DIF in different ways. This integrative review discusses how these measures are impacted in…

Descriptors: Effect Size, Test Bias, Probability, Difficulty Level