Showing all 7 results
Peer reviewed
Chalmers, R. Philip – Educational and Psychological Measurement, 2018
This article discusses the theoretical and practical contributions of Zumbo, Gadermann, and Zeisser's family of ordinal reliability statistics. Implications, interpretation, recommendations, and practical applications regarding their ordinal measures, particularly ordinal alpha, are discussed. General misconceptions relating to this family of…
Descriptors: Misconceptions, Test Theory, Test Reliability, Statistics
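As a quick illustration of the statistic this entry debates: ordinal alpha is commonly described (following Zumbo, Gadermann, and Zeisser) as standardized coefficient alpha computed from a polychoric rather than Pearson correlation matrix. A minimal sketch, assuming the polychoric matrix has already been estimated elsewhere (the example matrix below is hypothetical):

```python
import numpy as np

def ordinal_alpha(poly_corr):
    """Standardized alpha computed from a pre-estimated polychoric
    correlation matrix: alpha = k * r_bar / (1 + (k - 1) * r_bar),
    where r_bar is the mean off-diagonal correlation."""
    R = np.asarray(poly_corr, dtype=float)
    k = R.shape[0]
    # average the inter-item (off-diagonal) correlations
    r_bar = R[~np.eye(k, dtype=bool)].mean()
    return k * r_bar / (1 + (k - 1) * r_bar)

# hypothetical polychoric correlation matrix for a 4-item scale
R = np.array([
    [1.0, 0.5, 0.4, 0.5],
    [0.5, 1.0, 0.6, 0.4],
    [0.4, 0.6, 1.0, 0.5],
    [0.5, 0.4, 0.5, 1.0],
])
print(round(ordinal_alpha(R), 3))  # → 0.789
```

Estimating the polychoric correlations themselves requires a dedicated routine (e.g., in an IRT or psychometrics package); this sketch only shows the aggregation step that distinguishes ordinal alpha from ordinary standardized alpha.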
Peer reviewed
Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019
Chalmers recently published a critique of the use of ordinal α (alpha), proposed in Zumbo et al., as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…
Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models
Peer reviewed
France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015
Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…
Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory
Peer reviewed
DeMars, Christine E. – Educational and Psychological Measurement, 2008
The graded response (GR) and generalized partial credit (GPC) models do not imply that examinees ordered by raw observed score will necessarily be ordered on the expected value of the latent trait (OEL). Factors were manipulated to assess whether increased violations of OEL also produced increased Type I error rates in differential item…
Descriptors: Test Items, Raw Scores, Test Theory, Error of Measurement
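For orientation on the graded response (GR) model this entry examines: under Samejima's GR model, the cumulative probability of responding in category k or above is a logistic function of ability, and category probabilities are differences of adjacent cumulative curves. A minimal sketch (all parameter values below are hypothetical):

```python
import math

def grm_category_probs(theta, a, b):
    """Category response probabilities under the graded response model:
    P(X >= k) = logistic(a * (theta - b_k)) for ordered thresholds b,
    with P(X = k) given by differences of adjacent cumulative curves."""
    cum = [1.0]  # P(X >= lowest category) is always 1
    cum += [1.0 / (1.0 + math.exp(-a * (theta - bk))) for bk in b]
    cum.append(0.0)  # P(X >= one past the highest category) is 0
    return [cum[k] - cum[k + 1] for k in range(len(b) + 1)]

# 4-category item with discrimination 1.5 and thresholds -1, 0, 1
probs = grm_category_probs(theta=0.0, a=1.5, b=[-1.0, 0.0, 1.0])
print([round(p, 3) for p in probs])  # → [0.182, 0.318, 0.318, 0.182]
```

Note that the ordering issue the abstract studies (raw observed score versus expected latent trait, OEL) concerns aggregate test scores, not this single-item response function.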
Peer reviewed
Raju, Nambury S.; Oshima, T.C. – Educational and Psychological Measurement, 2005
Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…
Descriptors: Item Response Theory, Test Reliability, Evaluation Methods, Computation
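For reference, the classical Spearman-Brown prophecy formula, which the abstract notes coincides with one of the two proposed IRT-based formulas, can be sketched as follows (the example values are hypothetical):

```python
def spearman_brown(rho, n):
    """Predicted reliability when a test is lengthened (n > 1) or
    shortened (0 < n < 1) by a factor of n, assuming the added or
    removed items are parallel to the originals:
    rho_new = n * rho / (1 + (n - 1) * rho)."""
    return n * rho / (1 + (n - 1) * rho)

# doubling a test whose reliability is 0.70
print(round(spearman_brown(0.70, 2), 3))  # → 0.824
```

The article's contribution is IRT-based analogues of this formula; the classical version shown here depends only on the parallel-items assumption, not on any IRT model.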
Peer reviewed
Kingma, Johannes; Van Den Bos, Kees P. – Educational and Psychological Measurement, 1987
The package described contains fifteen FORTRAN 77 programs: three for each of five forgetting models, with 5, 7, 8, 9, and 10 parameters, respectively. The programs compute parameter estimates and test them both between and within experimental conditions. (Author/GDC)
Descriptors: Computer Software, Educational Experiments, Hypothesis Testing, Learning
Peer reviewed
Masters, Geofferey N. – Educational and Psychological Measurement, 1984
DICOT, a computer program for the Rasch analysis of classroom tests, is described. Results are presented in a self-explanatory form. Person ability and item difficulty estimates are expressed in a familiar metric. Person and item fit statistics provide a diagnosis of individual children and identification of problematic items. (Author/DWH)
Descriptors: Classroom Techniques, Foreign Countries, Item Analysis, Latent Trait Theory
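DICOT itself is not reproduced here, but the dichotomous Rasch model it implements can be sketched in a few lines: the probability of a correct response depends only on the difference between person ability and item difficulty (the example values are hypothetical):

```python
import math

def rasch_prob(theta, b):
    """Probability of a correct response under the dichotomous Rasch
    model, for person ability theta and item difficulty b (in logits):
    P = exp(theta - b) / (1 + exp(theta - b))."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

# a person located exactly at the item's difficulty succeeds half the time
print(rasch_prob(0.0, 0.0))  # → 0.5
```

Estimating the ability and difficulty parameters from classroom response data, which is what DICOT does, requires an iterative fitting procedure on top of this response function.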