ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	13

Descriptor

Computation	14
Item Response Theory	14
Test Theory	14
Test Items	6
Comparative Analysis	5
Scores	5
Error of Measurement	4
Statistical Analysis	4
College Entrance Examinations	2
Correlation	2
Difficulty Level	2
Elementary School Students	2
Evaluation Methods	2
Foreign Countries	2
Goodness of Fit	2
Grade 4	2
Grade 5	2
Grade 6	2
Grade 7	2
Grade 8	2
Likert Scales	2
Models	2
Monte Carlo Methods	2
Reading Comprehension	2
Reading Tests	2
More ▼

Source

Applied Psychological…	3
Educational and Psychological…	3
ACT, Inc.	1
Annenberg Institute for…	1
Applied Measurement in…	1
Behavioral Research and…	1
International Journal of…	1
Practical Assessment,…	1
ProQuest LLC	1
Research Papers in Education	1

Publication Type

Journal Articles	10
Reports - Research	7
Reports - Evaluative	4
Reports - Descriptive	2
Dissertations/Theses -…	1
Numerical/Quantitative Data	1

Education Level

Elementary Education	4
Secondary Education	3
Grade 4	2
Grade 5	2
Grade 6	2
Grade 7	2
Grade 8	2
Higher Education	2
Junior High Schools	2
Middle Schools	2
Postsecondary Education	2
Early Childhood Education	1
Grade 2	1
Grade 3	1
Intermediate Grades	1
Primary Education	1
More ▼

Audience

Location

Australia	1
Colorado	1
Florida	1
New York	1
North Carolina	1
Tennessee	1
Texas	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Law School Admission Test	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Rasch Measurement v. Item Response Theory: Knowing When to Cross the Line

Peer reviewed
PDF on ERIC

Download full text

Stemler, Steven E.; Naples, Adam – Practical Assessment, Research & Evaluation, 2021

When students receive the same score on a test, does that mean they know the same amount about the topic? The answer to this question is more complex than it may first appear. This paper compares classical and modern test theories in terms of how they estimate student ability. Crucial distinctions between the aims of Rasch Measurement and IRT are…

Descriptors: Item Response Theory, Test Theory, Ability, Computation

Estimating Treatment Effects with the Explanatory Item Response Model. EdWorkingPaper No. 22-677

Download full text

Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2022

This simulation study examines the characteristics of the Explanatory Item Response Model (EIRM) when estimating treatment effects when compared to classical test theory (CTT) sum and mean scores and item response theory (IRT)-based theta scores. Results show that the EIRM and IRT theta scores provide generally equivalent bias and false positive…

Descriptors: Item Response Theory, Models, Test Theory, Computation

On True Score Evaluation Using Item Response Theory Modeling

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019

Building on prior research on the relationships between key concepts in item response theory and classical test theory, this note contributes to highlighting their important and useful links. A readily and widely applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the individual person…

Descriptors: True Scores, Item Response Theory, Test Items, Test Theory

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study, determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Direct link

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

Quantifying Local, Response Dependence between Two Polytomous Items Using the Rasch Model

Peer reviewed

Direct link

Andrich, David; Humphry, Stephen M.; Marais, Ida – Applied Psychological Measurement, 2012

Models of modern test theory imply statistical independence among responses, generally referred to as "local independence." One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation as a process in the dichotomous Rasch model,…

Descriptors: Test Theory, Models, Item Response Theory, Evidence

Problems in Estimating Composite Reliability of "Unitised" Assessments

Peer reviewed

Direct link

Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013

This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…

Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Direct link

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Using IRT Trait Estimates versus Summated Scores in Predicting Outcomes

Peer reviewed

Direct link

Xu, Ting; Stone, Clement A. – Educational and Psychological Measurement, 2012

It has been argued that item response theory trait estimates should be used in analyses rather than number right (NR) or summated scale (SS) scores. Thissen and Orlando postulated that IRT scaling tends to produce trait estimates that are linearly related to the underlying trait being measured. Therefore, IRT trait estimates can be more useful…

Descriptors: Educational Research, Monte Carlo Methods, Measures (Individuals), Item Response Theory

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Direct link

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification

Quantifying Response Dependence between Two Dichotomous Items Using the Rasch Model

Peer reviewed

Direct link

Andrich, David; Kreiner, Svend – Applied Psychological Measurement, 2010

Descriptors: Test Theory, Item Response Theory, Test Items, Correlation

Two Prophecy Formulas for Assessing the Reliability of Item Response Theory-Based Ability Estimates

Peer reviewed

Direct link

Raju, Nambury S.; Oshima, T.C. – Educational and Psychological Measurement, 2005

Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…

Descriptors: Item Response Theory, Test Reliability, Evaluation Methods, Computation

Instrument Development Procedures for Maze Measures. Technical Report # 08-06

Download full text

Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to document the instrument development of maze measures for grades 3-8. Each maze passage contained twelve omitted words that students filled in by choosing the best-fit word from among the provided options. In this technical report, we describe the process of creating, reviewing, and pilot testing the maze measures.…

Descriptors: Test Construction, Cloze Procedure, Multiple Choice Tests, Reading Tests

Andrich, David	2
Beretvas, S. Natasha	1
Bramley, Tom	1
Cui, Zhongmin	1
Culpepper, Steven Andrew	1
Deng, Nina	1
Dhawan, Vikas	1
Dimitrov, Dimiter M.	1
Fang, Yu	1
Harrison, Michael	1
Humphry, Stephen M.	1
Joshua B. Gilbert	1
Ketterlin-Geller, Leanne R.	1
Kogar, Hakan	1
Kreiner, Svend	1
Liu, Kimy	1
Marais, Ida	1
Marcoulides, George A.	1
Murphy, Daniel L.	1
Naples, Adam	1
Oshima, T.C.	1
Raju, Nambury S.	1
Raykov, Tenko	1
Stemler, Steven E.	1
Stone, Clement A.	1
More ▼