Publication Date
In 2025: 2
Since 2024: 5
Since 2021 (last 5 years): 18
Since 2016 (last 10 years): 48
Since 2006 (last 20 years): 79
Source
Educational Measurement: Issues and Practice: 79
Showing 1 to 15 of 79 results
Peer reviewed
Stella Y. Kim; Sungyeun Kim – Educational Measurement: Issues and Practice, 2025
This study presents several multivariate generalizability theory designs for analyzing test forms built through automatic item generation (AIG). The study used real data to illustrate the analysis procedure and to discuss practical considerations. Data were collected from two groups of students, each receiving a different AIG-generated form. A…
Descriptors: Generalizability Theory, Automation, Test Items, Students
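For context, a univariate person-by-item (p x i) G-study decomposes observed-score variance into person, item, and residual components, from which the generalizability coefficient for a form of n_i items follows. This is the standard univariate decomposition, given for orientation only; the study itself develops multivariate designs:

\sigma^2(X_{pi}) = \sigma^2(p) + \sigma^2(i) + \sigma^2(pi,e), \qquad E\rho^2 = \frac{\sigma^2(p)}{\sigma^2(p) + \sigma^2(pi,e)/n_i}

In the multivariate case, each AIG form or content category carries its own universe score, with covariance components linking them.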
Peer reviewed
Ye Ma; Deborah J. Harris – Educational Measurement: Issues and Practice, 2025
Item position effect (IPE) refers to situations in which an item performs differently depending on the position in which it is administered on a test. Most previous research has investigated IPE under linear testing; IPE under adaptive testing remains largely unexamined. In addition, the existence of IPE might violate Item…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items
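To make the construct concrete, IPE is often operationalized as drift in an item's effective difficulty with administered position. A minimal sketch under a 2PL; the linear shift delta is a hypothetical illustration, not a quantity from the article:

import numpy as np

def p_correct(theta, a, b, position, delta=0.01):
    # 2PL response probability with a linear item position effect:
    # the item's effective difficulty grows with its position.
    b_eff = b + delta * position
    return 1.0 / (1.0 + np.exp(-a * (theta - b_eff)))

# The same item looks harder at position 40 than at position 1.
print(p_correct(theta=0.0, a=1.2, b=0.0, position=1))
print(p_correct(theta=0.0, a=1.2, b=0.0, position=40))

Under adaptive testing this matters because item selection assumes position-invariant item parameters.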
Peer reviewed
Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024
Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…
Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)
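For reference, in the traditional 4PL model, guessing enters as a lower asymptote c and slipping as a reduced upper asymptote d < 1, both treated as random events; the 3PL is the special case d = 1:

P(X = 1 \mid \theta) = c + (d - c)\,\frac{1}{1 + e^{-a(\theta - b)}}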
Peer reviewed
Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022
In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction that only a single ability is assessed. This module discusses potential sources of multidimensionality…
Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods
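As background, the compensatory multidimensional 2PL that typically underlies MIRT equating replaces the single ability with a weighted composite, so strength on one dimension can offset weakness on another (generic two-dimensional form, not necessarily the module's parameterization):

P(X = 1 \mid \theta_1, \theta_2) = \frac{1}{1 + \exp[-(a_1\theta_1 + a_2\theta_2 + d)]}

Unidimensional IRT equating is the special case in which one discrimination, a_2, is fixed at zero.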
Peer reviewed
Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022
The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…
Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level
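Under classical test theory, the two quantities in question are the item p-value (difficulty) and the corrected item-total correlation (discrimination). A minimal sketch of how their empirical association can be inspected, using simulated 2PL data rather than the study's data:

import numpy as np

rng = np.random.default_rng(0)
n_persons, n_items = 500, 30
theta = rng.normal(size=n_persons)
a = rng.uniform(0.5, 2.0, n_items)
b = rng.uniform(-2.0, 2.0, n_items)
prob = 1 / (1 + np.exp(-a * (theta[:, None] - b)))
resp = (rng.random((n_persons, n_items)) < prob).astype(int)

p_values = resp.mean(axis=0)  # CTT difficulty
total = resp.sum(axis=1)
discrim = np.array([np.corrcoef(resp[:, j], total - resp[:, j])[0, 1]
                    for j in range(n_items)])  # corrected item-total r

print(np.corrcoef(p_values, discrim)[0, 1])  # difficulty-discrimination link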
Peer reviewed
Daniel Murphy; Sarah Quesen; Matthew Brunetti; Quintin Love – Educational Measurement: Issues and Practice, 2024
Categorical growth models describe examinee growth in terms of performance-level category transitions, which implies that some percentage of examinees will be misclassified. This paper introduces a new procedure for estimating the classification accuracy of categorical growth models, based on Rudner's classification accuracy index for item…
Descriptors: Classification, Growth Models, Accuracy, Performance Based Assessment
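Rudner's index, which the new procedure extends, estimates accuracy from each examinee's score estimate and conditional standard error: the probability that the examinee's true score falls in the assigned category. A sketch of the single-administration version, assuming normally distributed error; the growth-category extension is the paper's contribution and is not reproduced here:

import numpy as np
from scipy.stats import norm

def rudner_accuracy(theta_hat, se, cuts):
    # Expected classification accuracy, Rudner-style: for each examinee,
    # P(true score in the observed category), with theta ~ N(theta_hat, se).
    bounds = np.concatenate(([-np.inf], cuts, [np.inf]))
    cat = np.searchsorted(cuts, theta_hat)  # observed performance level
    p_same = (norm.cdf(bounds[cat + 1], theta_hat, se)
              - norm.cdf(bounds[cat], theta_hat, se))
    return p_same.mean()

theta_hat = np.random.default_rng(1).normal(size=1000)
print(rudner_accuracy(theta_hat, se=0.3, cuts=np.array([-0.5, 0.8])))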
Peer reviewed
Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023
The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…
Descriptors: Item Response Theory, Standard Setting, Testing, Sampling
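For context, in a mixture Rasch model the Rasch model holds within each latent class g, with class-specific item difficulties b_{jg} (generic form, not necessarily the authors' exact specification):

P(X_{ij} = 1 \mid \theta_i, g) = \frac{\exp(\theta_i - b_{jg})}{1 + \exp(\theta_i - b_{jg})}

A passing standard can then be anchored to estimated class membership rather than to a judgmentally chosen cut score.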
Peer reviewed
Tsigilis, Nikolaos; Krousorati, Katerina; Gregoriadis, Athanasios; Grammatikopoulos, Vasilis – Educational Measurement: Issues and Practice, 2023
The Preschool Early Numeracy Skills Test--Brief Version (PENS-B) is a measure of early numeracy skills, developed and mainly used in the United States. The purpose of this study was to examine the factorial validity and measurement invariance across gender of PENS-B in the Greek educational context. PENS-B was administered to 906 preschool…
Descriptors: Psychometrics, Preschool Education, Numeracy, Item Response Theory
Peer reviewed
Xiao, Yue; Veldkamp, Bernard; Liu, Hongyun – Educational Measurement: Issues and Practice, 2022
The action sequences of respondents in problem-solving tasks reflect rich and detailed information about their performance, including differences in problem-solving ability, even if item scores are equal. It is therefore not sufficient to infer individual problem-solving skills based solely on item scores. This study is a preliminary attempt to…
Descriptors: Problem Solving, Item Response Theory, Scores, Item Analysis
Peer reviewed
Cuhadar, Ismail; Binici, Salih – Educational Measurement: Issues and Practice, 2022
This study employs the 4-parameter logistic item response theory model to account for the unexpected incorrect responses, or slipping effects, observed in a large-scale Algebra 1 End-of-Course assessment that includes several innovative item formats. It investigates whether modeling the misfit at the upper asymptote has any practical impact on the…
Descriptors: Item Response Theory, Measurement, Student Evaluation, Algebra
Peer reviewed
Casabianca, Jodi M. – Educational Measurement: Issues and Practice, 2021
Module Overview: In this digital ITEMS module, Dr. Jodi M. Casabianca provides a primer on the "hierarchical rater model" (HRM) framework and the recent expansions to the model for analyzing raters and ratings of constructed responses. In the first part of the module, she establishes an understanding of the nature of constructed…
Descriptors: Hierarchical Linear Modeling, Rating Scales, Error of Measurement, Item Response Theory
Peer reviewed
Kang, Hosun; Furtak, Erin M. – Educational Measurement: Issues and Practice, 2021
Despite increasing awareness about the role of classroom assessments in perpetuating educational inequities, the research community continues to struggle with how to support teachers to design and use classroom assessments for achieving equity. In response to recent calls to better connect learning theory to the design of classroom assessments, we…
Descriptors: Learning Theories, Student Evaluation, Equal Education, Educational Research
Peer reviewed
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment
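The ipsativity problem noted in the abstract is easy to see under conventional rank-based MFC scoring: every respondent's trait scores sum to the same constant, so the scores carry only within-person information. A toy illustration; the block design and trait count are invented for the example:

import numpy as np

# Conventional MFC scoring: in each block of 3 statements (one per trait),
# the respondent ranks them, and each trait earns 2/1/0 points.
rng = np.random.default_rng(2)
n_people, n_blocks, n_traits = 4, 10, 3
scores = np.zeros((n_people, n_traits))
for person in range(n_people):
    for _ in range(n_blocks):
        scores[person] += rng.permutation(n_traits)  # random ranking

print(scores.sum(axis=1))  # identical row sums: the data are ipsative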
Peer reviewed
Zhan, Peida; He, Keren – Educational Measurement: Issues and Practice, 2021
In learning diagnostic assessments, the attribute hierarchy specifies a sequential network of interrelated attribute-mastery processes, keeping the test blueprint consistent with the underlying cognitive theory. One of the most important functions of an attribute hierarchy is to guide or constrain the direction of students' development and thereby form a…
Descriptors: Longitudinal Studies, Models, Comparative Analysis, Diagnostic Tests
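To make the guiding function concrete: under a linear hierarchy A1 -> A2 -> A3, an attribute can be mastered only after its prerequisite, which shrinks the set of permissible mastery profiles from 2^3 = 8 to 4. A small sketch with an illustrative hierarchy:

from itertools import product

# prereq[k] lists the attributes that must be mastered before attribute k
prereq = {0: [], 1: [0], 2: [1]}  # linear hierarchy A1 -> A2 -> A3

def permissible(pattern):
    return all(all(pattern[p] for p in prereq[k])
               for k, mastered in enumerate(pattern) if mastered)

patterns = [p for p in product([0, 1], repeat=3) if permissible(p)]
print(patterns)  # [(0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 1, 1)]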
Peer reviewed
Leventhal, Brian; Ames, Allison – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Brian Leventhal and Dr. Allison Ames provide an overview of "Monte Carlo simulation studies" (MCSS) in "item response theory" (IRT). MCSS are utilized for a variety of reasons, one of the most compelling being that they can be used when analytic solutions are impractical or nonexistent because…
Descriptors: Item Response Theory, Monte Carlo Methods, Simulation, Test Items
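The basic MCSS recipe in IRT is to fix generating parameters, simulate response data, estimate, and summarize recovery over replications. A minimal sketch for Rasch difficulty recovery; the logit-of-p-value estimator below is a crude stand-in for a real calibration routine:

import numpy as np

rng = np.random.default_rng(3)
n_persons, n_items, n_reps = 2000, 20, 50
b_true = np.linspace(-2, 2, n_items)

est = np.empty((n_reps, n_items))
for r in range(n_reps):
    theta = rng.normal(size=n_persons)
    prob = 1 / (1 + np.exp(-(theta[:, None] - b_true)))
    resp = (rng.random((n_persons, n_items)) < prob).astype(int)
    p = resp.mean(axis=0)
    est[r] = -np.log(p / (1 - p))  # crude difficulty proxy per replication

bias = (est - b_true).mean(axis=0)
rmse = np.sqrt(((est - b_true) ** 2).mean(axis=0))
print(np.round(bias, 2))
print(np.round(rmse, 2))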