ERIC - Search Results

Publication Date

In 2025	4
Since 2024	21
Since 2021 (last 5 years)	36
Since 2016 (last 10 years)	75
Since 2006 (last 20 years)	142

Descriptor

Models	194
Item Response Theory	89
Test Items	53
Simulation	48
Scores	27
Comparative Analysis	24
Computation	24
Test Bias	24
Psychometrics	22
Evaluation Methods	21
Goodness of Fit	21
Statistical Analysis	21
Accuracy	20
Bayesian Statistics	18
Measurement	18
Measurement Techniques	18
Reaction Time	18
Test Construction	18
Achievement Tests	17
Error of Measurement	17
Bias	16
Difficulty Level	13
Foreign Countries	13
Item Analysis	13
Cognitive Measurement	12
More ▼

Source

Journal of Educational…

194

Publication Type

Journal Articles	174
Reports - Research	109
Reports - Evaluative	40
Reports - Descriptive	22
Book/Product Reviews	2
Speeches/Meeting Papers	2
Information Analyses	1

Education Level

Secondary Education	13
Higher Education	6
Postsecondary Education	6
Middle Schools	3
Elementary Secondary Education	2
High Schools	2
Junior High Schools	2
Elementary Education	1
Grade 10	1
Grade 8	1
Grade 9	1
More ▼

Audience

Researchers

Location

Hong Kong	2
Belgium	1
China	1
Turkey	1
United Kingdom (England)	1

Laws, Policies, & Programs

Defunis v Odegaard	1
Race to the Top	1

Assessments and Surveys

Program for International…	9
Graduate Record Examinations	1
Iowa Tests of Educational…	1
Metropolitan Achievement Tests	1
National Assessment of…	1
National Longitudinal Study…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 194 results Save | Export

Theory-Driven IRT Modeling of Vocabulary Development: Matthew Effects and the Case for Unipolar IRT

Peer reviewed

Direct link

Qi Huang; Daniel M. Bolt; Xiangyi Liao – Journal of Educational Measurement, 2025

Item response theory (IRT) encompasses a broader class of measurement models than is commonly appreciated by practitioners in educational measurement. For measures of vocabulary and its development, we show how psychological theory might in certain instances support unipolar IRT modeling as a superior alternative to the more traditional bipolar…

Descriptors: Educational Theories, Item Response Theory, Vocabulary Development, Models

An Item Response Tree Model for Items with Multiple-Choice and Constructed-Response Parts

Peer reviewed

Direct link

Junhuan Wei; Qin Wang; Buyun Dai; Yan Cai; Dongbo Tu – Journal of Educational Measurement, 2024

Traditional IRT and IRTree models are not appropriate for analyzing the item that simultaneously consists of multiple-choice (MC) task and constructed-response (CR) task in one item. To address this issue, this study proposed an item response tree model (called as IRTree-MR) to accommodate items that contain different response types at different…

Descriptors: Item Response Theory, Models, Multiple Choice Tests, Cognitive Processes

Controlling the Speededness of Assembled Test Forms: A Generalization to the Three-Parameter Lognormal Response Time Model

Peer reviewed

Direct link

Becker, Benjamin; Weirich, Sebastian; Goldhammer, Frank; Debeer, Dries – Journal of Educational Measurement, 2023

When designing or modifying a test, an important challenge is controlling its speededness. To achieve this, van der Linden (2011a, 2011b) proposed using a lognormal response time model, more specifically the two-parameter lognormal model, and automated test assembly (ATA) via mixed integer linear programming. However, this approach has a severe…

Descriptors: Test Construction, Automation, Models, Test Items

Modeling Hierarchical Attribute Structures in Diagnostic Classification Models with Multiple Attempts

Peer reviewed

Direct link

Tae Yeon Kwon; A. Corinne Huggins-Manley; Jonathan Templin; Mingying Zheng – Journal of Educational Measurement, 2024

In classroom assessments, examinees can often answer test items multiple times, resulting in sequential multiple-attempt data. Sequential diagnostic classification models (DCMs) have been developed for such data. As student learning processes may be aligned with a hierarchy of measured traits, this study aimed to develop a sequential hierarchical…

Descriptors: Classification, Accuracy, Student Evaluation, Sequential Approach

Model Selection Posterior Predictive Model Checking via Limited-Information Indices for Bayesian Diagnostic Classification Modeling

Peer reviewed

Direct link

Jihong Zhang; Jonathan Templin; Xinya Liang – Journal of Educational Measurement, 2024

Recently, Bayesian diagnostic classification modeling has been becoming popular in health psychology, education, and sociology. Typically information criteria are used for model selection when researchers want to choose the best model among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…

Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification

Cognitive Diagnostic Multistage Testing by Partitioning Hierarchically Structured Attributes

Peer reviewed

Direct link

Kim, Rae Yeong; Yoo, Yun Joo – Journal of Educational Measurement, 2023

In cognitive diagnostic models (CDMs), a set of fine-grained attributes is required to characterize complex problem solving and provide detailed diagnostic information about an examinee. However, it is challenging to ensure reliable estimation and control computational complexity when The test aims to identify the examinee's attribute profile in a…

Descriptors: Models, Diagnostic Tests, Adaptive Testing, Accuracy

A Deterministic Gated Lognormal Response Time Model to Identify Examinees with Item Preknowledge

Peer reviewed

Direct link

Kasli, Murat; Zopluoglu, Cengiz; Toton, Sarah L. – Journal of Educational Measurement, 2023

Response times (RTs) have recently attracted a significant amount of attention in the literature as they may provide meaningful information about item preknowledge. In this study, a new model, the Deterministic Gated Lognormal Response Time (DG-LNRT) model, is proposed to identify examinees with item preknowledge using RTs. The proposed model was…

Descriptors: Reaction Time, Test Items, Models, Familiarity

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

A Bayesian Moderated Nonlinear Factor Analysis Approach for DIF Detection under Violation of the Equal Variance Assumption

Peer reviewed

Direct link

Sooyong Lee; Suhwa Han; Seung W. Choi – Journal of Educational Measurement, 2024

Research has shown that multiple-indicator multiple-cause (MIMIC) models can result in inflated Type I error rates in detecting differential item functioning (DIF) when the assumption of equal latent variance is violated. This study explains how the violation of the equal variance assumption adversely impacts the detection of nonuniform DIF and…

Descriptors: Factor Analysis, Bayesian Statistics, Test Bias, Item Response Theory

A Dual-Purpose Model for Binary Data: Estimating Ability and Misconceptions

Peer reviewed

Direct link

Wenchao Ma; Miguel A. Sorrel; Xiaoming Zhai; Yuan Ge – Journal of Educational Measurement, 2024

Most existing diagnostic models are developed to detect whether students have mastered a set of skills of interest, but few have focused on identifying what scientific misconceptions students possess. This article developed a general dual-purpose model for simultaneously estimating students' overall ability and the presence and absence of…

Descriptors: Models, Misconceptions, Diagnostic Tests, Ability

Generating Models for Item Preknowledge

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2022

Detection methods for item preknowledge are often evaluated in simulation studies where models are used to generate the data. To ensure the reliability of such methods, it is crucial that these models are able to accurately represent situations that are encountered in practice. The purpose of this article is to provide a critical analysis of…

Descriptors: Prior Learning, Simulation, Models, Reaction Time

Using Item Scores and Distractors in Person-Fit Assessment

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023

In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…

Descriptors: Test Items, Scores, Goodness of Fit, Statistics

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Expanding the Lognormal Response Time Model Using Profile Similarity Metrics to Improve the Detection of Anomalous Testing Behavior

Peer reviewed

Direct link

Gregory M. Hurtz; Regi Mucino – Journal of Educational Measurement, 2024

The Lognormal Response Time (LNRT) model measures the speed of test-takers relative to the normative time demands of items on a test. The resulting speed parameters and model residuals are often analyzed for evidence of anomalous test-taking behavior associated with fast and poorly fitting response time patterns. Extending this model, we…

Descriptors: Student Reaction, Reaction Time, Response Style (Tests), Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13

Wang, Wen-Chung	9
de la Torre, Jimmy	6
Jin, Kuan-Yu	4
Bolt, Daniel M.	3
DeCarlo, Lawrence T.	3
Debeer, Dries	3
Hambleton, Ronald K.	3
Janssen, Rianne	3
Jiao, Hong	3
Lee, Won-Chan	3
Mislevy, Robert J.	3
Nandakumar, Ratna	3
Novick, Melvin R.	3
Roussos, Louis A.	3
Suh, Youngsuk	3
Wilson, Mark	3
Wind, Stefanie A.	3
Wollack, James A.	3
Amanda Goodwin	2
Clauser, Brian E.	2
De Boeck, Paul	2
DeMars, Christine E.	2
Forsyth, Robert A.	2
Gierl, Mark J.	2
More ▼