ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	13

Descriptor

Data Analysis	14
Simulation	13
Item Response Theory	8
Models	8
Test Items	6
Evaluation Methods	4
Comparative Analysis	3
Maximum Likelihood Statistics	3
Algorithms	2
Computation	2
Equated Scores	2
Error of Measurement	2
Hypothesis Testing	2
Learning Processes	2
Measurement	2
Middle School Students	2
Multidimensional Scaling	2
Regression (Statistics)	2
Test Bias	2
Test Results	2
Tests	2
Ability	1
Academic Aspiration	1
Accuracy	1
Achievement Tests	1
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	14
Reports - Research	10
Reports - Evaluative	4

Education Level

Middle Schools	2
Junior High Schools	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 14 results Save | Export

A Dual-Purpose Model for Binary Data: Estimating Ability and Misconceptions

Peer reviewed

Direct link

Wenchao Ma; Miguel A. Sorrel; Xiaoming Zhai; Yuan Ge – Journal of Educational Measurement, 2024

Most existing diagnostic models are developed to detect whether students have mastered a set of skills of interest, but few have focused on identifying what scientific misconceptions students possess. This article developed a general dual-purpose model for simultaneously estimating students' overall ability and the presence and absence of…

Descriptors: Models, Misconceptions, Diagnostic Tests, Ability

Modeling Nonlinear Effects of Person-by-Item Covariates in Explanatory Item Response Models: Exploratory Plots and Modeling Using Smooth Functions

Peer reviewed

Direct link

Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Journal of Educational Measurement, 2024

Explanatory item response models (EIRMs) have been applied to investigate the effects of person covariates, item covariates, and their interactions in the fields of reading education and psycholinguistics. In practice, it is often assumed that the relationships between the covariates and the logit transformation of item response probability are…

Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics

Examining Differential Rater Functioning Using a Between-Subgroup Outfit Approach

Peer reviewed

Direct link

Wind, Stefanie A.; Sebok-Syer, Stefanie S. – Journal of Educational Measurement, 2019

When practitioners use modern measurement models to evaluate rating quality, they commonly examine rater fit statistics that summarize how well each rater's ratings fit the expectations of the measurement model. Essentially, this approach involves examining the unexpected ratings that each misfitting rater assigned (i.e., carrying out analyses of…

Descriptors: Measurement, Models, Evaluators, Simulation

Scale Alignment in Between-Item Multidimensional Rasch Models

Peer reviewed

Direct link

Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019

Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…

Descriptors: Item Response Theory, Models, Scores, Comparative Analysis

Detection of Differential Item Functioning with Nonlinear Regression: A Non-IRT Approach Accounting for Guessing

Peer reviewed

Direct link

Drabinová, Adéla; Martinková, Patrícia – Journal of Educational Measurement, 2017

In this article we present a general approach not relying on item response theory models (non-IRT) to detect differential item functioning (DIF) in dichotomous items with presence of guessing. The proposed nonlinear regression (NLR) procedure for DIF detection is an extension of method based on logistic regression. As a non-IRT approach, NLR can…

Descriptors: Test Items, Regression (Statistics), Guessing (Tests), Identification

Parameter Estimation in Rasch Models for Examinee-Selected Items

Peer reviewed

Direct link

Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017

The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set of items (e.g., choose one item to respond from a pair of items), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis

A Stepwise Test Characteristic Curve Method to Detect Item Parameter Drift

Peer reviewed

Direct link

Guo, Rui; Zheng, Yi; Chang, Hua-Hua – Journal of Educational Measurement, 2015

An important assumption of item response theory is item parameter invariance. Sometimes, however, item parameters are not invariant across different test administrations due to factors other than sampling error; this phenomenon is termed item parameter drift. Several methods have been developed to detect drifted items. However, most of the…

Descriptors: Item Response Theory, Test Items, Evaluation Methods, Equated Scores

Lord's Wald Test for Detecting Dif in Multidimensional Irt Models: A Comparison of Two Estimation Approaches

Peer reviewed

Direct link

Lee, Soo; Suh, Youngsuk – Journal of Educational Measurement, 2018

Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…

Descriptors: Item Response Theory, Sample Size, Models, Error of Measurement

Optimal Bandwidth Selection in Observed-Score Kernel Equating

Peer reviewed

Direct link

Häggström, Jenny; Wiberg, Marie – Journal of Educational Measurement, 2014

The selection of bandwidth in kernel equating is important because it has a direct impact on the equated test scores. The aim of this article is to examine the use of double smoothing when selecting bandwidths in kernel equating and to compare double smoothing with the commonly used penalty method. This comparison was made using both an equivalent…

Descriptors: Equated Scores, Data Analysis, Comparative Analysis, Simulation

Structured Constructs Models Based on Change-Point Analysis

Peer reviewed

Direct link

Shin, Hyo Jeong; Wilson, Mark; Choi, In-Hee – Journal of Educational Measurement, 2017

This study proposes a structured constructs model (SCM) to examine measurement in the context of a multidimensional learning progression (LP). The LP is assumed to have features that go beyond a typical multidimentional IRT model, in that there are hypothesized to be certain cross-dimensional linkages that correspond to requirements between the…

Descriptors: Middle School Students, Student Evaluation, Measurement Techniques, Learning Processes

Modeling Data from Collaborative Assessments: Learning in Digital Interactive Social Networks

Peer reviewed

Direct link

Wilson, Mark; Gochyyev, Perman; Scalise, Kathleen – Journal of Educational Measurement, 2017

This article summarizes assessment of cognitive skills through collaborative tasks, using field test results from the Assessment and Teaching of 21st Century Skills (ATC21S) project. This project, sponsored by Cisco, Intel, and Microsoft, aims to help educators around the world enable students with the skills to succeed in future career and…

Descriptors: Cognitive Ability, Thinking Skills, Evaluation Methods, Educational Assessment

A Nested Logit Approach for Investigating Distractors as Causes of Differential Item Functioning

Peer reviewed

Direct link

Suh, Youngsuk; Bolt, Daniel M. – Journal of Educational Measurement, 2011

In multiple-choice items, differential item functioning (DIF) in the correct response may or may not be caused by differentially functioning distractors. Identifying distractors as causes of DIF can provide valuable information for potential item revision or the design of new test items. In this paper, we examine a two-step approach based on…

Descriptors: Test Items, Test Bias, Multiple Choice Tests, Simulation

Testing Features of Graphical DIF: Application of a Regression Correction to Three Nonparametric Statistical Tests

Peer reviewed

Direct link

Bolt, Daniel M.; Gierl, Mark J. – Journal of Educational Measurement, 2006

Inspection of differential item functioning (DIF) in translated test items can be informed by graphical comparisons of item response functions (IRFs) across translated forms. Due to the many forms of DIF that can emerge in such analyses, it is important to develop statistical tests that can confirm various characteristics of DIF when present.…

Descriptors: Regression (Statistics), Tests, Test Bias, Test Items

On the Dimensionality of Achievement Test Data.

Peer reviewed

Birenbaum, Menucha; Tatsuoka, Kikumi – Journal of Educational Measurement, 1982

Empirical results from two studies--a simulation study and an experimental one--indicated that, in achievement data of the problem-solving type where a specific subject matter area is being tested, the greater the variety of the algorithms used, the higher the dimensionality of the test data. (Author/PN)

Descriptors: Achievement Tests, Algorithms, Data Analysis, Factor Structure

Wilson, Mark	3
Bolt, Daniel M.	2
Suh, Youngsuk	2
Amanda Goodwin	1
Birenbaum, Menucha	1
Chang, Hua-Hua	1
Choi, In-Hee	1
Drabinová, Adéla	1
Feuerstahler, Leah	1
Gierl, Mark J.	1
Gochyyev, Perman	1
Guo, Rui	1
Häggström, Jenny	1
Lee, Soo	1
Liu, Chen-Wei	1
Martinková, Patrícia	1
Matthew Naveiras	1
Miguel A. Sorrel	1
Paul De Boeck	1
Scalise, Kathleen	1
Sebok-Syer, Stefanie S.	1
Shin, Hyo Jeong	1
Sun-Joo Cho	1
Tatsuoka, Kikumi	1
Wang, Wen-Chung	1
More ▼