Showing all 12 results
Peer reviewed
Lundgren, Erik – Journal of Educational Data Mining, 2022
Response process data have the potential to provide a rich description of test-takers' thinking processes. However, retrieving insights from these data presents a challenge for educational assessment and educational data mining because the data are complex and not well annotated. The present study addresses this challenge by developing a computational…
Descriptors: Problem Solving, Classification, Accuracy, Foreign Countries
Peer reviewed
Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023
One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…
Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests
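For background on the entry above: one simple, classical IRT linking approach is the mean/mean method, in which the common (link) items are used to place two separate 2PL calibrations on a common scale via a linear transformation. This is offered only as a hedged illustration of the general idea, not as the specific linking methods the authors compare:

\[
\theta_{\text{new}} = A\,\theta_{\text{old}} + B, \qquad a_{i,\text{new}} = \frac{a_{i,\text{old}}}{A}, \qquad b_{i,\text{new}} = A\,b_{i,\text{old}} + B,
\]
\[
A = \frac{\bar a_{\text{old}}}{\bar a_{\text{new}}}, \qquad B = \bar b_{\text{new}} - A\,\bar b_{\text{old}},
\]

where \(\bar a_{\text{old}}, \bar b_{\text{old}}\) and \(\bar a_{\text{new}}, \bar b_{\text{new}}\) are the link items' mean discriminations and difficulties as estimated in the old and new calibrations. Different linking choices propagate differently into trend estimates, which is the kind of effect the study examines.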
Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023
In statistical inference, changepoints are abrupt variations in a sequence of data. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…
Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy
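To make the idea in the entry above concrete, here is a minimal, hypothetical sketch of Bayesian changepoint detection on a single examinee's item-by-item accuracy sequence. It is not the authors' algorithm; it only illustrates comparing the marginal likelihood of "no change" against "change after item tau" under Beta-Bernoulli priors, which is the basic mechanism such detectors build on.

```python
# Minimal Beta-Bernoulli changepoint sketch (illustrative only).
import math

def beta_binom_marginal(successes, trials, a=1.0, b=1.0):
    """Log marginal likelihood of a Bernoulli segment under a Beta(a, b) prior."""
    return (math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
            + math.lgamma(a + successes) + math.lgamma(b + trials - successes)
            - math.lgamma(a + b + trials))

def map_changepoint(responses):
    """Return (tau, log_score) for the best changepoint, or (None, log_score)
    if the no-change model scores higher. Uniform prior over tau is assumed."""
    n = len(responses)
    no_change = beta_binom_marginal(sum(responses), n)
    best_tau, best_lp = None, no_change
    for tau in range(1, n):              # change occurs after item tau
        left, right = responses[:tau], responses[tau:]
        lp = (beta_binom_marginal(sum(left), len(left))
              + beta_binom_marginal(sum(right), len(right)))
        if lp > best_lp:
            best_tau, best_lp = tau, lp
    return best_tau, best_lp

# Example: solution behavior followed by disengaged (near-random) responding.
seq = [1, 1, 1, 0, 1, 1, 1, 0, 0, 1, 0, 0, 0, 1, 0]
print(map_changepoint(seq))
```

A sequential version would update these quantities item by item rather than scanning all candidate tau at once, but the segment-comparison logic is the same.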
Peer reviewed
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
Peer reviewed
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity for detecting misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
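For context on the statistic referenced above, one common form of the RMSD used in large-scale assessments (not necessarily the exact operational definition in the paper) compares, for item i in a given country, the observed and model-implied item characteristic curves, weighted by that country's proficiency density f(θ):

\[
\mathrm{RMSD}_i = \sqrt{\int \bigl(P_i^{\mathrm{obs}}(\theta) - P_i^{\mathrm{model}}(\theta)\bigr)^{2} \, f(\theta)\, d\theta }.
\]

Because the squared discrepancy is weighted by f(θ), misfit in regions of the proficiency scale where few of a country's students are located contributes little to the statistic, which is one way to see why its sensitivity can depend on the country's proficiency distribution.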
Peer reviewed
Rutkowski, David; Rutkowski, Leslie; Liaw, Yuan-Ling – Educational Measurement: Issues and Practice, 2018
Participation in international large-scale assessments has grown over time, with the largest, the Programme for International Student Assessment (PISA), including more than 70 education systems that are economically and educationally diverse. To help accommodate large achievement differences among participants, in 2009 PISA offered…
Descriptors: Educational Assessment, Foreign Countries, Achievement Tests, Secondary School Students
Peer reviewed
Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018
The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package available as of Stata v.14 (2015). Using a simulated data set and a publicly available item response data set extracted from the Programme for International Student Assessment, we review the IRT package from…
Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis
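The entry above reviews Stata's built-in IRT commands; as a language-neutral illustration of the kind of model such packages fit, here is a minimal, hypothetical Python sketch of a 2PL model estimated by marginal maximum likelihood with a fixed quadrature grid over a normal ability prior. All names and settings in the snippet are illustrative, not taken from the article.

```python
# Minimal 2PL marginal maximum likelihood sketch (illustrative only).
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def p_2pl(theta, a, b):
    """2PL item response function: probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def marginal_neg_loglik(params, responses, nodes, weights):
    """Negative marginal log-likelihood, integrating ability over a fixed
    grid of quadrature nodes with normal-density weights."""
    n_items = responses.shape[1]
    a = np.exp(params[:n_items])          # keep discriminations positive
    b = params[n_items:]
    p = p_2pl(nodes[:, None], a, b)       # shape (n_nodes, n_items)
    lik = np.prod(np.where(responses[None, :, :] == 1,
                           p[:, None, :], 1 - p[:, None, :]), axis=2)
    person_lik = weights @ lik            # integrate over ability
    return -np.sum(np.log(person_lik))

rng = np.random.default_rng(0)
true_a, true_b = np.array([1.0, 1.5, 0.8]), np.array([-0.5, 0.0, 0.7])
theta = rng.normal(size=500)
responses = (rng.random((500, 3)) < p_2pl(theta[:, None], true_a, true_b)).astype(int)

nodes = np.linspace(-4, 4, 21)
weights = norm.pdf(nodes); weights /= weights.sum()
start = np.zeros(6)
fit = minimize(marginal_neg_loglik, start,
               args=(responses, nodes, weights), method="BFGS")
print(np.exp(fit.x[:3]), fit.x[3:])      # estimated discriminations and difficulties
```

Dedicated packages add refinements (adaptive quadrature or EM, standard errors, fit statistics), but the underlying likelihood is of this form.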
Peer reviewed
Rutkowski, Leslie; Rutkowski, David; Zhou, Yan – International Journal of Testing, 2016
Using an empirically based simulation study, we show that typically used methods of choosing an item calibration sample have significant impacts on achievement bias and system rankings. We examine whether recent PISA accommodations, especially for lower performing participants, can mitigate some of this bias. Our findings indicate that standard…
Descriptors: Simulation, International Programs, Adolescents, Student Evaluation
Peer reviewed
Debeer, Dries; Janssen, Rianne; De Boeck, Paul – Journal of Educational Measurement, 2017
When dealing with missing responses, two types of omissions can be discerned: items can be skipped or not reached by the test taker. When the occurrence of these omissions is related to the proficiency process, the missingness is nonignorable. The purpose of this article is to present a tree-based IRT framework for modeling responses and omissions…
Descriptors: Item Response Theory, Test Items, Responses, Testing Problems
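A hedged sketch of the general idea behind tree-based IRT (IRTree) models for omissions, which may differ from the specific parameterization in the article: each item response is decomposed into sequential nodes, e.g. first whether the test taker produces a response at all, then, given a response, whether it is correct, and each node gets its own logistic IRT model with its own person parameter:

\[
P(\text{respond}_{pi}) = \frac{\exp(\eta_p - \beta_i)}{1 + \exp(\eta_p - \beta_i)}, \qquad
P(\text{correct}_{pi} \mid \text{respond}_{pi}) = \frac{\exp(\theta_p - b_i)}{1 + \exp(\theta_p - b_i)},
\]

with the response-propensity parameter \(\eta_p\) and the proficiency parameter \(\theta_p\) assumed jointly (e.g., bivariate normally) distributed, so their correlation captures the link between omission behavior and proficiency that makes the missingness nonignorable yet explicitly modeled.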
Peer reviewed
Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016
Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…
Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation
Peer reviewed
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when the data fit the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
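For reference on the quantities the ISI compares (the index's exact formula is not given in the snippet above, so only the underlying functions are sketched here): under the Rasch model, the item response and item information functions for item i are

\[
P_i(\theta) = \frac{\exp(\theta - b_i)}{1 + \exp(\theta - b_i)}, \qquad
I_i(\theta) = P_i(\theta)\bigl(1 - P_i(\theta)\bigr),
\]

and a DIF analysis along these lines would compare \(I_i(\theta)\) estimated separately in the focal and reference groups, flagging items whose information functions diverge.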
Chauncey, Caroline T., Ed. – Harvard Education Press, 2010
"Harvard Education Letter" is published bimonthly at the Harvard Graduate School of Education. This issue of "Harvard Education Letter" contains the following articles: (1) Online Testing, Version 1.0: Oregon's Adaptive Computer-Based Accountability Test Offers a Peek at a Brave New Future (Robert Rothman); (2) Beyond…
Descriptors: Family Programs, Homosexuality, Educational Policy, Sexual Identity