ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	11

Source

Journal of Educational and…	3
Large-scale Assessments in…	2
Education Sciences	1
Educational Sciences: Theory…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational…	1
Sociological Methods &…	1

Publication Type

Journal Articles	10
Reports - Research	10
Reports - Descriptive	1

Education Level

Elementary Secondary Education	10
Secondary Education	5
Junior High Schools	2
Middle Schools	2
Elementary Education	1
Grade 12	1
Grade 4	1
Grade 8	1
High Schools	1

Audience

Location

Armenia	1
Austria	1
Iran	1
Norway	1
Tunisia	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	11
Program for International…	2
Big Five Inventory	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Chance-Constrained Automated Test Assembly

Peer reviewed

Direct link

Giada Spaccapanico Proietti; Mariagiulia Matteucci; Stefania Mignani; Bernard P. Veldkamp – Journal of Educational and Behavioral Statistics, 2024

Classical automated test assembly (ATA) methods assume fixed and known coefficients for the constraints and the objective function. This hypothesis is not true for the estimates of item response theory parameters, which are crucial elements in test assembly classical models. To account for uncertainty in ATA, we propose a chance-constrained…

Descriptors: Automation, Computer Assisted Testing, Ambiguity (Context), Item Response Theory

Variational Estimation for Multidimensional Generalized Partial Credit Model

Peer reviewed

Direct link

Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…

Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics

Testing Latent Variable Distribution Fit in IRT Using Posterior Residuals

Peer reviewed

Direct link

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021

This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…

Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis

On the Treatment of Missing Data in Background Questionnaires in Educational Large-Scale Assessments: An Evaluation of Different Procedures

Peer reviewed

Direct link

Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021

Large-scale assessments (LSAs) use Mislevy's "plausible value" (PV) approach to relate student proficiency to noncognitive variables administered in a background questionnaire. This method requires background variables to be completely observed, a requirement that is seldom fulfilled. In this article, we evaluate and compare the…

Descriptors: Data Analysis, Error of Measurement, Research Problems, Statistical Inference

Spurious Latent Class Problem in the Mixed Rasch Model: A Comparison of Three Maximum Likelihood Estimation Methods under Different Ability Distributions

Peer reviewed

Direct link

Sen, Sedat – International Journal of Testing, 2018

Recent research has shown that over-extraction of latent classes can be observed in the Bayesian estimation of the mixed Rasch model when the distribution of ability is non-normal. This study examined the effect of non-normal ability distributions on the number of latent classes in the mixed Rasch model when estimated with maximum likelihood…

Descriptors: Item Response Theory, Comparative Analysis, Computation, Maximum Likelihood Statistics

Comparing DIF Methods for Data with Dual Dependency

Peer reviewed

Direct link

Jin, Ying; Kang, Minsoo – Large-scale Assessments in Education, 2016

Background: The current study compared four differential item functioning (DIF) methods to examine their performances in terms of accounting for dual dependency (i.e., person and item clustering effects) simultaneously by a simulation study, which is not sufficiently studied under the current DIF literature. The four methods compared are logistic…

Descriptors: Comparative Analysis, Test Bias, Simulation, Regression (Statistics)

Phantom Effects in Multilevel Compositional Analysis: Problems and Solutions

Peer reviewed

Direct link

Pokropek, Artur – Sociological Methods & Research, 2015

This article combines statistical and applied research perspective showing problems that might arise when measurement error in multilevel compositional effects analysis is ignored. This article focuses on data where independent variables are constructed measures. Simulation studies are conducted evaluating methods that could overcome the…

Descriptors: Error of Measurement, Hierarchical Linear Modeling, Simulation, Evaluation Methods

A Comparison of Linking Methods for Estimating National Trends in International Comparative Large-Scale Assessments in the Presence of Cross-national DIF

Peer reviewed

Direct link

Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016

Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…

Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Detecting Differential Item Functioning Using Generalized Logistic Regression in the Context of Large-Scale Assessments

Peer reviewed

Direct link

Svetina, Dubravka; Rutkowski, Leslie – Large-scale Assessments in Education, 2014

Background: When studying student performance across different countries or cultures, an important aspect for comparisons is that of score comparability. In other words, it is imperative that the latent variable (i.e., construct of interest) is understood and measured equivalently across all participating groups or countries, if our inferences…

Descriptors: Test Items, Item Response Theory, Item Analysis, Regression (Statistics)

Improving Science Assessments by Situating Them in a Virtual Environment

Peer reviewed
PDF on ERIC

Download full text

Ketelhut, Diane Jass; Nelson, Brian; Schifter, Catherine; Kim, Younsu – Education Sciences, 2013

Current science assessments typically present a series of isolated fact-based questions, poorly representing the complexity of how real-world science is constructed. The National Research Council asserts that this needs to change to reflect a more authentic model of science practice. We strongly concur and suggest that good science assessments…

Descriptors: Virtual Classrooms, Science Tests, Academic Standards, Middle School Students

Achievement Tests	11
International Assessment	11
Mathematics Achievement	11
Mathematics Tests	11
Science Achievement	11
Science Tests	11
Elementary Secondary Education	10
Foreign Countries	10
Simulation	10
Item Response Theory	7
Error of Measurement	5
Test Items	5
Comparative Analysis	4
Evaluation Methods	4
Item Analysis	4
Data Analysis	2
Maximum Likelihood Statistics	2
Models	2
Regression (Statistics)	2
Secondary School Students	2
Statistical Analysis	2
Test Bias	2
Test Construction	2
Academic Standards	1
Accuracy	1
More ▼

Bernard P. Veldkamp	1
Chengyu Cui	1
Chun Wang	1
Giada Spaccapanico Proietti	1
Gongjun Xu	1
Grund, Simon	1
Haag, Nicole	1
Jin, Ying	1
Kang, Minsoo	1
Kelecioglu, Hülya	1
Ketelhut, Diane Jass	1
Kim, Younsu	1
Lüdtke, Oliver	1
Mariagiulia Matteucci	1
Monroe, Scott	1
Nelson, Brian	1
Pokropek, Artur	1
Robitzsch, Alexander	1
Roppelt, Alexander	1
Rutkowski, Leslie	1
Sachse, Karoline A.	1
Schifter, Catherine	1
Sen, Sedat	1
Stefania Mignani	1
Svetina, Dubravka	1
More ▼