ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	19

Descriptor

Simulation	19
Foreign Countries	12
International Assessment	11
Item Response Theory	11
Science Achievement	11
Achievement Tests	10
Mathematics Achievement	10
Mathematics Tests	10
Science Tests	10
Elementary Secondary Education	9
Test Items	8
Comparative Analysis	6
Evaluation Methods	6
Models	6
Error of Measurement	5
Hierarchical Linear Modeling	4
Item Analysis	4
Test Bias	4
Data Analysis	3
Educational Assessment	3
Grade 4	3
Measurement	3
Sample Size	3
Statistical Analysis	3
Accuracy	2
More ▼

Source

Journal of Educational and…	5
Large-scale Assessments in…	2
ProQuest LLC	2
Applied Measurement in…	1
Canadian Mathematics…	1
Educational Sciences: Theory…	1
Educational and Psychological…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational…	1
Journal of Experimental…	1
Practical Assessment,…	1
Sociological Methods &…	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	15
Dissertations/Theses -…	2
Collected Works - Proceedings	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education	9
Secondary Education	5
Elementary Education	4
Grade 4	3
Grade 8	2
Intermediate Grades	2
Junior High Schools	2
Middle Schools	2
Grade 12	1
High Schools	1
Higher Education	1
More ▼

Audience

Location

Tunisia	2
Armenia	1
Austria	1
Botswana	1
Canada	1
France	1
Honduras	1
Iran	1
Norway	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	19
Program for International…	3
Big Five Inventory	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Reconsidering Multilevel Latent Class Models: Can Level-2 Latent Classes Affect Item Response Probabilities?

Peer reviewed

Direct link

Wang, Yan; Kim, Eunsook; Joo, Seang-Hwane; Chun, Seokjoon; Alamri, Abeer; Lee, Philseok; Stark, Stephen – Journal of Experimental Education, 2022

Multilevel latent class analysis (MLCA) has been increasingly used to investigate unobserved population heterogeneity while taking into account data dependency. Nonparametric MLCA has gained much popularity due to the advantage of classifying both individuals and clusters into latent classes. This study demonstrated the need to relax the…

Descriptors: Nonparametric Statistics, Hierarchical Linear Modeling, Monte Carlo Methods, Simulation

Chance-Constrained Automated Test Assembly

Peer reviewed

Direct link

Giada Spaccapanico Proietti; Mariagiulia Matteucci; Stefania Mignani; Bernard P. Veldkamp – Journal of Educational and Behavioral Statistics, 2024

Classical automated test assembly (ATA) methods assume fixed and known coefficients for the constraints and the objective function. This hypothesis is not true for the estimates of item response theory parameters, which are crucial elements in test assembly classical models. To account for uncertainty in ATA, we propose a chance-constrained…

Descriptors: Automation, Computer Assisted Testing, Ambiguity (Context), Item Response Theory

Variational Estimation for Multidimensional Generalized Partial Credit Model

Peer reviewed

Direct link

Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…

Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics

Testing Latent Variable Distribution Fit in IRT Using Posterior Residuals

Peer reviewed

Direct link

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021

This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…

Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis

On the Treatment of Missing Data in Background Questionnaires in Educational Large-Scale Assessments: An Evaluation of Different Procedures

Peer reviewed

Direct link

Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021

Large-scale assessments (LSAs) use Mislevy's "plausible value" (PV) approach to relate student proficiency to noncognitive variables administered in a background questionnaire. This method requires background variables to be completely observed, a requirement that is seldom fulfilled. In this article, we evaluate and compare the…

Descriptors: Data Analysis, Error of Measurement, Research Problems, Statistical Inference

The Mediated MIMIC Model for Understanding the Underlying Mechanism of DIF

Peer reviewed

Direct link

Cheng, Ying; Shao, Can; Lathrop, Quinn N. – Educational and Psychological Measurement, 2016

Due to its flexibility, the multiple-indicator, multiple-causes (MIMIC) model has become an increasingly popular method for the detection of differential item functioning (DIF). In this article, we propose the mediated MIMIC model method to uncover the underlying mechanism of DIF. This method extends the usual MIMIC model by including one variable…

Descriptors: Test Bias, Models, Simulation, Sample Size

An Aggregate IRT Procedure for Exploratory Factor Analysis

Peer reviewed

Direct link

Camilli, Gregory; Fox, Jean-Paul – Journal of Educational and Behavioral Statistics, 2015

An aggregation strategy is proposed to potentially address practical limitation related to computing resources for two-level multidimensional item response theory (MIRT) models with large data sets. The aggregate model is derived by integration of the normal ogive model, and an adaptation of the stochastic approximation expectation maximization…

Descriptors: Factor Analysis, Item Response Theory, Grade 4, Simulation

Spurious Latent Class Problem in the Mixed Rasch Model: A Comparison of Three Maximum Likelihood Estimation Methods under Different Ability Distributions

Peer reviewed

Direct link

Sen, Sedat – International Journal of Testing, 2018

Recent research has shown that over-extraction of latent classes can be observed in the Bayesian estimation of the mixed Rasch model when the distribution of ability is non-normal. This study examined the effect of non-normal ability distributions on the number of latent classes in the mixed Rasch model when estimated with maximum likelihood…

Descriptors: Item Response Theory, Comparative Analysis, Computation, Maximum Likelihood Statistics

Linking Errors between Two Populations and Tests: A Case Study in International Surveys in Education

Peer reviewed
PDF on ERIC

Download full text

Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015

This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…

Descriptors: Case Studies, Simulation, International Programs, Testing Programs

Centering, Scale Indeterminacy, and Differential Item Functioning Detection in Hierarchical Generalized Linear and Generalized Linear Mixed Models

Peer reviewed

Direct link

Cheong, Yuk Fai; Kamata, Akihito – Applied Measurement in Education, 2013

In this article, we discuss and illustrate two centering and anchoring options available in differential item functioning (DIF) detection studies based on the hierarchical generalized linear and generalized linear mixed modeling frameworks. We compared and contrasted the assumptions of the two options, and examined the properties of their DIF…

Descriptors: Test Bias, Hierarchical Linear Modeling, Comparative Analysis, Test Items

Comparing DIF Methods for Data with Dual Dependency

Peer reviewed

Direct link

Jin, Ying; Kang, Minsoo – Large-scale Assessments in Education, 2016

Background: The current study compared four differential item functioning (DIF) methods to examine their performances in terms of accounting for dual dependency (i.e., person and item clustering effects) simultaneously by a simulation study, which is not sufficiently studied under the current DIF literature. The four methods compared are logistic…

Descriptors: Comparative Analysis, Test Bias, Simulation, Regression (Statistics)

Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys

Peer reviewed

Direct link

Si, Yajuan; Reiter, Jerome P. – Journal of Educational and Behavioral Statistics, 2013

In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian,…

Descriptors: Nonparametric Statistics, Bayesian Statistics, Measurement, Evaluation Methods

Phantom Effects in Multilevel Compositional Analysis: Problems and Solutions

Peer reviewed

Direct link

Pokropek, Artur – Sociological Methods & Research, 2015

This article combines statistical and applied research perspective showing problems that might arise when measurement error in multilevel compositional effects analysis is ignored. This article focuses on data where independent variables are constructed measures. Simulation studies are conducted evaluating methods that could overcome the…

Descriptors: Error of Measurement, Hierarchical Linear Modeling, Simulation, Evaluation Methods

A Comparison of Linking Methods for Estimating National Trends in International Comparative Large-Scale Assessments in the Presence of Cross-national DIF

Peer reviewed

Direct link

Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016

Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…

Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2

Alamri, Abeer	1
Allan, Darien, Ed.	1
Bernard P. Veldkamp	1
Camilli, Gregory	1
Cheng, Ying	1
Chengyu Cui	1
Cheong, Yuk Fai	1
Chun Wang	1
Chun, Seokjoon	1
Desa, Deana	1
Fox, Jean-Paul	1
Giada Spaccapanico Proietti	1
Gongjun Xu	1
Grund, Simon	1
Haag, Nicole	1
Hastedt, Dirk	1
Jin, Ying	1
Joo, Seang-Hwane	1
Kamata, Akihito	1
Kang, Minsoo	1
Kelecioglu, Hülya	1
Kim, Eunsook	1
Lathrop, Quinn N.	1
Lee, Philseok	1
Lu, Yi	1
More ▼