ERIC - Search Results

Showing 1 to 15 of 157 results

Curvilinearity in the Reference Composite and Practical Implications for Measurement

Peer reviewed

Xiangyi Liao; Daniel M. Bolt; Jee-Seon Kim – Journal of Educational Measurement, 2024

Item difficulty and dimensionality often correlate, implying that unidimensional IRT approximations to multidimensional data (i.e., reference composites) can take a curvilinear form in the multidimensional space. Although this issue has been previously discussed in the context of vertical scaling applications, we illustrate how such a phenomenon…

Descriptors: Difficulty Level, Simulation, Multidimensional Scaling, Graphs

Model Selection Posterior Predictive Model Checking via Limited-Information Indices for Bayesian Diagnostic Classification Modeling

Peer reviewed

Jihong Zhang; Jonathan Templin; Xinya Liang – Journal of Educational Measurement, 2024

Recently, Bayesian diagnostic classification modeling has become increasingly popular in health psychology, education, and sociology. Typically, information criteria are used for model selection when researchers want to choose the best model among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…

Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification

Measuring the Uncertainty of Imputed Scores

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2023

Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…

Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation

A Dual-Purpose Model for Binary Data: Estimating Ability and Misconceptions

Peer reviewed

Wenchao Ma; Miguel A. Sorrel; Xiaoming Zhai; Yuan Ge – Journal of Educational Measurement, 2024

Most existing diagnostic models are developed to detect whether students have mastered a set of skills of interest, but few have focused on identifying what scientific misconceptions students possess. This article developed a general dual-purpose model for simultaneously estimating students' overall ability and the presence and absence of…

Descriptors: Models, Misconceptions, Diagnostic Tests, Ability

Generating Models for Item Preknowledge

Peer reviewed

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2022

Detection methods for item preknowledge are often evaluated in simulation studies where models are used to generate the data. To ensure the reliability of such methods, it is crucial that these models are able to accurately represent situations that are encountered in practice. The purpose of this article is to provide a critical analysis of…

Descriptors: Prior Learning, Simulation, Models, Reaction Time

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

Fully Gibbs Sampling Algorithms for Bayesian Variable Selection in Latent Regression Models

Peer reviewed

Yamaguchi, Kazuhiro; Zhang, Jihong – Journal of Educational Measurement, 2023

This study proposed Gibbs sampling algorithms for variable selection in a latent regression model under a unidimensional two-parameter logistic item response theory model. Three types of shrinkage priors were employed to obtain shrinkage estimates: double-exponential (i.e., Laplace), horseshoe, and horseshoe+ priors. These shrinkage priors were…

Descriptors: Algorithms, Simulation, Mathematics Achievement, Bayesian Statistics

Modeling Response Styles in Cross-Classified Data Using a Cross-Classified Multidimensional Nominal Response Model

Peer reviewed

Sijia Huang; Seungwon Chung; Carl F. Falk – Journal of Educational Measurement, 2024

In this study, we introduced a cross-classified multidimensional nominal response model (CC-MNRM) to account for various response styles (RS) in the presence of cross-classified data. The proposed model allows slopes to vary across items and can explore impacts of observed covariates on latent constructs. We applied a recently developed variant of…

Descriptors: Response Style (Tests), Classification, Data, Models

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

A Residual-Based Differential Item Functioning Detection Framework in Item Response Theory

Peer reviewed

Lim, Hwanggyu; Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

Differential item functioning (DIF) of test items should be evaluated using practical methods that can produce accurate and useful results. Among a plethora of DIF detection techniques, we introduce the new "Residual DIF" (RDIF) framework, which stands out for its accessibility without sacrificing efficacy. This framework consists of…

Descriptors: Test Items, Item Response Theory, Identification, Robustness (Statistics)

Modeling Nonlinear Effects of Person-by-Item Covariates in Explanatory Item Response Models: Exploratory Plots and Modeling Using Smooth Functions

Peer reviewed

Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Journal of Educational Measurement, 2024

Explanatory item response models (EIRMs) have been applied to investigate the effects of person covariates, item covariates, and their interactions in the fields of reading education and psycholinguistics. In practice, it is often assumed that the relationships between the covariates and the logit transformation of item response probability are…

Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics

Simultaneous Constrained Adaptive Item Selection for Group-Based Testing

Peer reviewed

Bengs, Daniel; Kroehne, Ulf; Brefeld, Ulf – Journal of Educational Measurement, 2021

By tailoring test forms to the test-taker's proficiency, Computerized Adaptive Testing (CAT) enables substantial increases in testing efficiency over fixed forms testing. When used for formative assessment, the alignment of task difficulty with proficiency increases the chance that teachers can derive useful feedback from assessment data. The…

Descriptors: Computer Assisted Testing, Formative Evaluation, Group Testing, Program Effectiveness

Logistic Regression Procedure Using Penalized Maximum Likelihood Estimation for Differential Item Functioning

Peer reviewed

Lee, Sunbok – Journal of Educational Measurement, 2020

In the logistic regression (LR) procedure for differential item functioning (DIF), the parameters of LR have often been estimated using maximum likelihood (ML) estimation. However, ML estimation suffers from finite-sample bias. Furthermore, ML estimation for LR can be substantially biased in the presence of rare event data. The bias of ML…

Descriptors: Regression (Statistics), Test Bias, Maximum Likelihood Statistics, Simulation

Two IRT Fixed Parameter Calibration Methods for the Bifactor Model

Peer reviewed

Kim, Kyung Yong – Journal of Educational Measurement, 2020

New items are often evaluated prior to their operational use to obtain item response theory (IRT) item parameter estimates for quality control purposes. Fixed parameter calibration is one linking method that is widely used to estimate parameters for new items and place them on the desired scale. This article provides detailed descriptions of two…

Descriptors: Item Response Theory, Evaluation Methods, Test Items, Simulation

Differential and Functional Response Time Item Analysis: An Application to Understanding Paper versus Digital Reading Processes

Peer reviewed

Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Jorge Salas – Journal of Educational Measurement, 2024

Despite the growing interest in incorporating response time data into item response models, there has been a lack of research investigating how the effect of speed on the probability of a correct response varies across different groups (e.g., experimental conditions) for various items (i.e., differential response time item analysis). Furthermore,…

Descriptors: Item Response Theory, Reaction Time, Models, Accuracy
