Showing all 12 results
Peer reviewed
Jihong Zhang; Jonathan Templin; Xinya Liang – Journal of Educational Measurement, 2024
Recently, Bayesian diagnostic classification modeling has become popular in health psychology, education, and sociology. Typically, information criteria are used for model selection when researchers want to choose the best among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…
Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification
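As a loose illustration of the posterior predictive checking mentioned in the abstract above, the minimal sketch below draws replicated datasets from posterior draws of a simple per-item Bernoulli model and compares an observed discrepancy statistic against its replicated distribution. The model, the total-score-variance statistic, and all variable names are illustrative assumptions, not the authors' implementation.

```python
# Minimal posterior predictive check (PPC) sketch for dichotomous item data.
# Assumptions (not from the article): one Bernoulli success probability per
# item, and the variance of total scores as the discrepancy statistic.
import numpy as np

rng = np.random.default_rng(0)

# Observed data: persons x items (simulated here only so the sketch runs).
n_persons, n_items = 200, 10
observed = rng.binomial(1, 0.6, size=(n_persons, n_items))

# Stand-in posterior draws of the item success probabilities
# (in practice these would come from an MCMC sampler).
n_draws = 500
posterior_p = rng.beta(observed.sum(axis=0) + 1,
                       n_persons - observed.sum(axis=0) + 1,
                       size=(n_draws, n_items))

def discrepancy(data):
    """Discrepancy statistic: variance of the total scores."""
    return data.sum(axis=1).var()

obs_stat = discrepancy(observed)

# One replicated dataset per posterior draw, collecting the statistic.
rep_stats = np.empty(n_draws)
for d in range(n_draws):
    replicated = rng.binomial(1, posterior_p[d], size=(n_persons, n_items))
    rep_stats[d] = discrepancy(replicated)

# Posterior predictive p-value: proportion of replicated statistics
# at least as extreme as the observed one.
ppp = (rep_stats >= obs_stat).mean()
print(f"observed statistic = {obs_stat:.2f}, posterior predictive p = {ppp:.2f}")
```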
Peer reviewed
Sijia Huang; Seungwon Chung; Carl F. Falk – Journal of Educational Measurement, 2024
In this study, we introduced a cross-classified multidimensional nominal response model (CC-MNRM) to account for various response styles (RS) in the presence of cross-classified data. The proposed model allows slopes to vary across items and can explore impacts of observed covariates on latent constructs. We applied a recently developed variant of…
Descriptors: Response Style (Tests), Classification, Data, Models
Peer reviewed
Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016
A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…
Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification
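For context on the "maximum information at the cut score" design principle discussed above, the sketch below evaluates 2PL item and test information at a hypothetical cut point on the theta scale. The item parameters and cut value are made-up assumptions for illustration.

```python
# Sketch: test information at a cut score under a 2PL IRT model.
# Item information for the 2PL is I_j(theta) = a_j^2 * P_j(theta) * (1 - P_j(theta));
# test information is the sum over items. Parameters below are illustrative.
import numpy as np

a = np.array([1.2, 0.8, 1.5, 1.0, 0.9])   # discriminations (assumed)
b = np.array([-0.5, 0.0, 0.3, 0.8, 1.1])  # difficulties (assumed)
cut_score = 0.5                            # cut point on the theta scale (assumed)

def item_information(theta, a, b):
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def test_information(theta, a, b):
    return item_information(theta, a, b).sum()

print(f"Test information at the cut score: {test_information(cut_score, a, b):.3f}")
```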
Peer reviewed
Wang, Wenyi; Song, Lihong; Chen, Ping; Meng, Yaru; Ding, Shuliang – Journal of Educational Measurement, 2015
Classification consistency and accuracy are viewed as important indicators for evaluating the reliability and validity of classification results in cognitive diagnostic assessment (CDA). Pattern-level classification consistency and accuracy indices were introduced by Cui, Gierl, and Chang. However, the indices at the attribute level have not yet…
Descriptors: Classification, Reliability, Accuracy, Cognitive Tests
Peer reviewed
Suh, Youngsuk – Journal of Educational Measurement, 2016
This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…
Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance
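The signed and unsigned weighted P-difference measures named in the abstract above can be illustrated, in one common unidimensional formulation, as a weighted average of the difference between reference- and focal-group item response functions; the sketch below uses a 2PL item, a normal focal-group weighting, and made-up parameters as assumptions, and does not show the article's multidimensional extension.

```python
# Sketch of signed and unsigned weighted P-difference DIF effect sizes for a
# single dichotomous item under a unidimensional 2PL model. Weighting by a
# standard-normal focal-group ability distribution and the quadrature grid
# are assumptions for illustration.
import numpy as np

def p_2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Item parameters in the reference and focal groups (assumed values).
a_ref, b_ref = 1.0, 0.0
a_foc, b_foc = 1.0, 0.4   # shifted difficulty -> DIF against the focal group

# Quadrature grid and focal-group weights (standard normal, an assumption).
theta = np.linspace(-4, 4, 81)
weights = np.exp(-0.5 * theta**2)
weights /= weights.sum()

diff = p_2pl(theta, a_ref, b_ref) - p_2pl(theta, a_foc, b_foc)
signed_wpd = np.sum(weights * diff)            # can cancel if DIF changes sign
unsigned_wpd = np.sum(weights * np.abs(diff))  # magnitude regardless of sign

print(f"signed weighted P-difference:   {signed_wpd:.3f}")
print(f"unsigned weighted P-difference: {unsigned_wpd:.3f}")
```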
Peer reviewed
Lathrop, Quinn N.; Cheng, Ying – Journal of Educational Measurement, 2014
When cut scores for classifications occur on the total score scale, popular methods for estimating classification accuracy (CA) and classification consistency (CC) require assumptions about a parametric form of the test scores or about a parametric response model, such as item response theory (IRT). This article develops an approach to estimate CA…
Descriptors: Cutting Scores, Classification, Computation, Nonparametric Statistics
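To make the classification accuracy (CA) and classification consistency (CC) quantities in the entry above concrete, the sketch below computes both at an observed-score cut using a simple binomial model for each examinee's total score. The binomial assumption, item count, cut, and simulated true scores are illustrative only; this is not the nonparametric estimator developed in the article.

```python
# Sketch of classification accuracy (CA) and consistency (CC) at a cut score
# on the total-score scale, assuming a binomial conditional distribution of
# observed scores given each examinee's true proportion correct.
import numpy as np
from scipy.stats import binom

rng = np.random.default_rng(1)

n_items = 40
cut = 24                                     # observed-score cut (assumed)
true_p = rng.uniform(0.3, 0.9, size=500)     # examinees' true proportions correct
true_master = (true_p * n_items) >= cut      # "true" classification

# P(observed total score >= cut) for each examinee under the binomial model.
p_above = 1.0 - binom.cdf(cut - 1, n_items, true_p)

# Accuracy: probability the observed classification agrees with the true one.
ca = np.where(true_master, p_above, 1.0 - p_above).mean()

# Consistency: probability two independent administrations agree.
cc = (p_above**2 + (1.0 - p_above)**2).mean()

print(f"classification accuracy = {ca:.3f}, classification consistency = {cc:.3f}")
```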
Peer reviewed
Rutkowski, Leslie; Zhou, Yan – Journal of Educational Measurement, 2015
Given the importance of large-scale assessments to educational policy conversations, it is critical that subpopulation achievement is estimated reliably and with sufficient precision. Despite this importance, biased subpopulation estimates have been found to occur when variables in the conditioning model side of a latent regression model contain…
Descriptors: Error of Measurement, Error Correction, Regression (Statistics), Computation
Peer reviewed
Kunina-Habenicht, Olga; Rupp, Andre A.; Wilhelm, Oliver – Journal of Educational Measurement, 2012
Using a complex simulation study, we investigated parameter recovery, classification accuracy, and the performance of two item-fit statistics for correct and misspecified diagnostic classification models within a log-linear modeling framework. The basic manipulated test design factors included the number of respondents (1,000 vs. 10,000), attributes (3…
Descriptors: Classification, Accuracy, Goodness of Fit, Models
Peer reviewed
Cui, Ying; Gierl, Mark J.; Chang, Hua-Hua – Journal of Educational Measurement, 2012
This article introduces procedures for the computation and asymptotic statistical inference for classification consistency and accuracy indices specifically designed for cognitive diagnostic assessments. The new classification indices can be used as important indicators of the reliability and validity of classification results produced by…
Descriptors: Classification, Accuracy, Cognitive Tests, Diagnostic Tests
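A generic posterior-based illustration of the kind of classification consistency and accuracy indices described above: given each examinee's posterior distribution over attribute patterns, accuracy can be gauged by the posterior mass on the assigned pattern and consistency by the chance that two independent classifications agree. This is a hedged sketch in that spirit, not necessarily the exact indices derived by Cui, Gierl, and Chang; the posterior probabilities are simulated stand-ins.

```python
# Sketch of posterior-based classification indices for a diagnostic assessment.
# Assumptions: posterior pattern probabilities are available per examinee
# (here faked with a Dirichlet draw); classification is by maximum a posteriori.
import numpy as np

rng = np.random.default_rng(2)

n_examinees, n_patterns = 300, 8    # e.g., 3 binary attributes -> 2^3 patterns

# Stand-in posterior probabilities over patterns (rows sum to 1); in practice
# these come from a fitted diagnostic classification model.
posterior = rng.dirichlet(alpha=np.full(n_patterns, 0.4), size=n_examinees)

assigned = posterior.argmax(axis=1)                        # MAP classification
accuracy_i = posterior[np.arange(n_examinees), assigned]   # mass on assigned pattern
consistency_i = (posterior**2).sum(axis=1)  # P(two posterior draws agree)

print(f"mean accuracy index:    {accuracy_i.mean():.3f}")
print(f"mean consistency index: {consistency_i.mean():.3f}")
```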
Peer reviewed
de la Torre, Jimmy; Hong, Yuan; Deng, Weiling – Journal of Educational Measurement, 2010
To better understand the statistical properties of the deterministic inputs, noisy "and" gate cognitive diagnosis (DINA) model, the impact of several factors on the quality of the item parameter estimates and classification accuracy was investigated. Results of the simulation study indicate that the fully Bayes approach is most accurate when the…
Descriptors: Classification, Computation, Models, Simulation
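The DINA ("deterministic inputs, noisy and gate") item response function studied above has a compact form: an examinee answers item j correctly with probability 1 - s_j when they possess every attribute the Q-matrix requires for that item, and with probability g_j otherwise. The sketch below implements that rule; the Q-matrix, slip, and guess values are illustrative assumptions.

```python
# Sketch of the DINA item response function.
# P(X_j = 1 | alpha) = eta_j * (1 - s_j) + (1 - eta_j) * g_j,
# where eta_j = 1 iff alpha contains all attributes required by item j.
import numpy as np

Q = np.array([[1, 0, 0],        # item-by-attribute Q-matrix (assumed)
              [0, 1, 1],
              [1, 1, 0]])
slip = np.array([0.10, 0.15, 0.20])   # s_j (assumed)
guess = np.array([0.20, 0.10, 0.25])  # g_j (assumed)

def dina_prob(alpha, Q, slip, guess):
    """P(correct) on every item for one examinee's attribute vector alpha."""
    eta = np.all(alpha >= Q, axis=1).astype(float)
    return eta * (1.0 - slip) + (1.0 - eta) * guess

alpha = np.array([1, 1, 0])   # examinee masters attributes 1 and 2 only
print(dina_prob(alpha, Q, slip, guess))   # -> [0.90, 0.10, 0.80]
```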
Peer reviewed
Kalohn, John C.; Spray, Judith A. – Journal of Educational Measurement, 1999
Examined the effects of model misspecification on the precision of decisions made using the sequential probability ratio test (SPRT) in computer testing. Simulation results show that the one-parameter logistic model produced more errors than the true model. (SLD)
Descriptors: Classification, Computer Assisted Testing, Decision Making, Models
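The sequential probability ratio test (SPRT) examined above accumulates the log likelihood ratio of item responses under a point just above the cut versus a point just below, stopping when the ratio crosses boundaries set by the nominal error rates. The sketch below shows the mechanism under a 2PL model; the indifference-region points, error rates, item parameters, and simulated examinee are assumptions, not the article's design.

```python
# Sketch of the SPRT for a two-category classification decision.
import numpy as np

def p_2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

theta0, theta1 = -0.2, 0.2     # indifference-region endpoints around the cut (assumed)
alpha_err, beta_err = 0.05, 0.05
upper = np.log((1 - beta_err) / alpha_err)   # decide "above cut"
lower = np.log(beta_err / (1 - alpha_err))   # decide "below cut"

rng = np.random.default_rng(3)
a_pars = rng.uniform(0.8, 1.6, size=50)      # item parameters (assumed)
b_pars = rng.normal(0.0, 1.0, size=50)
true_theta = 0.6                             # simulated examinee (assumed)

log_lr = 0.0
for j in range(50):
    x = rng.binomial(1, p_2pl(true_theta, a_pars[j], b_pars[j]))
    p1 = p_2pl(theta1, a_pars[j], b_pars[j])
    p0 = p_2pl(theta0, a_pars[j], b_pars[j])
    log_lr += np.log(p1 / p0) if x == 1 else np.log((1 - p1) / (1 - p0))
    if log_lr >= upper:
        print(f"classified above the cut after {j + 1} items"); break
    if log_lr <= lower:
        print(f"classified below the cut after {j + 1} items"); break
else:
    print("maximum test length reached without a decision")
```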
Peer reviewed
Spray, Judith A.; Welch, Catherine J. – Journal of Educational Measurement, 1990
The effect of large, within-examinee item difficulty variability on estimates of the proportion of consistent classification of examinees into mastery categories was studied over 2 test administrations for 100 simulated examinees. The proportion of consistent classifications was adequately estimated using the technique proposed by M. Subkoviak…
Descriptors: Classification, Difficulty Level, Estimation (Mathematics), Item Response Theory
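A minimal sketch of a Subkoviak-style single-administration estimate of the proportion of consistent mastery classifications referenced above: model each examinee's score on a randomly parallel form as binomial with an estimated true proportion correct, then average the probability that two forms would yield the same decision. Using the raw observed proportion correct as the true-score estimate (rather than a reliability-regressed estimate) and the simulated scores are simplifying assumptions here.

```python
# Sketch of a binomial-model estimate of decision consistency from a single
# administration, in the spirit of Subkoviak's approach.
import numpy as np
from scipy.stats import binom

rng = np.random.default_rng(4)

n_items, cut = 30, 21
observed_scores = rng.binomial(n_items, rng.uniform(0.4, 0.95, size=200))
p_true = observed_scores / n_items           # crude true-score estimate (assumed)

# P(score >= cut on a randomly parallel form) under the binomial model.
p_master = 1.0 - binom.cdf(cut - 1, n_items, p_true)

# Probability that two administrations classify the examinee the same way.
consistency = (p_master**2 + (1.0 - p_master)**2).mean()
print(f"estimated proportion of consistent classifications: {consistency:.3f}")
```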