ERIC - Search Results

Showing 1 to 15 of 31 results

Modeling Hierarchical Attribute Structures in Diagnostic Classification Models with Multiple Attempts

Peer reviewed

Tae Yeon Kwon; A. Corinne Huggins-Manley; Jonathan Templin; Mingying Zheng – Journal of Educational Measurement, 2024

In classroom assessments, examinees can often answer test items multiple times, resulting in sequential multiple-attempt data. Sequential diagnostic classification models (DCMs) have been developed for such data. As student learning processes may be aligned with a hierarchy of measured traits, this study aimed to develop a sequential hierarchical…

Descriptors: Classification, Accuracy, Student Evaluation, Sequential Approach

Model Selection Posterior Predictive Model Checking via Limited-Information Indices for Bayesian Diagnostic Classification Modeling

Peer reviewed

Jihong Zhang; Jonathan Templin; Xinya Liang – Journal of Educational Measurement, 2024

Recently, Bayesian diagnostic classification modeling has been becoming popular in health psychology, education, and sociology. Typically information criteria are used for model selection when researchers want to choose the best model among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…

Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Modeling Response Styles in Cross-Classified Data Using a Cross-Classified Multidimensional Nominal Response Model

Peer reviewed

Sijia Huang; Seungwon Chung; Carl F. Falk – Journal of Educational Measurement, 2024

In this study, we introduced a cross-classified multidimensional nominal response model (CC-MNRM) to account for various response styles (RS) in the presence of cross-classified data. The proposed model allows slopes to vary across items and can explore impacts of observed covariates on latent constructs. We applied a recently developed variant of…

Descriptors: Response Style (Tests), Classification, Data, Models

Estimating Classification Accuracy and Consistency Indices for Multiple Measures with the Simple Structure MIRT Model

Peer reviewed

Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023

Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…

Descriptors: Testing, Computation, Classification, Accuracy

Validating Performance Standards via Latent Class Analysis

Peer reviewed

Binici, Salih; Cuhadar, Ismail – Journal of Educational Measurement, 2022

Validity of performance standards is a key element for the defensibility of standard setting results, and validating performance standards requires collecting multiple pieces of evidence at every step during the standard setting process. This study employs a statistical procedure, latent class analysis, to set performance standards and compares…

Descriptors: Validity, Performance, Standards, Multivariate Analysis

Classification Accuracy and Consistency of Compensatory Composite Test Scores

Peer reviewed

Setzer, J. Carl; Cheng, Ying; Liu, Cheng – Journal of Educational Measurement, 2023

Test scores are often used to make decisions about examinees, such as in licensure and certification testing, as well as in many educational contexts. In some cases, these decisions are based upon compensatory scores, such as those from multiple sections or components of an exam. Classification accuracy and classification consistency are two…

Descriptors: Classification, Accuracy, Psychometrics, Scores

A Recursion-Based Analytical Approach to Evaluate the Performance of MST

Peer reviewed

Lim, Hwanggyu; Davey, Tim; Wells, Craig S. – Journal of Educational Measurement, 2021

This study proposed a recursion-based analytical approach to assess measurement precision of ability estimation and classification accuracy in multistage adaptive tests (MSTs). A simulation study was conducted to compare the proposed recursion-based analytical method with an analytical method proposed by Park, Kim, Chung, and Dodd and with the…

Descriptors: Adaptive Testing, Measurement, Accuracy, Classification

A One-Parameter Diagnostic Classification Model with Familiar Measurement Properties

Peer reviewed

Matthew J. Madison; Stefanie A. Wind; Lientje Maas; Kazuhiro Yamaguchi; Sergio Haab – Journal of Educational Measurement, 2024

Diagnostic classification models (DCMs) are psychometric models designed to classify examinees according to their proficiency or nonproficiency of specified latent characteristics. These models are well suited for providing diagnostic and actionable feedback to support intermediate and formative assessment efforts. Several DCMs have been developed…

Descriptors: Diagnostic Tests, Classification, Models, Psychometrics

A Computationally Simple Method for Estimating Decision Consistency

Peer reviewed

Wolkowitz, Amanda A. – Journal of Educational Measurement, 2021

Decision consistency (DC) is the reliability of a classification decision based on a test score. In professional credentialing, the decision is often a high-stakes pass/fail decision. The current methods for estimating DC are computationally complex. The purpose of this research is to provide a computationally and conceptually simple method for…

Descriptors: Decision Making, Reliability, Classification, Scores

Classification Consistency and Accuracy with Atypical Score Distributions

Peer reviewed

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2020

The current study aims to evaluate the performance of three non-IRT procedures (i.e., normal approximation, Livingston-Lewis, and compound multinomial) for estimating classification indices when the observed score distribution shows atypical patterns: (a) bimodality, (b) structural (i.e., systematic) bumpiness, or (c) structural zeros (i.e., no…

Descriptors: Classification, Accuracy, Scores, Cutting Scores

Measures of Agreement to Assess Attribute-Level Classification Accuracy and Consistency for Cognitive Diagnostic Assessments

Peer reviewed

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational Measurement, 2018

One of the proposed uses of cognitive diagnostic assessments is to classify the examinees as either masters or nonmasters on each of a number of skills being assessed. As with any test, it is important to report the quality of these binary classifications with measures of their reliability. Cui et al. and Wang et al. have suggested reliability…

Descriptors: Classification, Accuracy, Test Reliability, Diagnostic Tests

Examining Psychometric Properties and Level Classification of the van Hiele Geometry Test Using CTT and CDM Frameworks

Peer reviewed

Chen, Yi-Hsin; Senk, Sharon L.; Thompson, Denisse R.; Voogt, Kevin – Journal of Educational Measurement, 2019

The van Hiele theory and van Hiele Geometry Test have been extensively used in mathematics assessments across countries. The purpose of this study is to use classical test theory (CTT) and cognitive diagnostic modeling (CDM) frameworks to examine psychometric properties of the van Hiele Geometry Test and to compare how various classification…

Descriptors: Geometry, Mathematics Tests, Test Theory, Psychometrics

IRT Approaches to Modeling Scores on Mixed-Format Tests

Peer reviewed

Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020

This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…

Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests

An Item-Level Expected Classification Accuracy and Its Applications in Cognitive Diagnostic Assessment

Peer reviewed

Wang, Wenyi; Song, Lihong; Chen, Ping; Ding, Shuliang – Journal of Educational Measurement, 2019

Most of the existing classification accuracy indices of attribute patterns lose effectiveness when the response data is absent in diagnostic testing. To handle this issue, this article proposes new indices to predict the correct classification rate of a diagnostic test before administering the test under the deterministic noise input…

Descriptors: Cognitive Tests, Classification, Accuracy, Diagnostic Tests
