ERIC - Search Results

Showing 1 to 15 of 19 results

Summary Intervals for Model-Based Classification Accuracy and Consistency Indices

Peer reviewed

Gonzalez, Oscar – Educational and Psychological Measurement, 2023

When scores are used to make decisions about respondents, it is of interest to estimate classification accuracy (CA), the probability of making a correct decision, and classification consistency (CC), the probability of making the same decision across two parallel administrations of the measure. Model-based estimates of CA and CC computed from the…

Descriptors: Classification, Accuracy, Intervals, Probability

A Comparison of Label Switching Algorithms in the Context of Growth Mixture Models

Peer reviewed

Cassiday, Kristina R.; Cho, Youngmi; Harring, Jeffrey R. – Educational and Psychological Measurement, 2021

Simulation studies involving mixture models inevitably aggregate parameter estimates and other output across numerous replications. A primary issue that arises in these methodological investigations is label switching. The current study compares several label switching corrections that are commonly used when dealing with mixture models. A growth…

Descriptors: Probability, Models, Simulation, Mathematics

Combined Approach to Multi-Informant Data Using Latent Factors and Latent Classes: Trifactor Mixture Model

Peer reviewed

Kim, Eunsook; von der Embse, Nathaniel – Educational and Psychological Measurement, 2021

Although collecting data from multiple informants is highly recommended, methods to model the congruence and incongruence between informants are limited. Bauer and colleagues suggested the trifactor model that decomposes the variances into common factor, informant perspective factors, and item-specific factors. This study extends their work to the…

Descriptors: Probability, Models, Statistical Analysis, Congruence (Psychology)

Estimating Probabilities of Passing for Examinees with Incomplete Data in Mastery Tests

Peer reviewed

Sinharay, Sandip – Educational and Psychological Measurement, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probabilities of passing of the examinees with incomplete data on mastery tests.…

Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness

Growth Mixture Modeling with Nonnormal Distributions: Implications for Data Transformation

Peer reviewed

Nam, Yeji; Hong, Sehee – Educational and Psychological Measurement, 2021

This study investigated the extent to which class-specific parameter estimates are biased by the within-class normality assumption in nonnormal growth mixture modeling (GMM). Monte Carlo simulations for nonnormal GMM were conducted to analyze and compare two strategies for obtaining unbiased parameter estimates: relaxing the within-class normality…

Descriptors: Probability, Models, Statistical Analysis, Statistical Distributions

Relative Diagnostic Profile: A Subscore Reporting Framework

Peer reviewed

Liu, Ren; Qian, Hong; Luo, Xiao; Woo, Ada – Educational and Psychological Measurement, 2018

Subscore reporting under item response theory models has always been a challenge partly because the test length of each subdomain is limited for precisely locating individuals on multiple continua. Diagnostic classification models (DCMs), providing a pass/fail decision and associated probability of pass on each subdomain, are promising…

Descriptors: Classification, Probability, Pass Fail Grading, Scores

Kappa and Rater Accuracy: Paradigms and Parameters

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 2017

Drawing parallels to classical test theory, this article clarifies the difference between rater accuracy and reliability and demonstrates how category marginal frequencies affect rater agreement and Cohen's kappa. Category assignment paradigms are developed: comparing raters to a standard (index) versus comparing two raters to one another…

Descriptors: Interrater Reliability, Evaluators, Accuracy, Statistical Analysis

Adjacent-Categories Mokken Models for Rater-Mediated Assessments

Peer reviewed

Wind, Stefanie A. – Educational and Psychological Measurement, 2017

Molenaar extended Mokken's original probabilistic-nonparametric scaling models for use with polytomous data. These polytomous extensions of Mokken's original scaling procedure have facilitated the use of Mokken scale analysis as an approach to exploring fundamental measurement properties across a variety of domains in which polytomous ratings are…

Descriptors: Nonparametric Statistics, Scaling, Models, Item Response Theory

Does Matching Quality Matter in Mode Comparison Studies?

Peer reviewed

Zeng, Ji; Yin, Ping; Shedden, Kerby A. – Educational and Psychological Measurement, 2015

This article provides a brief overview and comparison of three matching approaches in forming comparable groups for a study comparing test administration modes (i.e., computer-based tests [CBT] and paper-and-pencil tests [PPT]): (a) a propensity score matching approach proposed in this article, (b) the propensity score matching approach used by…

Descriptors: Comparative Analysis, Computer Assisted Testing, Probability, Classification

Retrofitting Diagnostic Classification Models to Responses from IRT-Based Assessment Forms

Peer reviewed

Liu, Ren; Huggins-Manley, Anne Corinne; Bulut, Okan – Educational and Psychological Measurement, 2018

Developing a diagnostic tool within the diagnostic measurement framework is the optimal approach to obtain multidimensional and classification-based feedback on examinees. However, end users may seek to obtain diagnostic feedback from existing item responses to assessments that have been designed under either the classical test theory or item…

Descriptors: Models, Item Response Theory, Psychometrics, Test Construction

Enhancing a Short Measure of Big Five Personality Traits with Bayesian Scaling

Peer reviewed

Jones, W. Paul – Educational and Psychological Measurement, 2014

A study in a university clinic/laboratory investigated adaptive Bayesian scaling as a supplement to interpretation of scores on the Mini-IPIP. A "probability of belonging" in categories of low, medium, or high on each of the Big Five traits was calculated after each item response and continued until all items had been used or until a…

Descriptors: Personality Traits, Personality Measures, Bayesian Statistics, Clinics

Using a Model of Analysts' Judgments to Augment an Item Calibration Process

Peer reviewed

Hauser, Carl; Thum, Yeow Meng; He, Wei; Ma, Lingling – Educational and Psychological Measurement, 2015

When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of human reviews (HR) to be completed is large. Furthermore, such a process leads to decisions that are susceptible…

Descriptors: Test Items, Item Response Theory, Research Methodology, Decision Making

Formulating the Rasch Differential Item Functioning Model under the Marginal Maximum Likelihood Estimation Context and Its Comparison with Mantel-Haenszel Procedure in Short Test and Small Sample Conditions

Peer reviewed

Paek, Insu; Wilson, Mark – Educational and Psychological Measurement, 2011

This study elaborates the Rasch differential item functioning (DIF) model formulation under the marginal maximum likelihood estimation context. Also, the Rasch DIF model performance was examined and compared with the Mantel-Haenszel (MH) procedure in small sample and short test length conditions through simulations. The theoretically known…

Descriptors: Test Bias, Test Length, Statistical Inference, Geometric Concepts

Computerized Classification Testing under the One-Parameter Logistic Response Model with Ability-Based Guessing

Peer reviewed

Wang, Wen-Chung; Huang, Sheng-Yun – Educational and Psychological Measurement, 2011

The one-parameter logistic model with ability-based guessing (1PL-AG) has been recently developed to account for the effect of ability on guessing behavior in multiple-choice items. In this study, the authors developed algorithms for computerized classification testing under the 1PL-AG and conducted a series of simulations to evaluate their…

Descriptors: Computer Assisted Testing, Classification, Item Analysis, Probability

Mutual Information Item Selection in Adaptive Classification Testing

Peer reviewed

Weissman, Alexander – Educational and Psychological Measurement, 2007

A general approach for item selection in adaptive multiple-category classification tests is provided. The approach uses mutual information (MI), a special case of the Kullback-Leibler distance, or relative entropy. MI works efficiently with the sequential probability ratio test and alleviates the difficulties encountered with using other local-…

Descriptors: Scientific Concepts, Probability, Test Length, Item Analysis

