ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	23

Descriptor

Classification	26
Simulation	26
Statistical Analysis	26
Models	13
Comparative Analysis	10
Item Response Theory	10
Computation	7
Sample Size	7
Data Analysis	6
Educational Research	5
Evaluation Methods	5
Mathematics	5
Bayesian Statistics	4
Computer Software	4
Goodness of Fit	4
Information Retrieval	4
Intelligent Tutoring Systems	4
Multivariate Analysis	4
Prediction	4
Student Behavior	4
Test Length	4
Accuracy	3
Artificial Intelligence	3
Automation	3
Computer Assisted Testing	3
More ▼

Source

Educational and Psychological…	7
International Educational…	3
Journal of Educational…	2
ProQuest LLC	2
Psychological Assessment	2
Applied Psychological…	1
Educational Psychology	1
Educational Research and…	1
International Working Group…	1
Journal of Educational and…	1
Measurement:…	1
Multivariate Behavioral…	1
Review of Educational Research	1
More ▼

Publication Type

Journal Articles	18
Reports - Research	14
Reports - Evaluative	5
Collected Works - Proceedings	3
Dissertations/Theses -…	2
Speeches/Meeting Papers	1

Education Level

Junior High Schools	4
Middle Schools	4
Secondary Education	4
Higher Education	3
Postsecondary Education	3
Elementary Education	2
High Schools	2
Adult Education	1
Elementary Secondary Education	1
Grade 10	1
Grade 12	1
Grade 4	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
Intermediate Grades	1
More ▼

Audience

Location

Afghanistan	1
Australia	1
Czech Republic	1
Finland	1
France	1
Illinois (Chicago)	1
Israel	1
Massachusetts	1
Netherlands	1
North Carolina	1
Pennsylvania	1
Slovakia	1
Spain	1
Utah	1
Washington	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Massachusetts Comprehensive…	1
Minnesota Multiphasic…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Implementing a Standardized Effect Size in the POLYSIBTEST Procedure

Peer reviewed

Direct link

Weese, James D.; Turner, Ronna C.; Liang, Xinya; Ames, Allison; Crawford, Brandon – Educational and Psychological Measurement, 2023

A study was conducted to implement the use of a standardized effect size and corresponding classification guidelines for polytomous data with the POLYSIBTEST procedure and compare those guidelines with prior recommendations. Two simulation studies were included. The first identifies new unstandardized test heuristics for classifying moderate and…

Descriptors: Effect Size, Classification, Guidelines, Statistical Analysis

A Comparison of Label Switching Algorithms in the Context of Growth Mixture Models

Peer reviewed

Direct link

Cassiday, Kristina R.; Cho, Youngmi; Harring, Jeffrey R. – Educational and Psychological Measurement, 2021

Simulation studies involving mixture models inevitably aggregate parameter estimates and other output across numerous replications. A primary issue that arises in these methodological investigations is label switching. The current study compares several label switching corrections that are commonly used when dealing with mixture models. A growth…

Descriptors: Probability, Models, Simulation, Mathematics

Handling Missing Data in Cross-Classified Multilevel Analyses: An Evaluation of Different Multiple Imputation Approaches

Peer reviewed

Direct link

Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2023

Multiple imputation (MI) is a popular method for handling missing data. In education research, it can be challenging to use MI because the data often have a clustered structure that need to be accommodated during MI. Although much research has considered applications of MI in hierarchical data, little is known about its use in cross-classified…

Descriptors: Educational Research, Data Analysis, Error of Measurement, Computation

GDINA and CDM Packages in R

Peer reviewed

Direct link

Rupp, André A.; van Rijn, Peter W. – Measurement: Interdisciplinary Research and Perspectives, 2018

We review the GIDNA and CDM packages in R for fitting cognitive diagnosis/diagnostic classification models. We first provide a summary of their core capabilities and then use both simulated and real data to compare their functionalities in practice. We found that the most relevant routines in the two packages appear to be more similar than…

Descriptors: Educational Assessment, Cognitive Measurement, Measurement, Computer Software

Investigation of Rater Effects Using Social Network Analysis and Exponential Random Graph Models

Peer reviewed

Direct link

Lamprianou, Iasonas – Educational and Psychological Measurement, 2018

It is common practice for assessment programs to organize qualifying sessions during which the raters (often known as "markers" or "judges") demonstrate their consistency before operational rating commences. Because of the high-stakes nature of many rating activities, the research community tends to continuously explore new…

Descriptors: Social Networks, Network Analysis, Comparative Analysis, Innovation

Statistical Classification for Cognitive Diagnostic Assessment: An Artificial Neural Network Approach

Peer reviewed

Direct link

Cui, Ying; Gierl, Mark; Guo, Qi – Educational Psychology, 2016

The purpose of the current investigation was to describe how the artificial neural networks (ANNs) can be used to interpret student performance on cognitive diagnostic assessments (CDAs) and evaluate the performances of ANNs using simulation results. CDAs are designed to measure student performance on problem-solving tasks and provide useful…

Descriptors: Cognitive Tests, Diagnostic Tests, Classification, Artificial Intelligence

Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model

Peer reviewed

Direct link

Suh, Youngsuk – Journal of Educational Measurement, 2016

This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…

Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance

Unrestricted Mixture Models for Class Identification in Growth Mixture Modeling

Peer reviewed

Direct link

Liu, Min; Hancock, Gregory R. – Educational and Psychological Measurement, 2014

Growth mixture modeling has gained much attention in applied and methodological social science research recently, but the selection of the number of latent classes for such models remains a challenging issue, especially when the assumption of proper model specification is violated. The current simulation study compared the performance of a linear…

Descriptors: Models, Classification, Simulation, Comparative Analysis

Challenging Conventional Wisdom for Multivariate Statistical Models with Small Samples

Peer reviewed

Direct link

McNeish, Daniel – Review of Educational Research, 2017

In education research, small samples are common because of financial limitations, logistical challenges, or exploratory studies. With small samples, statistical principles on which researchers rely do not hold, leading to trust issues with model estimates and possible replication issues when scaling up. Researchers are generally aware of such…

Descriptors: Models, Statistical Analysis, Sampling, Sample Size

"Your Model Is Predictive-- but Is It Useful?" Theoretical and Empirical Considerations of a New Paradigm for Adaptive Tutoring Evaluation

Download full text

González-Brenes, José P.; Huang, Yun – International Educational Data Mining Society, 2015

Classification evaluation metrics are often used to evaluate adaptive tutoring systems-- programs that teach and adapt to humans. Unfortunately, it is not clear how intuitive these metrics are for practitioners with little machine learning background. Moreover, our experiments suggest that existing convention for evaluating tutoring systems may…

Descriptors: Intelligent Tutoring Systems, Evaluation Methods, Program Evaluation, Student Behavior

Performance of the S - [chi][squared] Statistic for Full-Information Bifactor Models

Peer reviewed

Direct link

Li, Ying; Rupp, Andre A. – Educational and Psychological Measurement, 2011

This study investigated the Type I error rate and power of the multivariate extension of the S - [chi][squared] statistic using unidimensional and multidimensional item response theory (UIRT and MIRT, respectively) models as well as full-information bifactor (FI-bifactor) models through simulation. Manipulated factors included test length, sample…

Descriptors: Test Length, Item Response Theory, Statistical Analysis, Error Patterns

Factors Affecting the Item Parameter Estimation and Classification Accuracy of the DINA Model

Peer reviewed

Direct link

de la Torre, Jimmy; Hong, Yuan; Deng, Weiling – Journal of Educational Measurement, 2010

To better understand the statistical properties of the deterministic inputs, noisy "and" gate cognitive diagnosis (DINA) model, the impact of several factors on the quality of the item parameter estimates and classification accuracy was investigated. Results of the simulation study indicate that the fully Bayes approach is most accurate when the…

Descriptors: Classification, Computation, Models, Simulation

A New Approach for Testing the Rasch Model

Peer reviewed

Direct link

Kubinger, Klaus D.; Rasch, Dieter; Yanagida, Takuya – Educational Research and Evaluation, 2011

Though calibration of an achievement test within psychological and educational context is very often carried out by the Rasch model, data sampling is hardly designed according to statistical foundations. However, Kubinger, Rasch, and Yanagida (2009) recently suggested an approach for the determination of sample size according to a given Type I and…

Descriptors: Sample Size, Simulation, Testing, Achievement Tests

Formulating the Rasch Differential Item Functioning Model under the Marginal Maximum Likelihood Estimation Context and Its Comparison with Mantel-Haenszel Procedure in Short Test and Small Sample Conditions

Peer reviewed

Direct link

Paek, Insu; Wilson, Mark – Educational and Psychological Measurement, 2011

This study elaborates the Rasch differential item functioning (DIF) model formulation under the marginal maximum likelihood estimation context. Also, the Rasch DIF model performance was examined and compared with the Mantel-Haenszel (MH) procedure in small sample and short test length conditions through simulations. The theoretically known…

Descriptors: Test Bias, Test Length, Statistical Inference, Geometric Concepts

Differentiating Categories and Dimensions: Evaluating the Robustness of Taxometric Analyses

Peer reviewed

Direct link

Ruscio, John; Kaczetow, Walter – Multivariate Behavioral Research, 2009

Interest in modeling the structure of latent variables is gaining momentum, and many simulation studies suggest that taxometric analysis can validly assess the relative fit of categorical and dimensional models. The generation and parallel analysis of categorical and dimensional comparison data sets reduces the subjectivity required to interpret…

Descriptors: Classification, Models, Comparative Analysis, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2

Barnes, Tiffany, Ed.	2
Rupp, Andre A.	2
Ruscio, John	2
Ames, Allison	1
Amir, Nader	1
Bau, Jinn Jonp	1
Beach, Steven R. H.	1
Cassiday, Kristina R.	1
Chi, Min, Ed.	1
Cho, Youngmi	1
Crawford, Brandon	1
Cui, Ying	1
Deng, Nina	1
Deng, Weiling	1
Desmarais, Michel, Ed.	1
Feng, Mingyu, Ed.	1
Finkelman, Matthew David	1
Fried, J.B.	1
Gierl, Mark	1
González-Brenes, José P.	1
Grund, Simon	1
Guo, Qi	1
Hancock, Gregory R.	1
Harring, Jeffrey R.	1
More ▼