ERIC - Search Results

Showing 1 to 15 of 20 results

Implementing a Standardized Effect Size in the POLYSIBTEST Procedure

Peer reviewed

Weese, James D.; Turner, Ronna C.; Liang, Xinya; Ames, Allison; Crawford, Brandon – Educational and Psychological Measurement, 2023

A study was conducted to implement the use of a standardized effect size and corresponding classification guidelines for polytomous data with the POLYSIBTEST procedure and compare those guidelines with prior recommendations. Two simulation studies were included. The first identifies new unstandardized test heuristics for classifying moderate and…

Descriptors: Effect Size, Classification, Guidelines, Statistical Analysis

A Comparison of Label Switching Algorithms in the Context of Growth Mixture Models

Peer reviewed

Cassiday, Kristina R.; Cho, Youngmi; Harring, Jeffrey R. – Educational and Psychological Measurement, 2021

Simulation studies involving mixture models inevitably aggregate parameter estimates and other output across numerous replications. A primary issue that arises in these methodological investigations is label switching. The current study compares several label switching corrections that are commonly used when dealing with mixture models. A growth…

Descriptors: Probability, Models, Simulation, Mathematics

A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning

Peer reviewed

Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…

Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

Investigation of Rater Effects Using Social Network Analysis and Exponential Random Graph Models

Peer reviewed

Lamprianou, Iasonas – Educational and Psychological Measurement, 2018

It is common practice for assessment programs to organize qualifying sessions during which the raters (often known as "markers" or "judges") demonstrate their consistency before operational rating commences. Because of the high-stakes nature of many rating activities, the research community tends to continuously explore new…

Descriptors: Social Networks, Network Analysis, Comparative Analysis, Innovation

Student Assessment Opt Out and the Impact on Value-Added Measures of Teacher Quality

Peer reviewed

Marland, Joshua; Harrick, Matthew; Sireci, Stephen G. – Educational and Psychological Measurement, 2020

Student assessment nonparticipation (or opt out) has increased substantially in K-12 schools in states across the country. This increase in opt out has the potential to impact achievement and growth (or value-added) measures used for educator and institutional accountability. In this simulation study, we investigated the extent to which…

Descriptors: Value Added Models, Teacher Effectiveness, Teacher Evaluation, Elementary Secondary Education

Unrestricted Mixture Models for Class Identification in Growth Mixture Modeling

Peer reviewed

Liu, Min; Hancock, Gregory R. – Educational and Psychological Measurement, 2014

Growth mixture modeling has gained much attention in applied and methodological social science research recently, but the selection of the number of latent classes for such models remains a challenging issue, especially when the assumption of proper model specification is violated. The current simulation study compared the performance of a linear…

Descriptors: Models, Classification, Simulation, Comparative Analysis

The Impact of Ignoring the Level of Nesting Structure in Nonparametric Multilevel Latent Class Models

Peer reviewed

Park, Jungkyu; Yu, Hsiu-Ting – Educational and Psychological Measurement, 2016

The multilevel latent class model (MLCM) is a multilevel extension of a latent class model (LCM) that is used to analyze data with a nested structure. The nonparametric version of an MLCM assumes a discrete latent variable at a higher-level nesting structure to account for the dependency among observations nested within a higher-level unit. In…

Descriptors: Hierarchical Linear Modeling, Nonparametric Statistics, Data Analysis, Simulation

Panel Design Variations in the Multistage Test Using the Mixed-Format Tests

Peer reviewed

Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G.; Park, Ryoungsun – Educational and Psychological Measurement, 2012

This study compared various panel designs of the multistage test (MST) using mixed-format tests in the context of classification testing. Simulations varied the design of the first-stage module. The first stage was constructed according to three levels of test information functions (TIFs) with three different TIF centers. Additional computerized…

Descriptors: Test Format, Comparative Analysis, Computer Assisted Testing, Adaptive Testing

Item Selection for the Development of Parallel Forms from an IRT-Based Seed Test Using a Sampling and Classification Approach

Peer reviewed

Chen, Pei-Hua; Chang, Hua-Hua; Wu, Haiyan – Educational and Psychological Measurement, 2012

Two sampling-and-classification-based procedures were developed for automated test assembly: the Cell Only and the Cell and Cube methods. A simulation study based on a 540-item bank was conducted to compare the performance of the procedures with the performance of a mixed-integer programming (MIP) method for assembling multiple parallel test…

Descriptors: Test Items, Selection, Test Construction, Item Response Theory

Performance of the S-χ² Statistic for Full-Information Bifactor Models

Peer reviewed

Li, Ying; Rupp, Andre A. – Educational and Psychological Measurement, 2011

This study investigated the Type I error rate and power of the multivariate extension of the S-χ² statistic using unidimensional and multidimensional item response theory (UIRT and MIRT, respectively) models as well as full-information bifactor (FI-bifactor) models through simulation. Manipulated factors included test length, sample…

Descriptors: Test Length, Item Response Theory, Statistical Analysis, Error Patterns

Polytomous Adaptive Classification Testing: Effects of Item Pool Size, Test Termination Criterion, and Number of Cutscores

Peer reviewed

Gnambs, Timo; Batinic, Bernad – Educational and Psychological Measurement, 2011

Computer-adaptive classification tests focus on classifying respondents in different proficiency groups (e.g., for pass/fail decisions). To date, adaptive classification testing has been dominated by research on dichotomous response formats and classifications in two groups. This article extends this line of research to polytomous classification…

Descriptors: Test Length, Computer Assisted Testing, Classification, Test Items

Formulating the Rasch Differential Item Functioning Model under the Marginal Maximum Likelihood Estimation Context and Its Comparison with Mantel-Haenszel Procedure in Short Test and Small Sample Conditions

Peer reviewed

Paek, Insu; Wilson, Mark – Educational and Psychological Measurement, 2011

This study elaborates the Rasch differential item functioning (DIF) model formulation under the marginal maximum likelihood estimation context. Also, the Rasch DIF model performance was examined and compared with the Mantel-Haenszel (MH) procedure in small sample and short test length conditions through simulations. The theoretically known…

Descriptors: Test Bias, Test Length, Statistical Inference, Geometric Concepts

Computerized Classification Testing under the One-Parameter Logistic Response Model with Ability-Based Guessing

Peer reviewed

Wang, Wen-Chung; Huang, Sheng-Yun – Educational and Psychological Measurement, 2011

The one-parameter logistic model with ability-based guessing (1PL-AG) has been recently developed to account for the effect of ability on guessing behavior in multiple-choice items. In this study, the authors developed algorithms for computerized classification testing under the 1PL-AG and conducted a series of simulations to evaluate their…

Descriptors: Computer Assisted Testing, Classification, Item Analysis, Probability

Item Selection in Computerized Classification Testing

Peer reviewed

Thompson, Nathan A. – Educational and Psychological Measurement, 2009

Several alternatives for item selection algorithms based on item response theory in computerized classification testing (CCT) have been suggested, with no conclusive evidence on the substantial superiority of a single method. It is argued that the lack of sizable effect is because some of the methods actually assess items very similarly through…

Descriptors: Item Response Theory, Psychoeducational Methods, Cutting Scores, Simulation
