ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	15

Descriptor

Classification	15
Error of Measurement	15
Statistical Bias	15
Accuracy	5
Computation	5
Evaluation Methods	5
Regression (Statistics)	5
Sample Size	5
Comparative Analysis	4
Monte Carlo Methods	4
Statistical Analysis	3
Statistical Inference	3
Academic Achievement	2
Achievement Rating	2
Effect Size	2
Factor Analysis	2
Goodness of Fit	2
Item Analysis	2
Item Response Theory	2
Models	2
Scores	2
Simulation	2
Test Items	2
Achievement	1
Achievement Gains	1
More ▼

Source

Educational and Psychological…	2
Journal of Educational…	2
Journal of Experimental…	2
Structural Equation Modeling:…	2
Applied Measurement in…	1
Group of Eight (NJ1)	1
Journal of Educational and…	1
Journal of Research on…	1
Journal of School Choice	1
Journal of Statistics and…	1
Measurement:…	1
More ▼

Publication Type

Journal Articles	14
Reports - Research	11
Reports - Evaluative	3
Reports - Descriptive	1

Education Level

Higher Education	2
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Asia	1
Australia	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Program for International…	1
Progress in International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Bias-Adjusted Three-Step Multilevel Latent Class Modeling with Covariates

Peer reviewed

Direct link

Johan Lyrvall; Zsuzsa Bakk; Jennifer Oser; Roberto Di Mari – Structural Equation Modeling: A Multidisciplinary Journal, 2024

We present a bias-adjusted three-step estimation approach for multilevel latent class models (LC) with covariates. The proposed approach involves (1) fitting a single-level measurement model while ignoring the multilevel structure, (2) assigning units to latent classes, and (3) fitting the multilevel model with the covariates while controlling for…

Descriptors: Hierarchical Linear Modeling, Statistical Bias, Error of Measurement, Simulation

Misclassification Error, Binary Regression Bias, and Reliability in Multidimensional Poverty Measurement: An Estimation Approach Based on Bayesian Modelling

Peer reviewed

Direct link

Najera, Hector – Measurement: Interdisciplinary Research and Perspectives, 2023

Measurement error affects the quality of population orderings of an index and, hence, increases the misclassification of the poor and the non-poor groups and affects statistical inferences from binary regression models. Hence, the conclusions about the extent, profile, and distribution of poverty are likely to be misleading. However, the size and…

Descriptors: Poverty, Error of Measurement, Classification, Statistical Inference

Classification Consistency and Accuracy with Atypical Score Distributions

Peer reviewed

Direct link

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2020

The current study aims to evaluate the performance of three non-IRT procedures (i.e., normal approximation, Livingston-Lewis, and compound multinomial) for estimating classification indices when the observed score distribution shows atypical patterns: (a) bimodality, (b) structural (i.e., systematic) bumpiness, or (c) structural zeros (i.e., no…

Descriptors: Classification, Accuracy, Scores, Cutting Scores

Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions

Peer reviewed
PDF on ERIC

Download full text

Direct link

Shear, Benjamin R.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2021

This article describes an extension to the use of heteroskedastic ordered probit (HETOP) models to estimate latent distributional parameters from grouped, ordered-categorical data by pooling across multiple waves of data. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in…

Descriptors: Statistical Analysis, Computation, Regression (Statistics), Sample Size

Impact of DIF on General Factor Mean Comparisons for Bifactor, Ordinal Data

Peer reviewed

Direct link

Liu, Yixing; Thompson, Marilyn S. – Journal of Experimental Education, 2022

A simulation study was conducted to explore the impact of differential item functioning (DIF) on general factor difference estimation for bifactor, ordinal data. Common analysis misspecifications in which the generated bifactor data with DIF were fitted using models with equality constraints on noninvariant item parameters were compared under data…

Descriptors: Comparative Analysis, Item Analysis, Sample Size, Error of Measurement

Might Temporal Logic Improve the Specification of Directed Acyclic Graphs (DAGs)?

Peer reviewed

Direct link

Ellison, George T. H. – Journal of Statistics and Data Science Education, 2021

Temporality-driven covariate classification had limited impact on: the specification of directed acyclic graphs (DAGs) by 85 novice analysts (medical undergraduates); or the risk of bias in DAG-informed multivariable models designed to generate causal inference from observational data. Only 71 students (83.5%) managed to complete the…

Descriptors: Statistics Education, Medical Education, Undergraduate Students, Graphs

An Unbiased Estimate of Global Interrater Agreement

Peer reviewed

Direct link

Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017

Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…

Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy

Fitting Large Factor Analysis Models with Ordinal Data

Peer reviewed

Direct link

DiStefano, Christine; McDaniel, Heather L.; Zhang, Liyun; Shi, Dexin; Jiang, Zhehan – Educational and Psychological Measurement, 2019

A simulation study was conducted to investigate the model size effect when confirmatory factor analysis (CFA) models include many ordinal items. CFA models including between 15 and 120 ordinal items were analyzed with mean- and variance-adjusted weighted least squares to determine how varying sample size, number of ordered categories, and…

Descriptors: Factor Analysis, Effect Size, Data, Sample Size

Estimating Causal Effects of Education Interventions Using a Two-Rating Regression Discontinuity Design: Lessons from a Simulation Study and an Application

Peer reviewed

Direct link

Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Cimpian, Joseph R. – Journal of Research on Educational Effectiveness, 2017

A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…

Descriptors: Regression (Statistics), Intervention, Quasiexperimental Design, Simulation

Comparisons of Improvement-Over-Chance Effect Sizes for Two Groups under Variance Heterogeneity and Prior Probabilities

Peer reviewed

Direct link

Henson, Robin K.; Natesan, Prathiba; Axelson, Erika D. – Journal of Experimental Education, 2014

The authors examined the distributional properties of 3 improvement-over-chance, I, effect sizes each derived from linear and quadratic predictive discriminant analysis and from logistic regression analysis for the 2-group univariate classification. These 3 classification methods (3 levels) were studied under varying levels of data conditions,…

Descriptors: Effect Size, Probability, Comparative Analysis, Classification

Sensitivity of Achievement Estimation to Conditioning Model Misclassification

Peer reviewed

Direct link

Rutkowski, Leslie – Applied Measurement in Education, 2014

Large-scale assessment programs such as the National Assessment of Educational Progress (NAEP), Trends in International Mathematics and Science Study (TIMSS), and Programme for International Student Assessment (PISA) use a sophisticated assessment administration design called matrix sampling that minimizes the testing burden on individual…

Descriptors: Measurement, Testing, Item Sampling, Computation

A Multilevel Testlet Model for Dual Local Dependence

Peer reviewed

Direct link

Jiao, Hong; Kamata, Akihito; Wang, Shudong; Jin, Ying – Journal of Educational Measurement, 2012

The applications of item response theory (IRT) models assume local item independence and that examinees are independent of each other. When a representative sample for psychometric analysis is selected using a cluster sampling method in a testlet-based assessment, both local item dependence and local person dependence are likely to be induced.…

Descriptors: Item Response Theory, Test Items, Markov Processes, Monte Carlo Methods

Wise and Proper Use of National Assessment of Educational Progress (NAEP) Data

Peer reviewed

Direct link

Innes, Richard G. – Journal of School Choice, 2012

This article provides examples of how serious misconceptions can result when only "all student" scores from the National Assessment of Educational Progress (NAEP) are used for simplistic state-to-state comparisons. Suggestions for better treatment are presented. The article also compares Kentucky's eighth grade EXPLORE testing to NAEP…

Descriptors: National Competency Tests, Scoring, Misconceptions, Academic Achievement

World University Rankings: Ambiguous Signals. Go8 Backgrounder 30

Download full text

Group of Eight (NJ1), 2012

The current main world university rankings broadly group the leading research universities of nations. Australia's Go8 universities are generally within the top 250 ranked universities, with several institutions in the top 50-100 on some measures. This recognition is commendable, however imperfect the individual rankings may be. Use is made of…

Descriptors: Evaluation Methods, Foreign Countries, Public Policy, Research Universities

Is Parceling Really Necessary? A Comparison of Results from Item Parceling and Categorical Variable Methodology

Peer reviewed

Direct link

Bandalos, Deborah L. – Structural Equation Modeling: A Multidisciplinary Journal, 2008

This study examined the efficacy of 4 different parceling methods for modeling categorical data with 2, 3, and 4 categories and with normal, moderately nonnormal, and severely nonnormal distributions. The parceling methods investigated were isolated parceling in which items were parceled with other items sharing the same source of variance, and…

Descriptors: Structural Equation Models, Computation, Goodness of Fit, Classification

Reardon, Sean F.	2
Axelson, Erika D.	1
Bandalos, Deborah L.	1
Bloom, Howard S.	1
Cimpian, Joseph R.	1
Cousineau, Denis	1
DiStefano, Christine	1
Ellison, George T. H.	1
Henson, Robin K.	1
Innes, Richard G.	1
Jennifer Oser	1
Jiang, Zhehan	1
Jiao, Hong	1
Jin, Ying	1
Johan Lyrvall	1
Kamata, Akihito	1
Kim, Stella Y.	1
Laurencelle, Louis	1
Lee, Won-Chan	1
Liu, Yixing	1
McDaniel, Heather L.	1
Najera, Hector	1
Natesan, Prathiba	1
Porter, Kristin E.	1
Roberto Di Mari	1
More ▼