ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	15

Descriptor

Classification	17
Comparative Analysis	17
Error of Measurement	17
Sample Size	6
Evaluation Methods	5
Item Analysis	5
Statistical Analysis	5
Accuracy	4
Monte Carlo Methods	4
Regression (Statistics)	4
Statistical Bias	4
Correlation	3
Discriminant Analysis	3
Item Response Theory	3
Psychometrics	3
Test Items	3
Achievement Gains	2
Achievement Tests	2
Computation	2
Control Groups	2
Decision Making	2
Demography	2
Difficulty Level	2
Educational Research	2
Effect Size	2
More ▼

Source

Journal of Experimental…	4
Educational and Psychological…	2
Applied Measurement in…	1
Applied Psychological…	1
Journal of Educational…	1
Journal of School Choice	1
ProQuest LLC	1
Program on Education Policy…	1
Research Papers in Education	1
Research in Developmental…	1
Structural Equation Modeling:…	1
Suicide and Life-Threatening…	1
What Works Clearinghouse	1
More ▼

Publication Type

Journal Articles	14
Reports - Research	11
Reports - Descriptive	2
Reports - Evaluative	2
Dissertations/Theses -…	1
Guides - Non-Classroom	1

Education Level

Elementary Secondary Education	2
Elementary Education	1
Secondary Education	1

Audience

Location

California (Stanford)	1
Florida	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

Florida Comprehensive…	1
National Assessment of…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Bias-Adjusted Three-Step Multilevel Latent Class Modeling with Covariates

Peer reviewed

Direct link

Johan Lyrvall; Zsuzsa Bakk; Jennifer Oser; Roberto Di Mari – Structural Equation Modeling: A Multidisciplinary Journal, 2024

We present a bias-adjusted three-step estimation approach for multilevel latent class models (LC) with covariates. The proposed approach involves (1) fitting a single-level measurement model while ignoring the multilevel structure, (2) assigning units to latent classes, and (3) fitting the multilevel model with the covariates while controlling for…

Descriptors: Hierarchical Linear Modeling, Statistical Bias, Error of Measurement, Simulation

Impact of DIF on General Factor Mean Comparisons for Bifactor, Ordinal Data

Peer reviewed

Direct link

Liu, Yixing; Thompson, Marilyn S. – Journal of Experimental Education, 2022

A simulation study was conducted to explore the impact of differential item functioning (DIF) on general factor difference estimation for bifactor, ordinal data. Common analysis misspecifications in which the generated bifactor data with DIF were fitted using models with equality constraints on noninvariant item parameters were compared under data…

Descriptors: Comparative Analysis, Item Analysis, Sample Size, Error of Measurement

IRT Approaches to Modeling Scores on Mixed-Format Tests

Peer reviewed

Direct link

Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020

This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…

Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests

Examining Cognitive Diagnostic Modeling in Classroom Assessment Conditions

Peer reviewed

Direct link

Paulsen, Justin; Valdivia, Dubravka Svetina – Journal of Experimental Education, 2022

Cognitive diagnostic models (CDMs) are a family of psychometric models designed to provide categorical classifications for multiple latent attributes. CDMs provide more granular evidence than other psychometric models and have potential for guiding teaching and learning decisions in the classroom. However, CDMs have primarily been conducted using…

Descriptors: Psychometrics, Classification, Teaching Methods, Learning Processes

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

The Development of MST Test Information for the Prediction of Test Performances

Peer reviewed

Direct link

Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017

The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…

Descriptors: Testing, Performance, Prediction, Error of Measurement

A Monte Carlo Simulation Comparing the Statistical Precision of Two High-Stakes Teacher Evaluation Methods: A Value-Added Model and a Composite Measure

Direct link

Spencer, Bryden – ProQuest LLC, 2016

Value-added models are a class of growth models used in education to assign responsibility for student growth to teachers or schools. For value-added models to be used fairly, sufficient statistical precision is necessary for accurate teacher classification. Previous research indicated precision below practical limits. An alternative approach has…

Descriptors: Monte Carlo Methods, Comparative Analysis, Accuracy, High Stakes Tests

Comparisons of Improvement-Over-Chance Effect Sizes for Two Groups under Variance Heterogeneity and Prior Probabilities

Peer reviewed

Direct link

Henson, Robin K.; Natesan, Prathiba; Axelson, Erika D. – Journal of Experimental Education, 2014

The authors examined the distributional properties of 3 improvement-over-chance, I, effect sizes each derived from linear and quadratic predictive discriminant analysis and from logistic regression analysis for the 2-group univariate classification. These 3 classification methods (3 levels) were studied under varying levels of data conditions,…

Descriptors: Effect Size, Probability, Comparative Analysis, Classification

An Investigation of Measurement Invariance of the Key Stage 2 National Curriculum Science Sampling Test in England

Peer reviewed

Direct link

He, Qingping; Anwyll, Steve; Glanville, Matthew; Opposs, Dennis – Research Papers in Education, 2014

Since 2010, the whole national cohort Key Stage 2 (KS2) National Curriculum test in science in England has been replaced with a sampling test taken by pupils at the age of 11 from a nationally representative sample of schools annually. The study reported in this paper compares the performance of different subgroups of the samples (classified by…

Descriptors: National Curriculum, Sampling, Foreign Countries, Factor Analysis

Assessing Tradeoffs between Observational and Experimental Designs for Charter School Research. Program on Education Policy and Governance Working Papers Series. PEPG 15-04

Download full text

Ackerman, Matthew; Egalite, Anna J. – Program on Education Policy and Governance, 2015

When lotteries are infeasible, researchers must rely on observational methods to estimate charter effectiveness at raising student test scores. Considerable attention has been paid to observational studies by the Stanford Center for Research on Education Outcomes (CREDO), which have analyzed charter performance in 27 states. However, the…

Descriptors: Charter Schools, Observation, Special Education, Lunch Programs

A Clinical Tool to Measure Trunk Control in Children with Cerebral Palsy: The Trunk Control Measurement Scale

Peer reviewed

Direct link

Heyrman, Lieve; Molenaers, Guy; Desloovere, Kaat; Verheyden, Geert; De Cat, Jos; Monbaliu, Elegast; Feys, Hilde – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011

In this study the psychometric properties of the Trunk Control Measurement Scale (TCMS) in children with cerebral palsy (CP) were examined. Twenty-six children with spastic CP (mean age 11 years 3 months, range 8-15 years; Gross Motor Function Classification System level I n = 11, level II n = 5, level III n = 10) were included in this study. To…

Descriptors: Construct Validity, Cerebral Palsy, Test Validity, Interrater Reliability

What Works Clearinghouse Procedures and Standards Handbook, Version 3.0

Peer reviewed
PDF on ERIC

Download full text

What Works Clearinghouse, 2014

This "What Works Clearinghouse Procedures and Standards Handbook (Version 3.0)" provides a detailed description of the standards and procedures of the What Works Clearinghouse (WWC). The remaining chapters of this Handbook are organized to take the reader through the basic steps that the WWC uses to develop a review protocol, identify…

Descriptors: Educational Research, Guides, Intervention, Classification

DIF Trees: Using Classification Trees to Detect Differential Item Functioning

Peer reviewed

Direct link

Vaughn, Brandon K.; Wang, Qiu – Educational and Psychological Measurement, 2010

A nonparametric tree classification procedure is used to detect differential item functioning for items that are dichotomously scored. Classification trees are shown to be an alternative procedure to detect differential item functioning other than the use of traditional Mantel-Haenszel and logistic regression analysis. A nonparametric…

Descriptors: Test Bias, Classification, Nonparametric Statistics, Regression (Statistics)

Wise and Proper Use of National Assessment of Educational Progress (NAEP) Data

Peer reviewed

Direct link

Innes, Richard G. – Journal of School Choice, 2012

This article provides examples of how serious misconceptions can result when only "all student" scores from the National Assessment of Educational Progress (NAEP) are used for simplistic state-to-state comparisons. Suggestions for better treatment are presented. The article also compares Kentucky's eighth grade EXPLORE testing to NAEP…

Descriptors: National Competency Tests, Scoring, Misconceptions, Academic Achievement

Strengthening the Validity of Population-Based Suicide Rate Comparisons: An Illustration Using U.S. Military and Civilian Data

Peer reviewed

Direct link

Eaton, Karen M.; Messer, Stephen C.; Garvey Wilson, Abigail L.; Hoge, Charles W. – Suicide and Life-Threatening Behavior, 2006

The objectives of this study were to generate precise estimates of suicide rates in the military while controlling for factors contributing to rate variability such as demographic differences and classification bias, and to develop a simple methodology for the determination of statistically derived thresholds for detecting significant rate…

Descriptors: Suicide, Mortality Rate, Comparative Analysis, Validity

Previous Page | Next Page »

Pages: 1 | 2

Abulela, Mohammed A. A.	1
Ackerman, Matthew	1
Anwyll, Steve	1
Axelson, Erika D.	1
Choi, Jiwon	1
Chung, Hyewon	1
De Cat, Jos	1
Desloovere, Kaat	1
Dodd, Barbara G.	1
Eaton, Karen M.	1
Egalite, Anna J.	1
Feys, Hilde	1
Garvey Wilson, Abigail L.	1
Glanville, Matthew	1
He, Qingping	1
Henson, Robin K.	1
Heyrman, Lieve	1
Hoge, Charles W.	1
Innes, Richard G.	1
Jennifer Oser	1
Johan Lyrvall	1
Kang, Yujin	1
Kim, Jiseon	1
Kim, Stella Y.	1
Koehly, Laura M.	1
More ▼