ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	16

Descriptor

Classification	18
Error of Measurement	18
Evaluation Methods	18
Accuracy	6
Comparative Analysis	5
Correlation	5
Statistical Bias	5
Sample Size	4
Scores	4
Simulation	4
Statistical Analysis	4
Computation	3
Identification	3
Item Response Theory	3
Measures (Individuals)	3
Monte Carlo Methods	3
Validity	3
Control Groups	2
Cutting Scores	2
Diagnostic Tests	2
Educational Indicators	2
Educational Policy	2
Educational Research	2
Evaluation Criteria	2
Evidence	2
More ▼

Source

Structural Equation Modeling:…	2
Developmental Medicine &…	1
ETS Research Report Series	1
Educational and Psychological…	1
Group of Eight (NJ1)	1
International Journal of…	1
Journal of Educational…	1
Journal of Research on…	1
Online Submission	1
ProQuest LLC	1
Program on Education Policy…	1
Psychological Methods	1
Sociological Methods &…	1
Suicide and Life-Threatening…	1
Teachers College Record	1
What Works Clearinghouse	1
More ▼

Publication Type

Journal Articles	12
Reports - Evaluative	7
Reports - Research	7
Reports - Descriptive	2
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	1
High Schools	1
Higher Education	1

Audience

Researchers

Location

Asia	1
Australia	1
California (Stanford)	1
Florida	1
Florida (Miami)	1

Laws, Policies, & Programs

Assessments and Surveys

Florida Comprehensive…

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Bias-Adjusted Three-Step Multilevel Latent Class Modeling with Covariates

Peer reviewed

Direct link

Johan Lyrvall; Zsuzsa Bakk; Jennifer Oser; Roberto Di Mari – Structural Equation Modeling: A Multidisciplinary Journal, 2024

We present a bias-adjusted three-step estimation approach for multilevel latent class models (LC) with covariates. The proposed approach involves (1) fitting a single-level measurement model while ignoring the multilevel structure, (2) assigning units to latent classes, and (3) fitting the multilevel model with the covariates while controlling for…

Descriptors: Hierarchical Linear Modeling, Statistical Bias, Error of Measurement, Simulation

Comparing Mimic and Mimic-Interaction to Alignment Methods for Investigating Measurement Invariance Concerning a Continuous Violator

Peer reviewed

Direct link

Yuanfang Liu; Mark H. C. Lai; Ben Kelcey – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Measurement invariance holds when a latent construct is measured in the same way across different levels of background variables (continuous or categorical) while controlling for the true value of that construct. Using Monte Carlo simulation, this paper compares the multiple indicators, multiple causes (MIMIC) model and MIMIC-interaction to a…

Descriptors: Classification, Accuracy, Error of Measurement, Correlation

Classification Consistency and Accuracy with Atypical Score Distributions

Peer reviewed

Direct link

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2020

The current study aims to evaluate the performance of three non-IRT procedures (i.e., normal approximation, Livingston-Lewis, and compound multinomial) for estimating classification indices when the observed score distribution shows atypical patterns: (a) bimodality, (b) structural (i.e., systematic) bumpiness, or (c) structural zeros (i.e., no…

Descriptors: Classification, Accuracy, Scores, Cutting Scores

An Unbiased Estimate of Global Interrater Agreement

Peer reviewed

Direct link

Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017

Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…

Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy

Estimating Causal Effects of Education Interventions Using a Two-Rating Regression Discontinuity Design: Lessons from a Simulation Study and an Application

Peer reviewed

Direct link

Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Cimpian, Joseph R. – Journal of Research on Educational Effectiveness, 2017

A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…

Descriptors: Regression (Statistics), Intervention, Quasiexperimental Design, Simulation

Cross-Battery Assessment? XBA PSW? A Case of Mistaken Identity: A Commentary on Kranzler and Colleagues' "Classification Agreement Analysis of Cross-Battery Assessment in the Identification of Specific Learning Disorders in Children and Youth"

Peer reviewed

Direct link

Flanagan, Dawn P.; Schneider, W. Joel – International Journal of School & Educational Psychology, 2016

When education works, it creates productive, innovative citizens eager to contribute to a well-functioning democracy. In contrast, educational failure has lifelong consequences, with some individuals experiencing decades of preventable hardship. Dawn Flanagan and Joel Schneider write in this response that, like Kranzler, Floyd, Benson, Zabowski,…

Descriptors: Learning Disabilities, Identification, Diagnostic Tests, Criticism

Bayesian Analysis and Design for Joint Modeling of Two Binary Responses with Misclassification

Peer reviewed

Direct link

Stamey, James D.; Beavers, Daniel P.; Sherr, Michael E. – Sociological Methods & Research, 2017

Survey data are often subject to various types of errors such as misclassification. In this article, we consider a model where interest is simultaneously in two correlated response variables and one is potentially subject to misclassification. A motivating example of a recent study of the impact of a sexual education course for adolescents is…

Descriptors: Bayesian Statistics, Classification, Models, Correlation

A Monte Carlo Simulation Comparing the Statistical Precision of Two High-Stakes Teacher Evaluation Methods: A Value-Added Model and a Composite Measure

Direct link

Spencer, Bryden – ProQuest LLC, 2016

Value-added models are a class of growth models used in education to assign responsibility for student growth to teachers or schools. For value-added models to be used fairly, sufficient statistical precision is necessary for accurate teacher classification. Previous research indicated precision below practical limits. An alternative approach has…

Descriptors: Monte Carlo Methods, Comparative Analysis, Accuracy, High Stakes Tests

A 2 x 2 Taxonomy of Multilevel Latent Contextual Models: Accuracy-Bias Trade-Offs in Full and Partial Error Correction Models

Peer reviewed

Direct link

Ludtke, Oliver; Marsh, Herbert W.; Robitzsch, Alexander; Trautwein, Ulrich – Psychological Methods, 2011

In multilevel modeling, group-level variables (L2) for assessing contextual effects are frequently generated by aggregating variables from a lower level (L1). A major problem of contextual analyses in the social sciences is that there is no error-free measurement of constructs. In the present article, 2 types of error occurring in multilevel data…

Descriptors: Simulation, Educational Psychology, Social Sciences, Measurement

Assessing Tradeoffs between Observational and Experimental Designs for Charter School Research. Program on Education Policy and Governance Working Papers Series. PEPG 15-04

Download full text

Ackerman, Matthew; Egalite, Anna J. – Program on Education Policy and Governance, 2015

When lotteries are infeasible, researchers must rely on observational methods to estimate charter effectiveness at raising student test scores. Considerable attention has been paid to observational studies by the Stanford Center for Research on Education Outcomes (CREDO), which have analyzed charter performance in 27 states. However, the…

Descriptors: Charter Schools, Observation, Special Education, Lunch Programs

What Works Clearinghouse Procedures and Standards Handbook, Version 3.0

Peer reviewed
PDF on ERIC

Download full text

What Works Clearinghouse, 2014

This "What Works Clearinghouse Procedures and Standards Handbook (Version 3.0)" provides a detailed description of the standards and procedures of the What Works Clearinghouse (WWC). The remaining chapters of this Handbook are organized to take the reader through the basic steps that the WWC uses to develop a review protocol, identify…

Descriptors: Educational Research, Guides, Intervention, Classification

World University Rankings: Ambiguous Signals. Go8 Backgrounder 30

Download full text

Group of Eight (NJ1), 2012

The current main world university rankings broadly group the leading research universities of nations. Australia's Go8 universities are generally within the top 250 ranked universities, with several institutions in the top 50-100 on some measures. This recognition is commendable, however imperfect the individual rankings may be. Use is made of…

Descriptors: Evaluation Methods, Foreign Countries, Public Policy, Research Universities

Rating Scales for Dystonia in Cerebral Palsy: Reliability and Validity

Peer reviewed

Direct link

Monbaliu, E.; Ortibus, E.; Roelens, F.; Desloovere, K.; Deklerck, J.; Prinzie, P.; De Cock, P.; Feys, H. – Developmental Medicine & Child Neurology, 2010

Aim: This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). Method: Three raters independently scored videotapes of 10 patients (five males, five females;…

Descriptors: Content Validity, Cerebral Palsy, Validity, Interrater Reliability

A Review of Recent Developments in Differential Item Functioning. Research Report. ETS RR-08-43

Peer reviewed
PDF on ERIC

Download full text

Mapuranga, Raymond; Dorans, Neil J.; Middleton, Kyndra – ETS Research Report Series, 2008

In many practical settings, essentially the same differential item functioning (DIF) procedures have been in use since the late 1980s. Since then, examinee populations have become more heterogeneous, and tests have included more polytomously scored items. This paper summarizes and classifies new DIF methods and procedures that have appeared since…

Descriptors: Test Bias, Educational Development, Evaluation Methods, Statistical Analysis

Strengthening the Validity of Population-Based Suicide Rate Comparisons: An Illustration Using U.S. Military and Civilian Data

Peer reviewed

Direct link

Eaton, Karen M.; Messer, Stephen C.; Garvey Wilson, Abigail L.; Hoge, Charles W. – Suicide and Life-Threatening Behavior, 2006

The objectives of this study were to generate precise estimates of suicide rates in the military while controlling for factors contributing to rate variability such as demographic differences and classification bias, and to develop a simple methodology for the determination of statistically derived thresholds for detecting significant rate…

Descriptors: Suicide, Mortality Rate, Comparative Analysis, Validity

Previous Page | Next Page »

Pages: 1 | 2

Abedi, Jamal	1
Ackerman, Matthew	1
Beavers, Daniel P.	1
Ben Kelcey	1
Bloom, Howard S.	1
Cimpian, Joseph R.	1
Cousineau, Denis	1
De Cock, P.	1
Deklerck, J.	1
Desloovere, K.	1
Dorans, Neil J.	1
Eaton, Karen M.	1
Egalite, Anna J.	1
Feys, H.	1
Flanagan, Dawn P.	1
Garvey Wilson, Abigail L.	1
Hoge, Charles W.	1
Jennifer Oser	1
Johan Lyrvall	1
Karkee, Thakur B.	1
Kim, Stella Y.	1
Laurencelle, Louis	1
Lee, Won-Chan	1
Ludtke, Oliver	1
Mapuranga, Raymond	1
More ▼