ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	9

Descriptor

Error of Measurement	10
Models	10
Robustness (Statistics)	10
Measurement Techniques	3
Statistical Distributions	3
Academic Achievement	2
Achievement Gains	2
Computation	2
Educational Research	2
Equated Scores	2
Evaluation Methods	2
Foreign Countries	2
Item Response Theory	2
Longitudinal Studies	2
Predictor Variables	2
Research Methodology	2
School Effectiveness	2
Scores	2
Statistical Analysis	2
Ability Identification	1
Accountability	1
Achievement Rating	1
Achievement Tests	1
Analysis of Variance	1
Barriers	1
More ▼

Source

British Educational Research…	1
Centre for Economic…	1
Developmental Psychology	1
Grantee Submission	1
Journal of Educational…	1
Journal of Educational and…	1
Multivariate Behavioral…	1
Phi Delta Kappan	1
Psychological Methods	1

Author

Tong, Xin	2
Zhang, Zhiyong	2
Beasley, T. Mark	1
Brosseau-Liard, Patricia E.	1
Carl Westine	1
Foster, E. Michael	1
Gorard, Stephen	1
Harris, Douglas N.	1
Michelle Boyer	1
Murphy, Richard	1
Rhemtulla, Mijke	1
Savalei, Victoria	1
Stella Y. Kim	1
Tong Wu	1
Wallin, Gabriel	1
Weinhardt, Felix	1
Wiberg, Marie	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	5
Reports - Evaluative	4
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Adult Education	2
Elementary Secondary Education	2
Grade 10	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Location

United Kingdom (England)

Laws, Policies, & Programs

Assessments and Surveys

National Longitudinal Survey…	2
Peabody Individual…	2
Child Behavior Checklist	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Direct link

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

Robust Bayesian Approaches in Growth Curve Modeling: Using Student's "t" Distributions versus a Semiparametric Method

Peer reviewed
PDF on ERIC

Download full text

Direct link

Tong, Xin; Zhang, Zhiyong – Grantee Submission, 2020

Despite broad applications of growth curve models, few studies have dealt with a practical issue -- nonnormality of data. Previous studies have used Student's "t" distributions to remedy the nonnormal problems. In this study, robust distributional growth curve models are proposed from a semiparametric Bayesian perspective, in which…

Descriptors: Robustness (Statistics), Bayesian Statistics, Models, Error of Measurement

Diagnostics of Robust Growth Curve Modeling Using Student's "t" Distribution

Peer reviewed

Direct link

Tong, Xin; Zhang, Zhiyong – Multivariate Behavioral Research, 2012

Growth curve models with different types of distributions of random effects and of intraindividual measurement errors for robust analysis are compared. After demonstrating the influence of distribution specification on parameter estimation, 3 methods for diagnosing the distributions for both random effects and intraindividual measurement errors…

Descriptors: Models, Robustness (Statistics), Statistical Analysis, Error of Measurement

The U-Shaped Relationship between Complexity and Usefulness: A Commentary

Peer reviewed

Direct link

Foster, E. Michael – Developmental Psychology, 2010

The relationship between complexity and usefulness can be captured by a U-shaped curve. This comment explores that relationship. Complexity may be useful for one of the main aims of developmental psychology (causal inference) but not for another (description of developmental phenomena). Currently, developmentalists conduct complex analyses that…

Descriptors: Inferences, Developmental Psychology, Models, Methods

When Can Categorical Variables Be Treated as Continuous? A Comparison of Robust Continuous and Categorical SEM Estimation Methods under Suboptimal Conditions

Peer reviewed

Direct link

Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012

A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…

Descriptors: Factor Analysis, Computation, Simulation, Sample Size

Clear Away the Smoke and Mirrors of Value-Added

Direct link

Harris, Douglas N. – Phi Delta Kappan, 2010

Current value-added models for teacher accountability are better than models based only on student achievement, but they have their weakness. They are subject to systematic and random error, as are all measures, and there are concerns about the tests used for the measurements. However, value-added models are better than the alternatives at the…

Descriptors: School Effectiveness, Error of Measurement, Achievement Gains, Academic Achievement

The Importance of Rank Position. CEP Discussion Paper No. 1241

Download full text

Murphy, Richard; Weinhardt, Felix – Centre for Economic Performance, 2013

We find an individual's rank within their reference group has effects on later objective outcomes. To evaluate the impact of local rank, we use a large administrative dataset tracking over two million students in England from primary through to secondary school. Academic rank within primary school has sizable, robust and significant effects on…

Descriptors: Foreign Countries, Class Rank, Progress Monitoring, Effect Size

Serious Doubts about School Effectiveness

Peer reviewed

Direct link

Gorard, Stephen – British Educational Research Journal, 2010

This paper considers the model of school effectiveness (SE) currently dominant in research, policy and practice in England (although the concerns it raises are international). It shows, principally through consideration of initial and propagated error, that SE results cannot be relied upon. By considering the residual difference between the…

Descriptors: School Effectiveness, Foreign Countries, Scores, Educational Policy

Education as a Fixed Effect Fallacy.

Beasley, T. Mark – 1994

In educational research, nonessential factors are commonly ignored and when accounted for, they are often treated statistically as fixed effects. Yet many researchers in these situations generalize their findings beyond the specific levels selected; however, the analyses may require treating the factor as a random effect. Such inappropriate…

Descriptors: Analysis of Variance, Behavioral Science Research, Educational Research, Equations (Mathematics)