ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Error of Measurement	7
Statistical Distributions	7
Computation	4
Sample Size	4
Probability	3
Scores	3
Computer Simulation	2
Equated Scores	2
Equations (Mathematics)	2
Guidelines	2
Item Response Theory	2
Models	2
Statistical Analysis	2
Academic Achievement	1
Achievement Tests	1
Bayesian Statistics	1
Bias	1
Classification	1
Effect Size	1
Goodness of Fit	1
Intervals	1
Mathematical Formulas	1
Methods	1
Monte Carlo Methods	1
Prediction	1
More ▼

Source

Journal of Educational and…

Author

Brennan, Robert L.	1
Cope, Ronald T.	1
Kolen, Michael J.	1
Kong, Nan	1
Lee, Won-Chan	1
Maxwell, Scott	1
Reardon, Sean F.	1
Shear, Benjamin R.	1
Sinharay, Sandip	1
Wallin, Gabriel	1
Wiberg, Marie	1
Yuan, Ke-Hai	1
Zeng, Lingjia	1
von Davier, Alina A.	1
More ▼

Publication Type

Journal Articles	7
Reports - Evaluative	3
Reports - Research	3
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions

Peer reviewed
PDF on ERIC

Download full text

Direct link

Shear, Benjamin R.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2021

This article describes an extension to the use of heteroskedastic ordered probit (HETOP) models to estimate latent distributional parameters from grouped, ordered-categorical data by pooling across multiple waves of data. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in…

Descriptors: Statistical Analysis, Computation, Regression (Statistics), Sample Size

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Standard Error of Linear Equating for the Counterbalanced Design.

Peer reviewed

Zeng, Lingjia; Cope, Ronald T. – Journal of Educational and Behavioral Statistics, 1995

Large-sample standard errors of linear equating for the counterbalanced design are derived using the general delta method. Computer simulations found that standard errors derived without the normality assumption were more accurate than those derived with the normality assumption in a large sample with moderately skewed score distributions. (SLD)

Descriptors: Computer Simulation, Error of Measurement, Research Design, Sample Size

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Direct link

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

A Unified Approach to Linear Equating for the Nonequivalent Groups Design

Peer reviewed

Direct link

von Davier, Alina A.; Kong, Nan – Journal of Educational and Behavioral Statistics, 2005

This article describes a new, unified framework for linear equating in a non-equivalent groups anchor test (NEAT) design. The authors focus on three methods for linear equating in the NEAT design--Tucker, Levine observed-score, and chain--and develop a common parameterization that shows that each particular equating method is a special case of the…

Descriptors: Equations (Mathematics), Sample Size, Statistical Distributions, Error of Measurement

On the Post Hoc Power in Testing Mean Differences

Peer reviewed

Direct link

Yuan, Ke-Hai; Maxwell, Scott – Journal of Educational and Behavioral Statistics, 2005

Retrospective or post hoc power analysis is recommended by reviewers and editors of many journals. Little literature has been found that gave a serious study of the post hoc power. When the sample size is large, the observed effect size is a good estimator of the true power. This article studies whether such a power estimator provides valuable…

Descriptors: Effect Size, Computation, Monte Carlo Methods, Bias