ERIC - Search Results

Showing all 6 results

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

Practical Issues in Estimating Achievement Gaps from Coarsened Data

Peer reviewed

Reardon, Sean F.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2015

In an earlier paper, we presented methods for estimating achievement gaps when test scores are coarsened into a small number of ordered categories, preventing fine-grained distinctions between individual scores. We demonstrated that gaps can nonetheless be estimated with minimal bias across a broad range of simulated and real coarsened data…

Descriptors: Achievement Gap, Performance Factors, Educational Practices, Scores

Sampling Variability and Axioms of Classical Test Theory

Peer reviewed

Zimmerman, Donald W. – Journal of Educational and Behavioral Statistics, 2011

Many well-known equations in classical test theory are mathematical identities in populations of individuals but not in random samples from those populations. First, test scores are subject to the same sampling error that is familiar in statistical estimation and hypothesis testing. Second, the assumptions made in derivation of formulas in test…

Descriptors: Test Theory, Equations (Mathematics), Scores, Sampling

Statistical Power for Random Assignment Evaluations of Education Programs

Peer reviewed

Schochet, Peter Z. – Journal of Educational and Behavioral Statistics, 2008

This article examines theoretical and empirical issues related to the statistical power of impact estimates for experimental evaluations of education programs. The author considers designs where random assignment is conducted at the school, classroom, or student level, and employs a unified analytic framework using statistical methods from the…

Descriptors: Elementary School Students, Research Design, Standardized Tests, Program Evaluation

A Note on the Interpretation of the Paired-Samples "t" Test. Teacher's Corner.

Peer reviewed

Zimmerman, Donald W. – Journal of Educational and Behavioral Statistics, 1997

Paired-samples experimental designs are appropriate and widely used when there is a natural correspondence or pairing of scores. However, researchers must not fail to consider the implications of undetected correlation between supposedly independent samples in the absence of explicit pairing. (SLD)

Descriptors: Comparative Analysis, Correlation, Experiments, Research Design
