Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 5 |
Descriptor
Sampling | 6 |
Scores | 6 |
Experiments | 3 |
Computation | 2 |
Correlation | 2 |
Probability | 2 |
Research Design | 2 |
Research Methodology | 2 |
Sample Size | 2 |
Test Reliability | 2 |
Accuracy | 1 |
More ▼ |
Source
Journal of Educational and… | 6 |
Author
Zimmerman, Donald W. | 2 |
Chan, Wendy | 1 |
Ho, Andrew D. | 1 |
Reardon, Sean F. | 1 |
Schochet, Peter Z. | 1 |
van der Linden, Wim J. | 1 |
Publication Type
Journal Articles | 6 |
Reports - Evaluative | 3 |
Reports - Research | 3 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Audience
Location
Indiana | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018
Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…
Descriptors: Computation, Generalization, Probability, Sample Size
Reardon, Sean F.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2015
In an earlier paper, we presented methods for estimating achievement gaps when test scores are coarsened into a small number of ordered categories, preventing fine-grained distinctions between individual scores. We demonstrated that gaps can nonetheless be estimated with minimal bias across a broad range of simulated and real coarsened data…
Descriptors: Achievement Gap, Performance Factors, Educational Practices, Scores
Zimmerman, Donald W. – Journal of Educational and Behavioral Statistics, 2011
Many well-known equations in classical test theory are mathematical identities in populations of individuals but not in random samples from those populations. First, test scores are subject to the same sampling error that is familiar in statistical estimation and hypothesis testing. Second, the assumptions made in derivation of formulas in test…
Descriptors: Test Theory, Equations (Mathematics), Scores, Sampling
Schochet, Peter Z. – Journal of Educational and Behavioral Statistics, 2008
This article examines theoretical and empirical issues related to the statistical power of impact estimates for experimental evaluations of education programs. The author considers designs where random assignment is conducted at the school, classroom, or student level, and employs a unified analytic framework using statistical methods from the…
Descriptors: Elementary School Students, Research Design, Standardized Tests, Program Evaluation

Zimmerman, Donald W. – Journal of Educational and Behavioral Statistics, 1997
Paired-samples experimental designs are appropriate and widely used when there is a natural correspondence or pairing of scores. However, researchers must not fail to consider the implications of undetected correlation between supposedly independent samples in the absence of explicit pairing. (SLD)
Descriptors: Comparative Analysis, Correlation, Experiments, Research Design