ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	12

Descriptor

Probability	15
Sampling	15
Scores	15
Statistical Analysis	5
Scoring	4
Computation	3
Generalizability Theory	3
Generalization	3
Sample Size	3
Statistical Distributions	3
Statistical Inference	3
Academic Achievement	2
Achievement Tests	2
Causal Models	2
Classification	2
Comparative Analysis	2
Correlation	2
Educational Assessment	2
Effect Size	2
Evaluation Methods	2
Experiments	2
Higher Education	2
Hypothesis Testing	2
Interrater Reliability	2
Pretests Posttests	2
More ▼

Source

Journal of Educational and…	2
ProQuest LLC	2
American Journal of Evaluation	1
Annenberg Institute for…	1
Cambridge University Press	1
ETS Research Report Series	1
Educational and Psychological…	1
Institute for Research on…	1
Journal of Educational…	1
Journal of MultiDisciplinary…	1
Journal of Research on…	1
Society for Research on…	1
More ▼

Publication Type

Reports - Research	8
Journal Articles	7
Dissertations/Theses -…	2
Reports - Evaluative	2
Books	1
Numerical/Quantitative Data	1

Education Level

Higher Education	3
Elementary Education	2
Postsecondary Education	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 3	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
Primary Education	1
Secondary Education	1
Two Year Colleges	1
More ▼

Audience

Location

Indiana	2
Florida	1
Tennessee	1
Virginia	1

Laws, Policies, & Programs

Assessments and Surveys

Florida Comprehensive…	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

The Implications of Propensity Score Augmentation for Generalization

Peer reviewed

Direct link

Wendy Chan; Jimin Oh; Chen Li; Jiexuan Huang; Yeran Tong – Society for Research on Educational Effectiveness, 2023

Background: The generalizability of a study's results continues to be at the forefront of concerns in evaluation research in education (Tipton & Olsen, 2018). Over the past decade, statisticians have developed methods, mainly based on propensity scores, to improve generalizations in the absence of random sampling (Stuart et al., 2011; Tipton,…

Descriptors: Generalizability Theory, Probability, Scores, Sampling

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Direct link

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Bringing Transparency to Predictive Analytics: A Systematic Comparison of Predictive Modeling Methods in Higher Education. EdWorkingPaper No. 21-438

Download full text

Kelli A. Bird; Benjamin L. Castleman; Zachary Mabel; Yifeng Song – Annenberg Institute for School Reform at Brown University, 2021

Colleges have increasingly turned to predictive analytics to target at-risk students for additional support. Most of the predictive analytic applications in higher education are proprietary, with private companies offering little transparency about their underlying models. We address this lack of transparency by systematically comparing two…

Descriptors: At Risk Students, Higher Education, Predictive Measurement, Models

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Direct link

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

Partially Identified Treatment Effects for Generalizability

Peer reviewed

Direct link

Chan, Wendy – Journal of Research on Educational Effectiveness, 2017

Recent methods to improve generalizations from nonrandom samples typically invoke assumptions such as the strong ignorability of sample selection, which is challenging to meet in practice. Although researchers acknowledge the difficulty in meeting this assumption, point estimates are still provided and used without considering alternative…

Descriptors: Generalization, Inferences, Probability, Educational Research

Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction

Direct link

Imbens, Guido W.; Rubin, Donald B. – Cambridge University Press, 2015

Most questions in social and biomedical sciences are causal in nature: what would happen to individuals, or to groups, if part of their environment were changed? In this groundbreaking text, two world-renowned experts present statistical methods for studying such questions. This book starts with the notion of potential outcomes, each corresponding…

Descriptors: Causal Models, Statistical Inference, Statistics, Social Sciences

A Comparative Study of Exact versus Propensity Matching Techniques Using Monte Carlo Simulation

Direct link

Itang'ata, Mukaria J. J. – ProQuest LLC, 2013

Often researchers face situations where comparative studies between two or more programs are necessary to make causal inferences for informed policy decision-making. Experimental designs employing randomization provide the strongest evidence for causal inferences. However, many pragmatic and ethical challenges may preclude the use of randomized…

Descriptors: Comparative Analysis, Probability, Statistical Bias, Monte Carlo Methods

Utilizing Generalizability Theory to Investigate the Reliability of the Grades Assigned to Undergraduate Research Papers

Peer reviewed

Direct link

Gugiu, Mihaiela R.; Gugiu, Paul C.; Baldus, Robert – Journal of MultiDisciplinary Evaluation, 2012

Background: Educational researchers have long espoused the virtues of writing with regard to student cognitive skills. However, research on the reliability of the grades assigned to written papers reveals a high degree of contradiction, with some researchers concluding that the grades assigned are very reliable whereas others suggesting that they…

Descriptors: Grades (Scholastic), Grading, Scoring Rubrics, Research Design

Exploring the Impact of Varying Levels of Augmented Reality to Teach Probability and Sampling with a Mobile Device

Direct link

Conley, Quincy – ProQuest LLC, 2013

Statistics is taught at every level of education, yet teachers often have to assume their students have no knowledge of statistics and start from scratch each time they set out to teach statistics. The motivation for this experimental study comes from interest in exploring educational applications of augmented reality (AR) delivered via mobile…

Descriptors: Statistics, Mathematics Instruction, Simulated Environment, Computer Simulation

Exploring the Value Added of a Guided, Silent Reading Intervention: Effects on Struggling Third-Grade Readers' Achievement

Peer reviewed

Direct link

Reutzel, D. Ray; Petscher, Yaacov; Spichtig, Alexandra N. – Journal of Educational Research, 2012

The authors' purpose was to explore the effects of a supplementary, guided, silent reading intervention with 80 struggling third-grade readers who were retained at grade level as a result of poor performance on the reading portion of a criterion referenced state assessment. The students were distributed in 11 elementary schools in a large, urban…

Descriptors: Academic Achievement, Achievement Tests, Control Groups, Reading Fluency

The Robustness of Tilton's Measure of Overlap

Peer reviewed

Elster, Richard S.; Dunnette, Marvin D. – Educational and Psychological Measurement, 1971

Descriptors: Hypothesis Testing, Measurement Techniques, Probability, Sampling

How Close Is Close Enough? Testing Nonexperimental Estimates of Impact against Experimental Estimates of Impact with Education Test Scores as Outcomes. Discussion Paper No. 1242-02

Direct link

Wilde, Elizabeth Ty; Hollister, Robinson – Institute for Research on Poverty, 2002

In this study we test the performance of some nonexperimental estimators of impacts applied to an educational intervention--reduction in class size--where achievement test scores were the outcome. We compare the nonexperimental estimates of the impacts to "true impact" estimates provided by a random-assignment design used to assess the…

Descriptors: Computation, Outcome Measures, Achievement Tests, Scores

Tailored Testing, An Application of Stochastic Approximation.

Download full text

Lord, Frederic M. – 1971

Some stochastic approximation procedures are considered in relation to the problem of choosing a sequence of test questions to accurately estimate a given examinee's standing on a psychological dimension. Illustrations are given evaluating certain procedures in a specific context. (Author/CK)

Descriptors: Academic Ability, Adaptive Testing, Computer Programs, Difficulty Level

Chan, Wendy	3
Baldus, Robert	1
Benjamin L. Castleman	1
Chen Li	1
Conley, Quincy	1
Donoghue, John R.	1
Dunnette, Marvin D.	1
Elster, Richard S.	1
Gugiu, Mihaiela R.	1
Gugiu, Paul C.	1
Hess, Melinda R.	1
Hollister, Robinson	1
Imbens, Guido W.	1
Itang'ata, Mukaria J. J.	1
Jiexuan Huang	1
Jimin Oh	1
Kelli A. Bird	1
Lord, Frederic M.	1
McClellan, Catherine A.	1
Petscher, Yaacov	1
Reutzel, D. Ray	1
Rubin, Donald B.	1
Spichtig, Alexandra N.	1
Wendy Chan	1
Wilde, Elizabeth Ty	1
More ▼