Showing all 10 results
Peer reviewed
Chan, Wendy; Oh, Jimin; Wilson, Katherine – Society for Research on Educational Effectiveness, 2022
Background: Over the past decade, research on the development and assessment of tools to improve the generalizability of experimental findings has grown extensively (Tipton & Olsen, 2018). However, many experimental studies in education are based on small samples, which may include 30-70 schools while inference populations to which…
Descriptors: Educational Research, Research Problems, Sample Size, Research Methodology
Peer reviewed
Bloom, Howard S.; Spybrook, Jessaca – Journal of Research on Educational Effectiveness, 2017
Multisite trials, which are being used with increasing frequency in education and evaluation research, provide an exciting opportunity for learning about how the effects of interventions or programs are distributed across sites. In particular, these studies can produce rigorous estimates of a cross-site mean effect of program assignment…
Descriptors: Program Effectiveness, Program Evaluation, Sample Size, Evaluation Research
Peer reviewed
Reichardt, Charles S. – Multivariate Behavioral Research, 2011
Maxwell, Cole, and Mitchell (2011) demonstrated that simple structural equation models, when used with cross-sectional data, generally produce biased estimates of mediated effects. I extend those results by showing how simple structural equation models can produce biased estimates of mediated effects when used even with longitudinal data. Even…
Descriptors: Structural Equation Models, Statistical Data, Longitudinal Studies, Error of Measurement
Foley, Brett Patrick – ProQuest LLC, 2010
The 3PL model is a flexible and widely used tool in assessment. However, it suffers from limitations due to its need for large sample sizes. This study introduces and evaluates the efficacy of a new sample size augmentation technique called Duplicate, Erase, and Replace (DupER) Augmentation through a simulation study. Data are augmented using…
Descriptors: Test Length, Sample Size, Simulation, Item Response Theory
Peer reviewed
Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012
A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…
Descriptors: Factor Analysis, Computation, Simulation, Sample Size
Peer reviewed
Forero, Carlos G.; Maydeu-Olivares, Alberto – Psychological Methods, 2009
The performance of parameter estimates and standard errors in estimating F. Samejima's graded response model was examined across 324 conditions. Full information maximum likelihood (FIML) was compared with a 3-stage estimator for categorical item factor analysis (CIFA) when the unweighted least squares method was used in CIFA's third stage. CIFA…
Descriptors: Factor Analysis, Least Squares Statistics, Computation, Item Response Theory
Ashby, Cornelia M. – US Government Accountability Office, 2010
In connection with the Omnibus Appropriations Act, 2009, GAO (Government Accountability Office) was required to study the What Works Clearinghouse (WWC), a federal source of evidence about effective education practices. Operating through a 5-year contract awarded by the U.S. Department of Education's Institute of Education Sciences (IES), the WWC…
Descriptors: Clearinghouses, Instructional Effectiveness, Federal Legislation, Evaluation Methods
Peer reviewed
Monahan, Patrick O.; Ankenmann, Robert D. – Journal of Educational Measurement, 2005
Empirical studies demonstrated Type-I error (TIE) inflation (especially for highly discriminating easy items) of the Mantel-Haenszel chi-square test for differential item functioning (DIF), when data conformed to item response theory (IRT) models more complex than Rasch, and when IRT proficiency distributions differed only in means. However, no…
Descriptors: Sample Size, Item Response Theory, Test Items, Test Bias
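The Mantel-Haenszel chi-square test for DIF mentioned in this entry can be illustrated in a few lines. This is a minimal generic sketch of the standard continuity-corrected MH statistic computed over score-stratified 2×2 tables (group × item correctness), not code from the study; the table counts are invented for illustration:

```python
# Mantel-Haenszel chi-square for DIF across K score strata.
# Each stratum is a 2x2 table: rows = reference/focal group,
# columns = item correct/incorrect.

def mh_chi_square(tables):
    """tables: list of ((A, B), (C, D)) counts, one tuple per stratum."""
    a_sum = e_sum = var_sum = 0.0
    for (a, b), (c, d) in tables:
        t = a + b + c + d             # stratum total
        n1, n0 = a + b, c + d         # reference / focal group totals
        m1, m0 = a + c, b + d         # correct / incorrect totals
        a_sum += a
        e_sum += n1 * m1 / t          # expected A under no DIF
        var_sum += n1 * n0 * m1 * m0 / (t * t * (t - 1))
    # Continuity-corrected MH chi-square (1 degree of freedom)
    return (abs(a_sum - e_sum) - 0.5) ** 2 / var_sum

# Two balanced strata with a mild group-by-correctness association:
chi2 = mh_chi_square([((10, 5), (5, 10)), ((10, 5), (5, 10))])
print(round(chi2, 2))  # → 5.22
```

A value this large relative to the chi-square(1) critical value of 3.84 would flag the item for DIF; the Type-I-error behavior of that decision under non-Rasch data is precisely what the study above examines.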
Peer reviewed
Chan, Wai; Chan, Daniel W.-L. – Psychological Methods, 2004
The standard Pearson correlation coefficient is a biased estimator of the true population correlation, ρ, when the predictor and the criterion are range restricted. To correct the bias, the correlation corrected for range restriction, r-sub(c), has been recommended, and a standard formula based on asymptotic results for estimating its standard…
Descriptors: Computation, Intervals, Sample Size, Monte Carlo Methods
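The range-restriction correction this entry refers to can be sketched with the classical Thorndike Case 2 formula for direct restriction on the predictor; this is a generic illustration of that standard correction, not the authors' code, and the input values are hypothetical:

```python
import math

def correct_for_range_restriction(r, sd_unrestricted, sd_restricted):
    """Thorndike Case 2 correction for direct range restriction on the
    predictor: maps the restricted-sample r to the corrected r_c."""
    u = sd_unrestricted / sd_restricted    # SD ratio (> 1 under restriction)
    return r * u / math.sqrt(1.0 + r * r * (u * u - 1.0))

# A restricted-sample r of .50 where selection halved the predictor SD:
print(round(correct_for_range_restriction(0.50, 2.0, 1.0), 3))  # → 0.756
```

Because r_c is a nonlinear function of r, its sampling distribution differs from that of the uncorrected coefficient, which is why the study above evaluates standard-error formulas for it by Monte Carlo methods.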
Peer reviewed
Lei, Pui-Wa; Chen, Shu-Ying; Yu, Lan – Journal of Educational Measurement, 2006
Mantel-Haenszel and SIBTEST, which have known difficulty in detecting non-unidirectional differential item functioning (DIF), have been adapted with some success for computerized adaptive testing (CAT). This study adapts logistic regression (LR) and the item-response-theory-likelihood-ratio test (IRT-LRT), capable of detecting both unidirectional…
Descriptors: Evaluation Methods, Test Bias, Computer Assisted Testing, Multiple Regression Analysis