ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	16
Since 2006 (last 20 years)	40

Descriptor

Sample Size	56
Simulation	56
Sampling	55
Error of Measurement	16
Statistical Analysis	15
Evaluation Methods	12
Item Response Theory	10
Statistical Distributions	10
Effect Size	9
Research Methodology	9
Test Items	9
Computation	8
Monte Carlo Methods	8
Research Design	8
Statistical Bias	8
Correlation	7
Nonparametric Statistics	7
Probability	7
Comparative Analysis	6
Regression (Statistics)	6
Statistical Studies	6
Accuracy	5
Educational Research	5
Equated Scores	5
Goodness of Fit	5
More ▼

Publication Type

Journal Articles	38
Reports - Research	37
Reports - Evaluative	10
Speeches/Meeting Papers	9
Reports - Descriptive	6
Dissertations/Theses -…	3
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Secondary Education	3
High Schools	2
Elementary Secondary Education	1
Higher Education	1
Middle Schools	1
Postsecondary Education	1

Audience

Teachers	3
Researchers	2

Location

Indiana	1
North Carolina	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 56 results Save | Export

Improving the Use of Parallel Analysis by Accounting for Sampling Variability of the Observed Correlation Matrix

Peer reviewed

Direct link

Yan Xia; Xinchang Zhou – Educational and Psychological Measurement, 2025

Parallel analysis has been considered one of the most accurate methods for determining the number of factors in factor analysis. One major advantage of parallel analysis over traditional factor retention methods (e.g., Kaiser's rule) is that it addresses the sampling variability of eigenvalues obtained from the identity matrix, representing the…

Descriptors: Factor Analysis, Statistical Analysis, Evaluation Methods, Sampling

Power Properties of Ordinal Regression Models for Likert Type Data

Peer reviewed
PDF on ERIC

Download full text

Olsson, Ulf – Practical Assessment, Research & Evaluation, 2022

We discuss analysis of 5-grade Likert type data in the two-sample case. Analysis using two-sample "t" tests, nonparametric Wilcoxon tests, and ordinal regression methods, are compared using simulated data based on an ordinal regression paradigm. One thousand pairs of samples of size "n"=10 and "n"=30 were generated,…

Descriptors: Regression (Statistics), Likert Scales, Sampling, Nonparametric Statistics

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Direct link

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

Using the Standardized Root Mean Squared Residual (SRMR) to Assess Exact Fit in Structural Equation Models

Peer reviewed

Direct link

Pavlov, Goran; Maydeu-Olivares, Alberto; Shi, Dexin – Educational and Psychological Measurement, 2021

We examine the accuracy of p values obtained using the asymptotic mean and variance (MV) correction to the distribution of the sample standardized root mean squared residual (SRMR) proposed by Maydeu-Olivares to assess the exact fit of SEM models. In a simulation study, we found that under normality, the MV-corrected SRMR statistic provides…

Descriptors: Structural Equation Models, Goodness of Fit, Simulation, Error of Measurement

Equating with Small and Unbalanced Samples

Peer reviewed

Direct link

Goodman, Joshua T.; Dallas, Andrew D.; Fan, Fen – Applied Measurement in Education, 2020

Recent research has suggested that re-setting the standard for each administration of a small sample examination, in addition to the high cost, does not adequately maintain similar performance expectations year after year. Small-sample equating methods have shown promise with samples between 20 and 30. For groups that have fewer than 20 students,…

Descriptors: Equated Scores, Sample Size, Sampling, Weighted Scores

Hierarchical Bayes Approach to Estimate the Treatment Effect for Randomized Controlled Trials

Peer reviewed

Direct link

Liang, Xinya; Kamata, Akihito; Li, Ji – Educational and Psychological Measurement, 2020

One important issue in Bayesian estimation is the determination of an effective informative prior. In hierarchical Bayes models, the uncertainty of hyperparameters in a prior can be further modeled via their own priors, namely, hyper priors. This study introduces a framework to construct hyper priors for both the mean and the variance…

Descriptors: Bayesian Statistics, Randomized Controlled Trials, Effect Size, Sampling

Robustness of Weighted Differential Item Functioning (DIF) Analysis: The Case of Mantel-Haenszel DIF Statistics. Research Report. ETS RR-21-12

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021

Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…

Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis

Impact of Item Parameter Drift on Rasch Scale Stability in Small Samples over Multiple Administrations

Peer reviewed

Direct link

Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020

Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…

Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Download full text

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

A Simulation Study Evaluating the Generalized Additive Model for Assessing Intervention Effects with Small Samples

Peer reviewed

Direct link

Finch, W. Holmes; Finch, Maria Hernández – Journal of Experimental Education, 2018

Single subject (SS) designs are popular in educational and psychological research. There exist several statistical techniques designed to analyze such data and to address the question of whether an intervention has the desired impact. Recently, researchers have suggested that generalized additive models (GAMs) might be useful for modeling…

Descriptors: Educational Research, Longitudinal Studies, Simulation, Models

Bayesian Inference under Cluster Sampling with Probability Proportional to Size

Peer reviewed
PDF on ERIC

Download full text

Direct link

Makela, Susanna; Si, Yajuan; Gelman, Andrew – Grantee Submission, 2018

Cluster sampling is common in survey practice, and the corresponding inference has been predominantly design-based. We develop a Bayesian framework for cluster sampling and account for the design effect in the outcome modeling. We consider a two-stage cluster sampling design where the clusters are first selected with probability proportional to…

Descriptors: Bayesian Statistics, Statistical Inference, Sampling, Probability

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Direct link

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

Brief Research Report: Growth Models with Small Samples and Missing Data

Peer reviewed

Direct link

McNeish, Daniel – Journal of Experimental Education, 2018

Small samples are common in growth models due to financial and logistical difficulties of following people longitudinally. For similar reasons, longitudinal studies often contain missing data. Though full information maximum likelihood (FIML) is popular to accommodate missing data, the limited number of studies in this area have found that FIML…

Descriptors: Growth Models, Sampling, Sample Size, Hierarchical Linear Modeling

Simulations of the Sampling Distribution of the Mean Do Not Necessarily Mislead and Can Facilitate Learning

Peer reviewed

Direct link

Lane, David M. – Journal of Statistics Education, 2015

Recently Watkins, Bargagliotti, and Franklin (2014) discovered that simulations of the sampling distribution of the mean can mislead students into concluding that the mean of the sampling distribution of the mean depends on sample size. This potential error arises from the fact that the mean of a simulated sampling distribution will tend to be…

Descriptors: Statistical Distributions, Sampling, Sample Size, Misconceptions

Correcting Model Fit Criteria for Small Sample Latent Growth Models with Incomplete Data

Peer reviewed

Direct link

McNeish, Daniel; Harring, Jeffrey R. – Educational and Psychological Measurement, 2017

To date, small sample problems with latent growth models (LGMs) have not received the amount of attention in the literature as related mixed-effect models (MEMs). Although many models can be interchangeably framed as a LGM or a MEM, LGMs uniquely provide criteria to assess global data-model fit. However, previous studies have demonstrated poor…

Descriptors: Growth Models, Goodness of Fit, Error Correction, Sampling

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	7
Journal of Experimental…	6
Journal of Statistics…	3
ProQuest LLC	3
Psychological Methods	3
American Journal of Evaluation	2
Applied Measurement in…	2
International Journal of…	2
Journal of Educational and…	2
Mathematics Teacher	2
ACT, Inc.	1
ETS Research Report Series	1
Grantee Submission	1
Journal of Experimental…	1
Journal of Research on…	1
Practical Assessment,…	1
Psicologica: International…	1
Psychometrika	1
Research Matters	1
Review of Educational Research	1
Society for Research on…	1
Structural Equation Modeling:…	1
More ▼

McNeish, Daniel	3
Chan, Wendy	2
Elmore, Patricia B.	2
Algina, James	1
Anderson, Richard B.	1
Bargagliotti, Anna	1
Barr, James	1
Beasley, T. Mark	1
Bell, Stephen H.	1
Beretvas, S. Natasha	1
Bramley, Tom	1
Broadbooks, Wendy J.	1
Broodbooks, Wendy J.	1
Carifio, James	1
Chen, Hanwei	1
Cizek, Gregory J.	1
Cui, Zhongmin	1
Cumsille, Patricio E.	1
Dallas, Andrew D.	1
Doherty, Michael E.	1
Dorans, Neil J.	1
Fan, Fen	1
Fan, Xitao	1
Fang, Yu	1
More ▼