ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	29

Descriptor

Error of Measurement	49
Evaluation Methods	49
Statistical Analysis	49
Simulation	12
Computation	11
Sample Size	11
Item Response Theory	10
Comparative Analysis	8
Models	8
Statistical Bias	8
Educational Research	7
Goodness of Fit	6
Measurement Techniques	6
Monte Carlo Methods	6
Probability	6
Research Methodology	6
Sampling	6
Accuracy	5
Data Analysis	5
Effect Size	5
Research Problems	5
Scores	5
Structural Equation Models	5
Test Bias	5
Test Items	5
More ▼

Publication Type

Journal Articles	32
Reports - Research	21
Reports - Evaluative	10
Reports - Descriptive	8
Speeches/Meeting Papers	4
Dissertations/Theses -…	3
Guides - Non-Classroom	3
Books	1
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Higher Education	3
Elementary Secondary Education	2
Grade 10	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Researchers	5
Students	1

Location

California (Stanford)	1
Florida	1
Turkey	1
United States	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Florida Comprehensive…

What Works Clearinghouse Rating

Showing 1 to 15 of 49 results Save | Export

Comparing Factor Score Approaches to SEM in Multigroup Models with Small Samples

Peer reviewed

Direct link

Emma Somer; Carl Falk; Milica Miocevic – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Factor Score Regression (FSR) is increasingly employed as an alternative to structural equation modeling (SEM) in small samples. Despite its popularity in psychology, the performance of FSR in multigroup models with small samples remains relatively unknown. The goal of this study was to examine the performance of FSR, namely Croon's correction and…

Descriptors: Scores, Structural Equation Models, Comparative Analysis, Sample Size

Comparison of Kernel Equating Methods under NEAT and NEC Designs

Peer reviewed
PDF on ERIC

Download full text

Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023

In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…

Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis

The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers. Toolkit. NCEE 2022-005

Peer reviewed
PDF on ERIC

Download full text

Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022

BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…

Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing

Using the Standard Wald Confidence Interval for a Population Proportion Hypothesis Test Is a Common Mistake

Peer reviewed

Direct link

Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019

Summary Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…

Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques

Improving Methods for Propensity Score Analysis with Mismeasured Variables by Incorporating Background Variables with Moderated Nonlinear Factor Analysis

Direct link

Greifer, Noah – ProQuest LLC, 2018

There has been some research in the use of propensity scores in the context of measurement error in the confounding variables; one recommended method is to generate estimates of the mis-measured covariate using a latent variable model, and to use those estimates (i.e., factor scores) in place of the covariate. I describe a simulation study…

Descriptors: Evaluation Methods, Probability, Scores, Statistical Analysis

Evaluating Local Independence in Rasch Models with WLSMV Global Fit Indices

Direct link

Hyunsuk Han – ProQuest LLC, 2018

In Huggins-Manley & Han (2017), it was shown that WLSMV global model fit indices used in structural equating modeling practice are sensitive to person parameter estimate RMSE and item difficulty parameter estimate RMSE that results from local dependence in 2-PL IRT models, particularly when conditioning on number of test items and sample size.…

Descriptors: Models, Statistical Analysis, Item Response Theory, Evaluation Methods

Testing Autocorrelation and Partial Autocorrelation: Asymptotic Methods versus Resampling Techniques

Peer reviewed
PDF on ERIC

Download full text

Direct link

Ke, Zijun; Zhang, Zhiyong – Grantee Submission, 2018

Autocorrelation and partial autocorrelation, which provide a mathematical tool to understand repeating patterns in time series data, are often used to facilitate the identification of model orders of time series models (e.g., moving average and autoregressive models). Asymptotic methods for testing autocorrelation and partial autocorrelation such…

Descriptors: Correlation, Mathematical Formulas, Sampling, Monte Carlo Methods

An Unbiased Estimate of Global Interrater Agreement

Peer reviewed

Direct link

Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017

Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…

Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy

Methods to Estimate the Variance of Some Indices of the Signal Detection Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Suero, Manuel; Privado, Jesús; Botella, Juan – Psicologica: International Journal of Methodology and Experimental Psychology, 2017

A simulation study is presented to evaluate and compare three methods to estimate the variance of the estimates of the parameters d and "C" of the signal detection theory (SDT). Several methods have been proposed to calculate the variance of their estimators, "d'" and "c." Those methods have been mostly assessed by…

Descriptors: Evaluation Methods, Theories, Simulation, Statistical Analysis

The Nonuse, Misuse, and Proper Use of Pilot Studies in Experimental Evaluation Research

Peer reviewed
PDF on ERIC

Download full text

Direct link

Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017

This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…

Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments

Reliable and More Powerful Methods for Power Analysis in Structural Equation Modeling

Peer reviewed
PDF on ERIC

Download full text

Direct link

Yuan, Ke-Hai; Zhang, Zhiyong; Zhao, Yanyun – Grantee Submission, 2017

The normal-distribution-based likelihood ratio statistic T[subscript ml] = nF[subscript ml] is widely used for power analysis in structural Equation modeling (SEM). In such an analysis, power and sample size are computed by assuming that T[subscript ml] follows a central chi-square distribution under H[subscript 0] and a noncentral chi-square…

Descriptors: Statistical Analysis, Evaluation Methods, Structural Equation Models, Reliability

Differential Item Functioning Detection with the Mantel-Haenszel Procedure: The Effects of Matching Types and Other Factors

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015

The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…

Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping

Propensity Score Estimation with Data Mining Techniques: Alternatives to Logistic Regression

Peer reviewed
PDF on ERIC

Download full text

Keller, Bryan S. B.; Kim, Jee-Seon; Steiner, Peter M. – Society for Research on Educational Effectiveness, 2013

Propensity score analysis (PSA) is a methodological technique which may correct for selection bias in a quasi-experiment by modeling the selection process using observed covariates. Because logistic regression is well understood by researchers in a variety of fields and easy to implement in a number of popular software packages, it has…

Descriptors: Probability, Scores, Statistical Analysis, Statistical Bias

Handling Missing Data in Educational Research Using SPSS

Direct link

Cheema, Jehanzeb – ProQuest LLC, 2012

This study looked at the effect of a number of factors such as the choice of analytical method, the handling method for missing data, sample size, and proportion of missing data, in order to evaluate the effect of missing data treatment on accuracy of estimation. In order to accomplish this a methodological approach involving simulated data was…

Descriptors: Educational Research, Educational Researchers, Statistical Analysis, Sample Size

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed
PDF on ERIC

Download full text

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	5
ETS Research Report Series	3
ProQuest LLC	3
Psychological Methods	3
Grantee Submission	2
Journal of Educational and…	2
Journal of Experimental…	2
Psychometrika	2
American Journal of Evaluation	1
Applied Measurement in…	1
Applied Psychological…	1
Audio-Visual Language Journal	1
European Journal of Higher…	1
International Journal of…	1
International Journal of…	1
Journal of Consulting and…	1
Journal of Educational…	1
Journal of Speech, Language,…	1
Multivariate Behavioral…	1
National Center for Education…	1
National Center for Research…	1
New Directions for…	1
Program on Education Policy…	1
Psicologica: International…	1
Society for Research on…	1
More ▼

Algina, James	2
DeMars, Christine E.	2
Zhang, Zhiyong	2
Ackerman, Matthew	1
Alcala-Quintana, Rocio	1
Ankenmann, Robert D.	1
Baldwin, Scott A.	1
Bernstein, Lawrence	1
Black, Ken	1
Botella, Juan	1
Briggs, Derek C.	1
Burstein, Nancy	1
Cai, Li	1
Carl Falk	1
Cheema, Jehanzeb	1
Cheng, Soh Kay	1
Cousineau, Denis	1
Deke, John	1
Dekle, Dawn J.	1
Doran, Harold C.	1
Dorans, Neil J.	1
Dudgeon, Paul	1
Dunivant, Noel	1
Echternacht, Gary	1
Egalite, Anna J.	1
More ▼