ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	12

Descriptor

Evaluation Research	12
Simulation	12
Statistical Analysis	12
Evaluation Methods	6
Psychometrics	4
Computation	3
Equated Scores	3
Error Patterns	3
Factor Analysis	3
Item Response Theory	3
Measurement Techniques	3
Models	3
Research Methodology	3
Test Items	3
Correlation	2
Educational Testing	2
Probability	2
Randomized Controlled Trials	2
Research Design	2
Test Bias	2
Test Format	2
Achievement Tests	1
Admission (School)	1
Bias	1
Case Studies	1
More ▼

Source

Educational and Psychological…	4
ProQuest LLC	2
American Journal of Evaluation	1
Educational Sciences: Theory…	1
Journal of Educational…	1
Journal of Educational and…	1
Society for Research on…	1
Structural Equation Modeling:…	1

Publication Type

Journal Articles	9
Reports - Research	8
Dissertations/Theses -…	2
Information Analyses	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…

What Works Clearinghouse Rating

Showing all 12 results Save | Export

The Nonuse, Misuse, and Proper Use of Pilot Studies in Experimental Evaluation Research

Peer reviewed
PDF on ERIC

Download full text

Direct link

Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017

This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…

Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments

Propensity Score Matching Techniques: Simulation and Application in an Educational Research Context

Direct link

Phillips, Shane Michael – ProQuest LLC, 2012

Propensity score matching is a relatively new technique used in observational studies to approximate data that have been randomly assigned to treatment. This technique assimilates the values of several covariates into a single propensity score that is used as a matching variable to create similar groups. This dissertation comprises two separate…

Descriptors: Statistical Analysis, Educational Research, Simulation, Observation

An Algorithm for Testing Unidimensionality and Clustering Items in Rasch Measurement

Peer reviewed

Direct link

Debelak, Rudolf; Arendasy, Martin – Educational and Psychological Measurement, 2012

A new approach to identify item clusters fitting the Rasch model is described and evaluated using simulated and real data. The proposed method is based on hierarchical cluster analysis and constructs clusters of items that show a good fit to the Rasch model. It thus gives an estimate of the number of independent scales satisfying the postulates of…

Descriptors: Test Items, Factor Analysis, Evaluation Methods, Simulation

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Testing Measurement Invariance Using MIMIC: Likelihood Ratio Test with a Critical Value Adjustment

Peer reviewed

Direct link

Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012

Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…

Descriptors: Test Items, Simulation, Testing, Statistical Analysis

Estimating Cross-Site Impact Variation in the Presence of Heteroscedasticity

Peer reviewed
PDF on ERIC

Download full text

Bloom, Howard S.; Porter, Kristin E.; Weiss, Michael J.; Raudenbush, Stephen – Society for Research on Educational Effectiveness, 2013

To date, evaluation research and policy analysis have focused mainly on average program impacts and paid little systematic attention to their variation. Recently, the growing number of multi-site randomized trials that are being planned and conducted make it increasingly feasible to study "cross-site" variation in impacts. Important…

Descriptors: Research Methodology, Policy, Evaluation Research, Randomized Controlled Trials

Standard Errors of Equating Differences: Prior Developments, Extensions, and Simulations

Peer reviewed

Direct link

Moses, Tim; Zhang, Wenmin – Journal of Educational and Behavioral Statistics, 2011

The purpose of this article was to extend the use of standard errors for equated score differences (SEEDs) to traditional equating functions. The SEEDs are described in terms of their original proposal for kernel equating functions and extended so that SEEDs for traditional linear and traditional equipercentile equating functions can be computed.…

Descriptors: Equated Scores, Error Patterns, Evaluation Research, Statistical Analysis

Impact of Outliers Arising from Unintended and Unknowingly Included Subpopulations on the Decisions about the Number of Factors in Exploratory Factor Analysis

Peer reviewed

Direct link

Liu, Yan; Zumbo, Bruno D. – Educational and Psychological Measurement, 2012

There is a lack of research on the effects of outliers on the decisions about the number of factors to retain in an exploratory factor analysis, especially for outliers arising from unintended and unknowingly included subpopulations. The purpose of the present research was to investigate how outliers from an unintended and unknowingly included…

Descriptors: Factor Analysis, Factor Structure, Evaluation Research, Evaluation Methods

Performance of the S - [chi][squared] Statistic for Full-Information Bifactor Models

Peer reviewed

Direct link

Li, Ying; Rupp, Andre A. – Educational and Psychological Measurement, 2011

This study investigated the Type I error rate and power of the multivariate extension of the S - [chi][squared] statistic using unidimensional and multidimensional item response theory (UIRT and MIRT, respectively) models as well as full-information bifactor (FI-bifactor) models through simulation. Manipulated factors included test length, sample…

Descriptors: Test Length, Item Response Theory, Statistical Analysis, Error Patterns

Diagnostic Procedures for Detecting Nonlinear Relationships between Latent Variables

Peer reviewed

Direct link

Bauer, Daniel J.; Baldasaro, Ruth E.; Gottfredson, Nisha C. – Structural Equation Modeling: A Multidisciplinary Journal, 2012

Structural equation models are commonly used to estimate relationships between latent variables. Almost universally, the fitted models specify that these relationships are linear in form. This assumption is rarely checked empirically, largely for lack of appropriate diagnostic techniques. This article presents and evaluates two procedures that can…

Descriptors: Structural Equation Models, Mixed Methods Research, Statistical Analysis, Sampling

A Comparison of Equating/Linking Using the Stocking-Lord Method and Concurrent Calibration with Mixed-Format Tests in the Non-Equivalent Groups Common-Item Design under IRT

Direct link

Tian, Feng – ProQuest LLC, 2011

There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…

Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis

Model-Free CUSUM Methods for Person Fit

Peer reviewed

Direct link

Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009

This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…

Descriptors: Probability, Simulation, Models, Psychometrics

Arendasy, Martin	1
Armstrong, Ronald D.	1
Baldasaro, Ruth E.	1
Bauer, Daniel J.	1
Bloom, Howard S.	1
Debelak, Rudolf	1
Gottfredson, Nisha C.	1
Kelecioglu, Hülya	1
Kim, Eun Sook	1
Lee, Taehun	1
Li, Ying	1
Liu, Yan	1
Moses, Tim	1
Phillips, Shane Michael	1
Porter, Kristin E.	1
Raudenbush, Stephen	1
Rupp, Andre A.	1
Shi, Min	1
Stuart, Elizabeth A.	1
Tian, Feng	1
Weiss, Michael J.	1
Westlund, Erik	1
Yoon, Myeongsun	1
Zhang, Wenmin	1
Zumbo, Bruno D.	1
More ▼