Showing 1 to 15 of 61 results
Peer reviewed
Chunyan Liu; Raja Subhiyah; Richard A. Feinberg – Applied Measurement in Education, 2024
Mixed-format tests that include both multiple-choice (MC) and constructed-response (CR) items have become widely used in many large-scale assessments. When an item response theory (IRT) model is used to score a mixed-format test, the unidimensionality assumption may be violated if the CR items measure a different construct from that measured by MC…
Descriptors: Test Format, Response Style (Tests), Multiple Choice Tests, Item Response Theory
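A minimal sketch of the unidimensionality concern described above, not the authors' analysis: it simulates mixed-format responses driven by two correlated latent traits (the trait correlation, item counts, and parameter ranges below are all invented) and checks the first-to-second eigenvalue ratio of the inter-item correlation matrix, one crude signal of whether a single dimension suffices.

```python
import numpy as np

rng = np.random.default_rng(0)
n, n_mc, n_cr = 2000, 20, 5
rho = 0.6  # assumed correlation between the MC and CR constructs

# Two correlated latent traits: one driving the MC items, one the CR items.
theta = rng.multivariate_normal([0.0, 0.0], [[1.0, rho], [rho, 1.0]], size=n)

# MC items: dichotomous 2PL-style responses.
a_mc = rng.uniform(0.8, 2.0, n_mc)
b_mc = rng.normal(0.0, 1.0, n_mc)
p_mc = 1 / (1 + np.exp(-a_mc * (theta[:, [0]] - b_mc)))
x_mc = (rng.random((n, n_mc)) < p_mc).astype(float)

# CR items: coarse 0-4 scores driven by the second trait plus noise.
x_cr = np.clip(np.round(2 + 1.2 * theta[:, [1]] + rng.normal(0, 1, (n, n_cr))), 0, 4)

# Eigenvalue ratio of the full inter-item (Pearson) correlation matrix:
# a large first-to-second ratio is often read as support for unidimensionality.
evals = np.linalg.eigvalsh(np.corrcoef(np.hstack([x_mc, x_cr]).T))[::-1]
print("eigenvalue ratio (1st/2nd):", round(evals[0] / evals[1], 2))
```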
Peer reviewed
PDF full text on ERIC
Jeffrey Matayoshi; Shamya Karumbaiah – Journal of Educational Data Mining, 2024
Various areas of educational research are interested in the transitions between different states--or events--in sequential data, with the goal of understanding the significance of these transitions; one notable example is affect dynamics, which aims to identify important transitions between affective states. Unfortunately, several works have…
Descriptors: Models, Statistical Bias, Data Analysis, Simulation
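For readers unfamiliar with the metrics at issue, the sketch below computes one commonly used transition statistic, L(a -> b) = (P(b|a) - P(b)) / (1 - P(b)), on a toy state sequence. It illustrates the kind of quantity whose bias the paper examines; it does not implement any correction.

```python
from collections import Counter

def transition_L(seq):
    """L(a -> b) = (P(b|a) - P(b)) / (1 - P(b)), computed from one sequence.

    A transition metric of the kind the paper analyzes; no bias
    correction is applied here.
    """
    states = sorted(set(seq))
    n_next = len(seq) - 1
    base = Counter(seq[1:])                  # base rates of the "next" state
    pairs = Counter(zip(seq[:-1], seq[1:]))  # observed transition counts
    prev_tot = Counter(seq[:-1])
    L = {}
    for a in states:
        for b in states:
            p_b = base[b] / n_next
            p_b_a = pairs[(a, b)] / prev_tot[a] if prev_tot[a] else 0.0
            L[(a, b)] = (p_b_a - p_b) / (1 - p_b) if p_b < 1 else float("nan")
    return L

seq = list("ABABBCACBBAACBCA")  # toy sequence of affective states
for (a, b), v in sorted(transition_L(seq).items()):
    print(f"{a} -> {b}: {v:+.3f}")
```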
Richa Ghevarghese – ProQuest LLC, 2022
Growth mixture modeling (GMM) is a methodological tool used to represent heterogeneity in longitudinal datasets through the identification of unobserved subgroups following qualitatively and quantitatively distinct trajectories in a population. These growth trajectories or functional forms are informed by the underlying developmental theory, are…
Descriptors: Monte Carlo Methods, Longitudinal Studies, Simulation, Growth Models
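As a hedged two-stage stand-in for a true growth mixture model (which estimates classes and growth curves jointly), the sketch below fits per-person OLS growth estimates and then a Gaussian mixture over them, comparing class counts by BIC. All sample sizes and trajectory parameters are invented.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(1)
n, waves = 300, 5
t = np.arange(waves)

# Two latent trajectory classes: low-start/increasing vs. high-start/flat.
cls = rng.random(n) < 0.4
icept = np.where(cls, 10.0, 20.0) + rng.normal(0, 1.5, n)
slope = np.where(cls, 2.0, 0.0) + rng.normal(0, 0.3, n)
y = icept[:, None] + slope[:, None] * t + rng.normal(0, 1.0, (n, waves))

# Stage 1: per-person OLS growth estimates (intercept, slope).
X = np.column_stack([np.ones(waves), t])
coef = np.linalg.lstsq(X, y.T, rcond=None)[0].T  # n x 2 matrix of estimates

# Stage 2: Gaussian mixture over the growth estimates; BIC compares class counts.
for k in (1, 2, 3):
    gm = GaussianMixture(n_components=k, random_state=0).fit(coef)
    print(k, "classes, BIC =", round(gm.bic(coef), 1))
```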
Peer reviewed
Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017
The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set of items (e.g., choose one item to respond from a pair of items), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…
Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis
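A small simulation of why ESI data are nonignorably missing, under assumptions invented here (Rasch-style items, examinees choosing the item they privately expect to answer correctly): the observed proportion correct on each item exceeds what complete data would show.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5000
theta = rng.normal(0, 1, n)
b = np.array([-0.5, 0.5])    # difficulties of one item pair
u = rng.normal(0, 1, (n, 2)) # unobserved person-item propensities

p = 1 / (1 + np.exp(-(theta[:, None] + u - b)))  # success probabilities

# Examinees pick the item they privately expect to get right, so which
# response is missing depends on the unobserved u: nonignorable missingness.
choice = np.argmax(p, axis=1)
resp = (rng.random(n) < p[np.arange(n), choice]).astype(float)

full = (rng.random((n, 2)) < p).astype(float)  # what complete data would show
for j in range(2):
    sel = choice == j
    print(f"item {j}: chosen by {sel.mean():.2%}, "
          f"observed p+ {resp[sel].mean():.3f}, "
          f"complete-data p+ {full[:, j].mean():.3f}")
```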
Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017
This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…
Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments
Peer reviewed
Namey, Emily; Guest, Greg; McKenna, Kevin; Chen, Mario – American Journal of Evaluation, 2016
Evaluators often use qualitative research methods, yet there is little evidence on the comparative cost-effectiveness of the two most commonly employed qualitative methods--in-depth interviews (IDIs) and focus groups (FGs). We performed an inductive thematic analysis of data from 40 IDIs and 40 FGs on the health-seeking behaviors of African…
Descriptors: Cost Effectiveness, Comparative Analysis, Interviews, Focus Groups
Phillips, Shane Michael – ProQuest LLC, 2012
Propensity score matching is a relatively new technique used in observational studies to approximate the balance that random assignment to treatment would produce. This technique summarizes the values of several covariates into a single propensity score that is used as a matching variable to create similar groups. This dissertation comprises two separate…
Descriptors: Statistical Analysis, Educational Research, Simulation, Observation
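A minimal sketch of the technique the abstract describes, using standard tools rather than anything from the dissertation itself: covariates are collapsed into a logistic-regression propensity score, treated units are matched 1:1 to the nearest control on that score, and the matched estimate is compared with the naive difference. The data-generating numbers are invented.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(3)
n = 2000
X = rng.normal(0, 1, (n, 3))  # observed covariates
treat = rng.random(n) < 1 / (1 + np.exp(-(X @ np.array([0.8, -0.5, 0.3]))))
y = 2.0 * treat + X @ np.array([1.0, 1.0, 0.5]) + rng.normal(0, 1, n)  # true effect = 2

# Collapse the covariates into a single propensity score ...
ps = LogisticRegression().fit(X, treat).predict_proba(X)[:, 1]

# ... then 1:1 nearest-neighbor matching on that score (with replacement).
t_idx, c_idx = np.where(treat)[0], np.where(~treat)[0]
matches = c_idx[np.abs(ps[c_idx][None, :] - ps[t_idx][:, None]).argmin(axis=1)]

print("naive difference :", round(y[treat].mean() - y[~treat].mean(), 3))
print("matched estimate :", round((y[t_idx] - y[matches]).mean(), 3))
```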
Guarino, Cassandra M.; Reckase, Mark D.; Wooldridge, Jeffrey M. – Education Finance and Policy, 2015
We investigate whether commonly used value-added estimation strategies produce accurate estimates of teacher effects under a variety of scenarios. We estimate teacher effects in simulated student achievement data sets that mimic plausible types of student grouping and teacher assignment scenarios. We find that no one method accurately captures…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Achievement Gains, Merit Rating
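The sketch below mimics the study's setup in miniature, with invented parameters: teacher effects are simulated under nonrandom student grouping, and two common value-added estimators (mean gain scores versus OLS on the lagged score with teacher dummies) are compared by their correlation with the true effects. It is an illustration of the design, not the authors' code.

```python
import numpy as np

rng = np.random.default_rng(4)
n_teachers, per_class = 50, 40
tau = rng.normal(0, 1, n_teachers)  # true teacher effects

# Nonrandom grouping: higher prior-score students go to better teachers.
prior = rng.normal(0, 1, n_teachers * per_class)
order = np.argsort(prior + rng.normal(0, 1.5, prior.size))  # noisy tracking
teacher = np.empty(prior.size, int)
teacher[order] = np.argsort(tau).repeat(per_class)

post = 0.7 * prior + tau[teacher] + rng.normal(0, 1, prior.size)

# Estimator 1: mean gain scores per teacher.
gain_est = np.array([(post - prior)[teacher == j].mean() for j in range(n_teachers)])

# Estimator 2: OLS with the lagged score and teacher dummies.
D = np.eye(n_teachers)[teacher]
beta = np.linalg.lstsq(np.column_stack([prior, D]), post, rcond=None)[0]
lag_est = beta[1:]

print("corr(truth, gain-score) :", np.corrcoef(tau, gain_est)[0, 1].round(3))
print("corr(truth, lagged OLS) :", np.corrcoef(tau, lag_est)[0, 1].round(3))
```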
Peer reviewed
Debelak, Rudolf; Arendasy, Martin – Educational and Psychological Measurement, 2012
A new approach to identify item clusters fitting the Rasch model is described and evaluated using simulated and real data. The proposed method is based on hierarchical cluster analysis and constructs clusters of items that show a good fit to the Rasch model. It thus gives an estimate of the number of independent scales satisfying the postulates of…
Descriptors: Test Items, Factor Analysis, Evaluation Methods, Simulation
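A rough sketch of the general idea, hierarchical clustering of items into candidate scales, with one large simplification: the inter-item distance here is 1 minus the Pearson correlation, a stand-in for the Rasch-fit-based criterion the paper actually proposes.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(5)
n = 1000
theta = rng.normal(0, 1, (n, 2))  # two independent latent scales

def rasch(th, b):
    """Dichotomous Rasch responses for abilities th and difficulties b."""
    p = 1 / (1 + np.exp(-(th[:, None] - b)))
    return (rng.random(p.shape) < p).astype(float)

X = np.hstack([rasch(theta[:, 0], rng.normal(0, 1, 6)),
               rasch(theta[:, 1], rng.normal(0, 1, 6))])

# Item "distance" = 1 - correlation (a stand-in for a Rasch-fit criterion),
# then average-linkage hierarchical clustering, cut into two clusters.
R = np.corrcoef(X.T)
d = 1 - R[np.triu_indices_from(R, k=1)]  # condensed distance vector
labels = fcluster(linkage(d, method="average"), t=2, criterion="maxclust")
print(labels)  # items 1-6 and 7-12 should end up in different clusters
```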
Peer reviewed
PDF full text on ERIC
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
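For context, the sketch below shows textbook IRT true-score equating, one of the two methods compared: invert form X's test characteristic curve to recover theta for a given raw score, then read the equivalent score off form Y's curve. Item parameters are invented and assumed already on a common scale.

```python
import numpy as np
from scipy.optimize import brentq

rng = np.random.default_rng(6)

def tcc(theta, a, b):
    """Test characteristic curve: expected raw score under a 2PL."""
    return np.sum(1 / (1 + np.exp(-a * (theta - b))))

a_x, b_x = rng.uniform(0.8, 2, 30), rng.normal(0.0, 1, 30)
a_y, b_y = rng.uniform(0.8, 2, 30), rng.normal(0.3, 1, 30)  # form Y slightly harder

def true_score_equate(score_x):
    # Invert form X's TCC to find theta, then read off form Y's TCC.
    theta = brentq(lambda t: tcc(t, a_x, b_x) - score_x, -6, 6)
    return tcc(theta, a_y, b_y)

for s in (5, 10, 15, 20, 25):
    print(f"form X raw {s:2d} -> form Y equivalent {true_score_equate(s):5.2f}")
```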
Peer reviewed
Kim, Eun Sook; Kwok, Oi-man; Yoon, Myeongsun – Structural Equation Modeling: A Multidisciplinary Journal, 2012
Testing factorial invariance has recently gained more attention in different social science disciplines. Nevertheless, when examining factorial invariance, it is generally assumed that the observations are independent of each other, which might not always be true. In this study, we examined the impact of testing factorial invariance in multilevel…
Descriptors: Monte Carlo Methods, Testing, Social Science Research, Factor Structure
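A crude illustration of why ignoring clustering can matter, using first-principal-component loadings as a stand-in for CFA loadings (this is not the authors' multilevel procedure, and all loadings and variances are invented): the within-level measurement model is identical across groups, yet a single-level analysis sees diverging loadings once cluster-level variance differs.

```python
import numpy as np

rng = np.random.default_rng(7)
lam_w = np.array([0.8, 0.8, 0.7, 0.7, 0.6, 0.6])  # within-level loadings (both groups)
lam_b = np.array([0.2, 0.2, 0.4, 0.9, 0.9, 0.9])  # cluster-level loading pattern

def simulate_group(n_clusters, size, between_sd):
    """One within-level factor plus a cluster-level factor shared by classmates."""
    cluster = np.repeat(np.arange(n_clusters), size)
    eta = rng.normal(0, 1, n_clusters * size)      # person factor
    z = rng.normal(0, between_sd, n_clusters)      # cluster factor
    return (np.outer(eta, lam_w) + np.outer(z[cluster], lam_b)
            + rng.normal(0, 1, (n_clusters * size, lam_w.size)))

def pc_loadings(X):
    """First-principal-component loadings, a crude stand-in for CFA loadings."""
    v = np.linalg.eigh(np.cov(X.T))[1][:, -1]
    return np.round(v * np.sign(v.sum()), 2)

# Within-level loadings are identical across groups, but the groups differ in
# cluster-level variance; a single-level analysis that ignores the clustering
# sees spuriously noninvariant loadings:
print("group 0 (weak clustering)  :", pc_loadings(simulate_group(50, 20, 0.2)))
print("group 1 (strong clustering):", pc_loadings(simulate_group(50, 20, 1.2)))
```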
Peer reviewed
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
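A sketch of the MIMIC setup the abstract refers to, assuming the third-party semopy package and its lavaan-style model syntax; the simulated data deliberately include a noninvariant loading, the condition under which the paper reports MIMIC to be insensitive. All generating values are invented.

```python
import numpy as np
import pandas as pd
import semopy  # assumed available: pip install semopy

rng = np.random.default_rng(8)
n = 1000
group = rng.integers(0, 2, n)            # grouping covariate
eta = rng.normal(0, 1, n) + 0.5 * group  # latent mean difference of 0.5

# Loading noninvariance on x3: it loads 0.9 in group 0 but 0.4 in group 1.
load3 = np.where(group == 0, 0.9, 0.4)
df = pd.DataFrame({
    "x1": 0.8 * eta + rng.normal(0, 1, n),
    "x2": 0.7 * eta + rng.normal(0, 1, n),
    "x3": load3 * eta + rng.normal(0, 1, n),
    "group": group,
})

# MIMIC model: one factor, with the group covariate predicting the factor.
# Loadings are implicitly held equal across groups, the assumption at issue.
model = semopy.Model("F =~ x1 + x2 + x3\nF ~ group")
model.fit(df)
print(model.inspect())  # the F ~ group path estimates the latent mean gap
```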
Peer reviewed
PDF full text on ERIC
Bloom, Howard S.; Porter, Kristin E.; Weiss, Michael J.; Raudenbush, Stephen – Society for Research on Educational Effectiveness, 2013
To date, evaluation research and policy analysis have focused mainly on average program impacts and paid little systematic attention to their variation. Recently, the growing number of multi-site randomized trials being planned and conducted makes it increasingly feasible to study "cross-site" variation in impacts. Important…
Descriptors: Research Methodology, Policy, Evaluation Research, Randomized Controlled Trials
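One standard way to quantify cross-site variation from a multi-site trial is a random-effects summary of site-level impact estimates; the sketch below applies the DerSimonian-Laird moment estimator of the cross-site variance to invented site impacts and standard errors. It is generic methodology, not the authors' approach.

```python
import numpy as np

rng = np.random.default_rng(9)
k = 30                           # number of sites
true_tau = 0.15                  # true cross-site impact SD
se = rng.uniform(0.05, 0.20, k)  # site-level standard errors
impact = rng.normal(0.20, true_tau, k) + rng.normal(0, se)  # site impact estimates

# DerSimonian-Laird moment estimator of the cross-site variance tau^2.
w = 1 / se**2
ybar = np.sum(w * impact) / np.sum(w)
Q = np.sum(w * (impact - ybar) ** 2)
tau2 = max(0.0, (Q - (k - 1)) / (np.sum(w) - np.sum(w**2) / np.sum(w)))

print(f"pooled average impact  : {ybar:.3f}")
print(f"estimated cross-site SD: {np.sqrt(tau2):.3f} (true {true_tau})")
```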
Peer reviewed
Walker, Cindy M.; Zhang, Bo; Banks, Kathleen; Cappaert, Kevin – Educational and Psychological Measurement, 2012
The purpose of this simulation study was to establish general effect size guidelines for interpreting the results of differential bundle functioning (DBF) analyses using simultaneous item bias test (SIBTEST). Three factors were manipulated: number of items in a bundle, test length, and magnitude of uniform differential item functioning (DIF)…
Descriptors: Test Bias, Test Length, Simulation, Guidelines
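A simplified SIBTEST-style bundle statistic, for orientation only: examinees are matched on their score over the DIF-free items, and the reference-minus-focal gap in bundle scores is averaged over matching strata. Real SIBTEST adds a regression correction for measurement error in the matching score, which is omitted here; all item and sample parameters are invented.

```python
import numpy as np

rng = np.random.default_rng(10)
n = 4000
grp = rng.integers(0, 2, n)  # 0 = reference, 1 = focal
theta = rng.normal(0, 1, n)

def items(th, b, dif=0.0, focal=None):
    """Rasch-style responses; `dif` shifts difficulty up for the focal group."""
    shift = dif * focal[:, None] if focal is not None else 0.0
    p = 1 / (1 + np.exp(-(th[:, None] - b - shift)))
    return (rng.random(p.shape) < p).astype(int)

bundle = items(theta, np.zeros(4), dif=0.4, focal=grp)  # bundle with uniform DIF
rest = items(theta, rng.normal(0, 1, 26))               # DIF-free matching items

# Match on the rest-score, then average the reference-minus-focal gap in
# bundle scores across matching strata (frequency-weighted).
match = rest.sum(axis=1)
num = den = 0.0
for s in np.unique(match):
    r, f = (match == s) & (grp == 0), (match == s) & (grp == 1)
    if r.any() and f.any():
        w = r.sum() + f.sum()
        num += w * (bundle[r].sum(axis=1).mean() - bundle[f].sum(axis=1).mean())
        den += w
print("estimated beta (bundle DBF):", round(num / den, 3))
```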
Peer reviewed
Moses, Tim; Zhang, Wenmin – Journal of Educational and Behavioral Statistics, 2011
The purpose of this article was to extend the use of standard errors for equated score differences (SEEDs) to traditional equating functions. The SEEDs are described in terms of their original proposal for kernel equating functions and extended so that SEEDs for traditional linear and traditional equipercentile equating functions can be computed.…
Descriptors: Equated Scores, Error Patterns, Evaluation Research, Statistical Analysis
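The paper derives analytic SEEDs; the sketch below instead approximates the same quantity by bootstrap, the standard error of the difference between two traditional equating functions (linear minus mean) at selected score points, under an invented equivalent-groups design.

```python
import numpy as np

rng = np.random.default_rng(11)
x = rng.binomial(40, 0.55, 1500)  # form X scores
y = rng.binomial(40, 0.60, 1500)  # form Y scores (equivalent groups assumed)

def linear_eq(s, x, y):  # traditional linear equating
    return y.mean() + y.std() / x.std() * (s - x.mean())

def mean_eq(s, x, y):    # traditional mean equating
    return s + y.mean() - x.mean()

# Bootstrap stand-in for the SEED: the standard error of the difference
# between the two equating functions at selected score points.
scores = np.arange(0, 41, 10)
diffs = np.empty((500, scores.size))
for r in range(500):
    bx, by = rng.choice(x, x.size), rng.choice(y, y.size)
    diffs[r] = linear_eq(scores, bx, by) - mean_eq(scores, bx, by)

for s, d, e in zip(scores, diffs.mean(axis=0), diffs.std(axis=0)):
    print(f"score {s:2d}: linear minus mean = {d:+.3f}, bootstrap SEED = {e:.3f}")
```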