Publication Date
In 2025 | 15 |
Since 2024 | 60 |
Since 2021 (last 5 years) | 148 |
Since 2016 (last 10 years) | 252 |
Since 2006 (last 20 years) | 589 |
Descriptor
Evaluation Methods | 589 |
Simulation | 426 |
Computer Simulation | 169 |
Models | 153 |
Item Response Theory | 104 |
Comparative Analysis | 103 |
Teaching Methods | 89 |
Computation | 87 |
Foreign Countries | 87 |
Educational Technology | 72 |
Student Evaluation | 71 |
More ▼ |
Source
Author
Woods, Carol M. | 6 |
Cai, Li | 5 |
Sinharay, Sandip | 4 |
Zumbo, Bruno D. | 4 |
Beretvas, S. Natasha | 3 |
Chun Wang | 3 |
Cohen, Allan S. | 3 |
Falk, Carl F. | 3 |
Guarino, Cassandra M. | 3 |
Harring, Jeffrey R. | 3 |
Lee, Sik-Yum | 3 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 14 |
Teachers | 12 |
Practitioners | 6 |
Policymakers | 3 |
Students | 3 |
Administrators | 2 |
Media Staff | 1 |
Parents | 1 |
Location
Australia | 9 |
China | 6 |
Japan | 6 |
Turkey | 6 |
United States | 6 |
Ohio | 5 |
United Kingdom | 5 |
United Kingdom (England) | 5 |
European Union | 4 |
Florida | 4 |
Greece | 4 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Americans with Disabilities… | 1 |
Elementary and Secondary… | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Jason A. Schoeneberger; Christopher Rhoads – American Journal of Evaluation, 2025
Regression discontinuity (RD) designs are increasingly used for causal evaluations. However, the literature contains little guidance for conducting a moderation analysis within an RDD context. The current article focuses on moderation with a single binary variable. A simulation study compares: (1) different bandwidth selectors and (2) local…
Descriptors: Regression (Statistics), Causal Models, Evaluation Methods, Multivariate Analysis
Simen Hjellvik; Steven Mallam; Marte Fannelø Giskeødegård; Salman Nazir – Technology, Knowledge and Learning, 2024
Computer-based simulation is utilised across various educational fields, employing diverse technologies to facilitate practical understanding of content and the acquisition of skills that can help close the gap between theory and practice. The possibility of providing scenarios that resemble on-the-job tasks, enables instructors to both train and…
Descriptors: Computer Simulation, Competence, Evaluation Methods, Test Construction
Fisk, Charles L.; Harring, Jeffrey R.; Shen, Zuchao; Leite, Walter; Suen, King Yiu; Marcoulides, Katerina M. – Educational and Psychological Measurement, 2023
Sensitivity analyses encompass a broad set of post-analytic techniques that are characterized as measuring the potential impact of any factor that has an effect on some output variables of a model. This research focuses on the utility of the simulated annealing algorithm to automatically identify path configurations and parameter values of omitted…
Descriptors: Structural Equation Models, Algorithms, Simulation, Evaluation Methods
Yan Xia; Xinchang Zhou – Educational and Psychological Measurement, 2025
Parallel analysis has been considered one of the most accurate methods for determining the number of factors in factor analysis. One major advantage of parallel analysis over traditional factor retention methods (e.g., Kaiser's rule) is that it addresses the sampling variability of eigenvalues obtained from the identity matrix, representing the…
Descriptors: Factor Analysis, Statistical Analysis, Evaluation Methods, Sampling
Edmonds, Bruce – International Journal of Social Research Methodology, 2023
This paper looks at the tension between the desire to claim predictive ability for Agent-Based Models (ABMs) and its extreme difficulty for social and ecological systems, suggesting that this is the main cause for the continuance of a rhetoric of prediction that is at odds with what is achievable. Following others, it recommends that it is better…
Descriptors: Models, Prediction, Evaluation Methods, Standards
Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024
The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…
Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods
Hasan Mahbub Tusher; Steven Mallam; Salman Nazir – Technology, Knowledge and Learning, 2024
The evolving complexity of Virtual Reality (VR) technologies necessitates an in-depth investigation of the VR features and their specific utility. Although VR is utilized across various skill-training applications, its successful deployment depends on both technical maturity and context-specific suitability. A comprehensive understanding of…
Descriptors: Computer Simulation, Skill Development, Professional Training, Outcomes of Education
Jihong Zhang; Jonathan Templin; Xinya Liang – Journal of Educational Measurement, 2024
Recently, Bayesian diagnostic classification modeling has been becoming popular in health psychology, education, and sociology. Typically information criteria are used for model selection when researchers want to choose the best model among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…
Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Tugay Kaçak; Abdullah Faruk Kiliç – International Journal of Assessment Tools in Education, 2025
Researchers continue to choose PCA in scale development and adaptation studies because it is the default setting and overestimates measurement quality. When PCA is utilized in investigations, the explained variance and factor loadings can be exaggerated. PCA, in contrast to the models given in the literature, should be investigated in…
Descriptors: Factor Analysis, Monte Carlo Methods, Mathematical Models, Sample Size
Sinharay, Sandip – Journal of Educational Measurement, 2023
Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…
Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
Stefanie A. Wind; Benjamin Lugu – Applied Measurement in Education, 2024
Researchers who use measurement models for evaluation purposes often select models with stringent requirements, such as Rasch models, which are parametric. Mokken Scale Analysis (MSA) offers a theory-driven nonparametric modeling approach that may be more appropriate for some measurement applications. Researchers have discussed using MSA as a…
Descriptors: Item Response Theory, Data Analysis, Simulation, Nonparametric Statistics
Xinyue Li – Research Matters, 2024
Extended reality (XR) -- encompassing virtual reality (VR), augmented reality (AR), and mixed reality (MR) -- emerges as a potential transformative tool in educational realms. This article explores the potential of XR in facilitating mathematics assessments; it proposes a list of mathematical topics that could be effectively mediated by XR's…
Descriptors: Computer Simulation, Educational Technology, Technology Uses in Education, Mathematics Instruction
Martin Bäckström; Fredrik Björklund – Educational and Psychological Measurement, 2024
The forced-choice response format is often considered superior to the standard Likert-type format for controlling social desirability in personality inventories. We performed simulations and found that the trait information based on the two formats converges when the number of items is high and forced-choice items are mixed with regard to…
Descriptors: Likert Scales, Item Analysis, Personality Traits, Personality Measures