Publication Date
In 2025 | 4 |
Since 2024 | 13 |
Since 2021 (last 5 years) | 29 |
Since 2016 (last 10 years) | 51 |
Since 2006 (last 20 years) | 153 |
Descriptor
Evaluation Methods | 229 |
Models | 229 |
Simulation | 181 |
Computer Simulation | 53 |
Item Response Theory | 47 |
Computation | 38 |
Comparative Analysis | 36 |
Data Analysis | 32 |
Teaching Methods | 31 |
Decision Making | 23 |
Prediction | 23 |
More ▼ |
Source
Author
Cai, Li | 3 |
Falk, Carl F. | 3 |
Mislevy, Robert J. | 3 |
Rose, Andrew M. | 3 |
Wilson, Mark | 3 |
Barnes, Tiffany, Ed. | 2 |
Ceulemans, Eva | 2 |
Cohen, Allan S. | 2 |
Green, Jennifer L. | 2 |
Ifenthaler, Dirk, Ed. | 2 |
Jihong Zhang | 2 |
More ▼ |
Publication Type
Education Level
Location
Australia | 3 |
China | 3 |
Afghanistan | 2 |
Germany | 2 |
Greece | 2 |
Ireland | 2 |
Italy | 2 |
Japan | 2 |
Malaysia | 2 |
Mexico | 2 |
Netherlands | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 4 |
California Achievement Tests | 2 |
Medical College Admission Test | 1 |
National Longitudinal Study… | 1 |
Trends in International… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Edmonds, Bruce – International Journal of Social Research Methodology, 2023
This paper looks at the tension between the desire to claim predictive ability for Agent-Based Models (ABMs) and its extreme difficulty for social and ecological systems, suggesting that this is the main cause for the continuance of a rhetoric of prediction that is at odds with what is achievable. Following others, it recommends that it is better…
Descriptors: Models, Prediction, Evaluation Methods, Standards
Jihong Zhang; Jonathan Templin; Xinya Liang – Journal of Educational Measurement, 2024
Recently, Bayesian diagnostic classification modeling has been becoming popular in health psychology, education, and sociology. Typically information criteria are used for model selection when researchers want to choose the best model among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…
Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
A. M. Sadek; Fahad Al-Muhlaki – Measurement: Interdisciplinary Research and Perspectives, 2024
In this study, the accuracy of the artificial neural network (ANN) was assessed considering the uncertainties associated with the randomness of the data and the lack of learning. The Monte-Carlo algorithm was applied to simulate the randomness of the input variables and evaluate the output distribution. It has been shown that under certain…
Descriptors: Monte Carlo Methods, Accuracy, Artificial Intelligence, Guidelines
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Pere J. Ferrando; Ana Hernández-Dorado; Urbano Lorenzo-Seva – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A frequent criticism of exploratory factor analysis (EFA) is that it does not allow correlated residuals to be modelled, while they can be routinely specified in the confirmatory (CFA) model. In this article, we propose an EFA approach in which both the common factor solution and the residual matrix are unrestricted (i.e., the correlated residuals…
Descriptors: Correlation, Factor Analysis, Models, Goodness of Fit
Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024
Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…
Descriptors: Item Response Theory, Test Items, Models, Scoring
Somayeh B. Shafiei; Saeed Shadpour; Farzan Sasangohar; James L. Mohler; Kristopher Attwood; Zhe Jing – npj Science of Learning, 2024
The existing performance evaluation methods in robot-assisted surgery (RAS) are mainly subjective, costly, and affected by shortcomings such as the inconsistency of results and dependency on the raters' opinions. The aim of this study was to develop models for an objective evaluation of performance and rate of learning RAS skills while practicing…
Descriptors: Robotics, Surgery, Eye Movements, Medicine
Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023
In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…
Descriptors: Goodness of Fit, Responses, Likert Scales, Models
Jihong Zhang – ProQuest LLC, 2022
Recently, Bayesian diagnostic classification modeling has been becoming popular in health psychology, education, and sociology. Typically information criteria are used for model selection when researchers want to choose the best model among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…
Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification
Daniel A. Mak; Sebastian Dunn; David Coombes; Carlo R. Carere; Jane R. Allison; Volker Nock; André O. Hudson; Renwick C. J. Dobson – Biochemistry and Molecular Biology Education, 2024
Enzymes are nature's catalysts, mediating chemical processes in living systems. The study of enzyme function and mechanism includes defining the maximum catalytic rate and affinity for substrate/s (among other factors), referred to as enzyme kinetics. Enzyme kinetics is a staple of biochemistry curricula and other disciplines, from molecular and…
Descriptors: Biochemistry, Kinetics, Science Instruction, Teaching Methods
Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023
Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…
Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines
An Improved Two-Stage Randomized Response Model for Estimating the Proportion of Sensitive Attribute
Narjis, Ghulam; Shabbir, Javid – Sociological Methods & Research, 2023
The randomized response technique (RRT) is an effective method designed to obtain the stigmatized information from respondents while assuring the privacy. In this study, we propose a new two-stage RRT model to estimate the prevalence of sensitive attribute ([pi]). A simulation study shows that the empirical mean and variance of proposed estimator…
Descriptors: Comparative Analysis, Incidence, Efficiency, Models
Emma Somer; Carl Falk; Milica Miocevic – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Factor Score Regression (FSR) is increasingly employed as an alternative to structural equation modeling (SEM) in small samples. Despite its popularity in psychology, the performance of FSR in multigroup models with small samples remains relatively unknown. The goal of this study was to examine the performance of FSR, namely Croon's correction and…
Descriptors: Scores, Structural Equation Models, Comparative Analysis, Sample Size