Publication Date
In 2025 | 5 |
Since 2024 | 29 |
Since 2021 (last 5 years) | 70 |
Since 2016 (last 10 years) | 137 |
Since 2006 (last 20 years) | 205 |
Descriptor
Accuracy | 205 |
Simulation | 205 |
Item Response Theory | 57 |
Models | 56 |
Comparative Analysis | 42 |
Test Items | 41 |
Computation | 39 |
Sample Size | 35 |
Error of Measurement | 33 |
Statistical Analysis | 33 |
Classification | 32 |
More ▼ |
Source
Author
Amanda Goodwin | 4 |
Chang, Hua-Hua | 4 |
Cheng, Ying | 4 |
Matthew Naveiras | 4 |
Moses, Tim | 4 |
Sun-Joo Cho | 4 |
Chun Wang | 3 |
Uto, Masaki | 3 |
Bloom, Howard S. | 2 |
Bolsinova, Maria | 2 |
Bramley, Tom | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 25 |
Postsecondary Education | 22 |
Secondary Education | 17 |
Middle Schools | 8 |
Elementary Education | 7 |
Junior High Schools | 7 |
Elementary Secondary Education | 4 |
Grade 7 | 4 |
High Schools | 4 |
Grade 8 | 3 |
Grade 9 | 3 |
More ▼ |
Audience
Researchers | 1 |
Teachers | 1 |
Location
Netherlands | 3 |
Australia | 2 |
Germany | 2 |
Indiana | 2 |
Massachusetts | 2 |
Spain | 2 |
United Kingdom (England) | 2 |
California | 1 |
China | 1 |
China (Guangzhou) | 1 |
Czech Republic | 1 |
More ▼ |
Laws, Policies, & Programs
Fulbright Hays Act | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Julian F. Lohmann; Nils Machts; Jens Möller; Steffen Zitzmann – Educational Psychology Review, 2025
We propose a novel approach for modeling judgment accuracy that, for the first time, allows for simultaneously considering the rank, level, and differentiation component, the predominantly applied operationalization of teacher judgment accuracy. These components are conceptualized as latent, unobserved individual abilities. The model is introduced…
Descriptors: Teacher Attitudes, Evaluative Thinking, Accuracy, Models
Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024
The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…
Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Julia-Kim Walther; Martin Hecht; Steffen Zitzmann – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Small sample sizes pose a severe threat to convergence and accuracy of between-group level parameter estimates in multilevel structural equation modeling (SEM). However, in certain situations, such as pilot studies or when populations are inherently small, increasing samples sizes is not feasible. As a remedy, we propose a two-stage regularized…
Descriptors: Sample Size, Hierarchical Linear Modeling, Structural Equation Models, Matrices
Dexin Shi; Bo Zhang; Ren Liu; Zhehan Jiang – Educational and Psychological Measurement, 2024
Multiple imputation (MI) is one of the recommended techniques for handling missing data in ordinal factor analysis models. However, methods for computing MI-based fit indices under ordinal factor analysis models have yet to be developed. In this short note, we introduced the methods of using the standardized root mean squared residual (SRMR) and…
Descriptors: Goodness of Fit, Factor Analysis, Simulation, Accuracy
Ferdinand Valentin Stoye; Claudia Tschammler; Oliver Kuss; Annika Hoyer – Research Synthesis Methods, 2024
The development of new statistical models for the meta-analysis of diagnostic test accuracy studies is still an ongoing field of research, especially with respect to summary receiver operating characteristic (ROC) curves. In the recently published updated version of the "Cochrane Handbook for Systematic Reviews of Diagnostic Test…
Descriptors: Diagnostic Tests, Accuracy, Barriers, Models
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
Sotoudeh, Ramina; DiMaggio, Paul – Sociological Methods & Research, 2023
Sociologists increasingly face choices among competing algorithms that represent reasonable approaches to the same task, with little guidance in choosing among them. We develop a strategy that uses simulated data to identify the conditions under which different methods perform well and applies what is learned from the simulations to predict which…
Descriptors: Algorithms, Simulation, Prediction, Correlation
A. M. Sadek; Fahad Al-Muhlaki – Measurement: Interdisciplinary Research and Perspectives, 2024
In this study, the accuracy of the artificial neural network (ANN) was assessed considering the uncertainties associated with the randomness of the data and the lack of learning. The Monte-Carlo algorithm was applied to simulate the randomness of the input variables and evaluate the output distribution. It has been shown that under certain…
Descriptors: Monte Carlo Methods, Accuracy, Artificial Intelligence, Guidelines
Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2022
Detection methods for item preknowledge are often evaluated in simulation studies where models are used to generate the data. To ensure the reliability of such methods, it is crucial that these models are able to accurately represent situations that are encountered in practice. The purpose of this article is to provide a critical analysis of…
Descriptors: Prior Learning, Simulation, Models, Reaction Time
de Jong, Valentijn M. T.; Campbell, Harlan; Maxwell, Lauren; Jaenisch, Thomas; Gustafson, Paul; Debray, Thomas P. A. – Research Synthesis Methods, 2023
A common problem in the analysis of multiple data sources, including individual participant data meta-analysis (IPD-MA), is the misclassification of binary variables. Misclassification may lead to biased estimators of model parameters, even when the misclassification is entirely random. We aimed to develop statistical methods that facilitate…
Descriptors: Classification, Meta Analysis, Bayesian Statistics, Evaluation Methods
Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023
As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…
Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy
Wiebke M. Roling; Marcus Grum; Norbert Gronau; Annette Kluge – Journal of Workplace Learning, 2024
Purpose: The purpose of this study was to investigate work-related adaptive performance from a longitudinal process perspective. This paper clustered specific behavioral patterns following the introduction of a change and related them to retentivity as an individual cognitive ability. In addition, this paper investigated whether the occurrence of…
Descriptors: Foreign Countries, Behavior Patterns, Retention (Psychology), Manufacturing Industry
Matthew J. Madison; Seungwon Chung; Junok Kim; Laine P. Bradshaw – Grantee Submission, 2023
Recent developments have enabled the modeling of longitudinal assessment data in a diagnostic classification model (DCM) framework. These longitudinal DCMs were developed to provide measures of student growth on a discrete scale in the form of attribute mastery transitions, thereby supporting categorical and criterion-referenced interpretations of…
Descriptors: Models, Cognitive Measurement, Diagnostic Tests, Classification
Yuan Fang; Lijuan Wang – Grantee Submission, 2024
Dynamic structural equation modeling (DSEM) is a useful technique for analyzing intensive longitudinal data. A challenge of applying DSEM is the missing data problem. The impact of missing data on DSEM, especially on widely applied DSEM such as the two-level vector autoregressive (VAR) cross-lagged models, however, is understudied. To fill the…
Descriptors: Structural Equation Models, Research Problems, Longitudinal Studies, Simulation