Publication Date
In 2025 | 3 |
Since 2024 | 13 |
Since 2021 (last 5 years) | 39 |
Since 2016 (last 10 years) | 78 |
Since 2006 (last 20 years) | 218 |
Descriptor
Evaluation Methods | 283 |
Inferences | 214 |
Statistical Inference | 73 |
Models | 47 |
Research Methodology | 44 |
Student Evaluation | 42 |
Comparative Analysis | 37 |
Validity | 34 |
Foreign Countries | 33 |
Correlation | 31 |
Statistical Analysis | 31 |
More ▼ |
Source
Author
Bloom, Howard S. | 3 |
Steiner, Peter M. | 3 |
Suen, Hoi K. | 3 |
Avi Feller | 2 |
Bahramlou, Khosro | 2 |
Baumgartner, Michael | 2 |
Ben-Michael, Eli | 2 |
Blunk, Merrie | 2 |
Cook, Thomas D. | 2 |
Crosson, Amy C. | 2 |
Ercikan, Kadriye | 2 |
More ▼ |
Publication Type
Education Level
Location
California | 5 |
United Kingdom (England) | 4 |
Australia | 3 |
Israel | 3 |
United States | 3 |
China | 2 |
Kentucky | 2 |
Netherlands | 2 |
New York | 2 |
Ohio | 2 |
Taiwan | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Head Start | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Sinharay, Sandip – Journal of Educational Measurement, 2023
Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…
Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation
Kylie L. Anglin – Annenberg Institute for School Reform at Brown University, 2025
Since 2018, institutions of higher education have been aware of the "enrollment cliff" which refers to expected declines in future enrollment. This paper attempts to describe how prepared institutions in Ohio are for this future by looking at trends leading up to the anticipated decline. Using IPEDS data from 2012-2022, we analyze trends…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Wendy Chan – Asia Pacific Education Review, 2024
As evidence from evaluation and experimental studies continue to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…
Descriptors: Probability, Scores, Causal Models, Statistical Inference
Nan Xie; Zhengxu Li; Haipeng Lu; Wei Pang; Jiayin Song; Beier Lu – IEEE Transactions on Learning Technologies, 2025
Classroom engagement is a critical factor for evaluating students' learning outcomes and teachers' instructional strategies. Traditional methods for detecting classroom engagement, such as coding and questionnaires, are often limited by delays, subjectivity, and external interference. While some neural network models have been proposed to detect…
Descriptors: Learner Engagement, Artificial Intelligence, Technology Uses in Education, Educational Technology
Roderick J. Little; James R. Carpenter; Katherine J. Lee – Sociological Methods & Research, 2024
Missing data are a pervasive problem in data analysis. Three common methods for addressing the problem are (a) complete-case analysis, where only units that are complete on the variables in an analysis are included; (b) weighting, where the complete cases are weighted by the inverse of an estimate of the probability of being complete; and (c)…
Descriptors: Foreign Countries, Probability, Robustness (Statistics), Responses
Kylie Anglin – AERA Open, 2024
Given the rapid adoption of machine learning methods by education researchers, and the growing acknowledgment of their inherent risks, there is an urgent need for tailored methodological guidance on how to improve and evaluate the validity of inferences drawn from these methods. Drawing on an integrative literature review and extending a…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Baumgartner, Michael; Ambühl, Mathias – Sociological Methods & Research, 2023
Consistency and coverage are two core parameters of model fit used by configurational comparative methods (CCMs) of causal inference. Among causal models that perform equally well in other respects (e.g., robustness or compliance with background theories), those with higher consistency and coverage are typically considered preferable. Finding the…
Descriptors: Causal Models, Evaluation Methods, Goodness of Fit, Scores
James Ohisei Uanhoro – Educational and Psychological Measurement, 2024
Accounting for model misspecification in Bayesian structural equation models is an active area of research. We present a uniquely Bayesian approach to misspecification that models the degree of misspecification as a parameter--a parameter akin to the correlation root mean squared residual. The misspecification parameter can be interpreted on its…
Descriptors: Bayesian Statistics, Structural Equation Models, Simulation, Statistical Inference
Panchompoo Wisittanawat; Richard Lehrer – Cognition and Instruction, 2024
This report characterizes forms of dialogic support that a sixth-grade teacher generated during whole-class and small-group conversations to help students develop a practice of statistical modeling. During four weeks of instruction, students constructed and revised models to account for variability and uncertainty across a variety of random…
Descriptors: Statistics Education, Mathematical Models, Grade 6, Evaluation Methods
Shunji Wang; Katerina M. Marcoulides; Jiashan Tang; Ke-Hai Yuan – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A necessary step in applying bi-factor models is to evaluate the need for domain factors with a general factor in place. The conventional null hypothesis testing (NHT) was commonly used for such a purpose. However, the conventional NHT meets challenges when the domain loadings are weak or the sample size is insufficient. This article proposes…
Descriptors: Hypothesis Testing, Error of Measurement, Comparative Analysis, Monte Carlo Methods
Parkkinen, Veli-Pekka; Baumgartner, Michael – Sociological Methods & Research, 2023
In recent years, proponents of configurational comparative methods (CCMs) have advanced various dimensions of robustness as instrumental to model selection. But these robustness considerations have not led to computable robustness measures, and they have typically been applied to the analysis of real-life data with unknown underlying causal…
Descriptors: Robustness (Statistics), Comparative Analysis, Causal Models, Models
Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024
A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…
Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History
Marianne Rice; Kausalai Wijekumar; Kacee Lambright; Abigail Bristow – Technology, Knowledge and Learning, 2024
Inferencing is an important and complex process required for successful reading comprehension. Previous research has suggested instruction in inferencing is effective at improving reading comprehension. However, varying definitions of inferencing is likely impacting how inferencing instruction is implemented in practice and inferencing ability is…
Descriptors: Inferences, Reading Comprehension, Textbooks, Grade 4
Mingya Huang; David Kaplan – Journal of Educational and Behavioral Statistics, 2025
The issue of model uncertainty has been gaining interest in education and the social sciences community over the years, and the dominant methods for handling model uncertainty are based on Bayesian inference, particularly, Bayesian model averaging. However, Bayesian model averaging assumes that the true data-generating model is within the…
Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Statistical Inference, Predictor Variables
Oscar Clivio; Avi Feller; Chris Holmes – Grantee Submission, 2024
Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do…
Descriptors: Evaluation Methods, Causal Models, Error of Measurement, Guidelines