Publication Date
In 2025 | 7 |
Since 2024 | 41 |
Since 2021 (last 5 years) | 137 |
Since 2016 (last 10 years) | 263 |
Since 2006 (last 20 years) | 761 |
Descriptor
Evaluation Methods | 1463 |
Validity | 1463 |
Reliability | 470 |
Student Evaluation | 251 |
Foreign Countries | 232 |
Higher Education | 179 |
Models | 170 |
Research Methodology | 165 |
Elementary Secondary Education | 151 |
Evaluation Criteria | 148 |
Comparative Analysis | 146 |
Author
Fink, Arlene | 6 |
Brandon, Paul R. | 5 |
Herman, Joan L. | 5 |
Raykov, Tenko | 5 |
Bastick, Tony | 4 |
Fuchs, Douglas | 4 |
Fuchs, Lynn S. | 4 |
Kratochwill, Thomas R. | 4 |
Linn, Robert L. | 4 |
Marsh, Herbert W. | 4 |
Thompson, Bruce | 4 |
Audience
Researchers | 67 |
Practitioners | 26 |
Policymakers | 11 |
Students | 11 |
Administrators | 9 |
Teachers | 9 |
Media Staff | 3 |
Community | 2 |
Counselors | 2 |
Location
Australia | 27 |
United Kingdom | 22 |
Canada | 20 |
United States | 17 |
California | 13 |
Netherlands | 12 |
United Kingdom (England) | 11 |
Arizona | 10 |
New Zealand | 10 |
China | 9 |
Florida | 8 |
Alireza Akbari; Mohammadtaghi Shahnazari – Journal of Applied Research in Higher Education, 2025
Purpose: The primary objective of this research paper was to examine the objectivity of the preselected items evaluation (PIE) method, a prevalent translation scoring method deployed by international institutions such as UAntwerpen, UGent and the University of Granada. Design/methodology/approach: This research critically analyzed the scientific…
Descriptors: Evaluation Methods, Translation, Difficulty Level, Validity
Tavares, Walter; Kinnear, Benjamin; Schumacher, Daniel J.; Forte, Milena – Advances in Health Sciences Education, 2023
In this perspective, the authors critically examine "rater training" as it has been conceptualized and used in medical education. By "rater training," they mean the educational events intended to "improve" rater performance and contributions during assessment events. Historically, rater training programs have focused…
Descriptors: Medical Education, Interrater Reliability, Evaluation Methods, Training
Timothy R. Konold; Elizabeth A. Sanders; Kelvin Afolabi – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Measurement invariance (MI) is an essential part of validity evidence concerned with ensuring that tests function similarly across groups, contexts, and time. Most evaluations of MI involve multigroup confirmatory factor analyses (MGCFA) that assume simple structure. However, recent research has shown that constraining non-target indicators to…
Descriptors: Evaluation Methods, Error of Measurement, Validity, Monte Carlo Methods
Kylie L. Anglin – Annenberg Institute for School Reform at Brown University, 2025
Since 2018, institutions of higher education have been aware of the "enrollment cliff," which refers to expected declines in future enrollment. This paper attempts to describe how prepared institutions in Ohio are for this future by looking at trends leading up to the anticipated decline. Using IPEDS data from 2012-2022, we analyze trends…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Wendy Chan – Asia Pacific Education Review, 2024
As evidence from evaluation and experimental studies continues to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…
Descriptors: Probability, Scores, Causal Models, Statistical Inference
Abede Mack; Katelynn Carter-Rogers; Priscilla Bahaw; Ayanna Stephens – Discover Education, 2024
Appetite for entrepreneurship education (EE) among vocational students has surged dramatically, driven by persistent challenges of unemployment. As a result, vocational institutions are increasingly focused on how much entrepreneurship exposure students receive, particularly how frequently instructors impart core business knowledge and skills to…
Descriptors: Entrepreneurship, Technical Education, Student Attitudes, Knowledge Level
Divya Varier; Marvin G. Powell; Stephanie Dodman; Samantha T. Ives; Elizabeth DeMulder; Jenice L. View – Educational Assessment, 2024
Considerable literature is devoted to teachers' assessment use to support teaching and learning. The study examined the factor structure of a measure of teachers' assessment use along the assessments "of", "for", and "as" learning purpose dimensions. The study also examined the factor structure of teachers' perceived…
Descriptors: Assessment Literacy, Elementary Secondary Education, Teacher Evaluation, Teacher Attitudes
Naumann, Sandra; Byrne, Michelle L.; de la Fuente, Alethia; Harrewijn, Anita; Nugiel, Tehila; Rosen, Maya; van Atteveldt, Nienke; Matusz, Pawel J. – Mind, Brain, and Education, 2022
In cognitive neurosciences, fundamental principles of mental processes and functional brain organization have been established with highly controlled tasks and testing environments. Recent technical advances allowed the investigation of these functions and their brain mechanisms in naturalistic settings. The diversity in those approaches has been…
Descriptors: Evaluation Methods, Neurosciences, Educational Research, Validity
Edgar C. Merkle; Oludare Ariyo; Sonja D. Winter; Mauricio Garnier-Villarreal – Grantee Submission, 2023
We review common situations in Bayesian latent variable models where the prior distribution that a researcher specifies differs from the prior distribution used during estimation. These situations can arise from the positive definite requirement on correlation matrices, from sign indeterminacy of factor loadings, and from order constraints on…
Descriptors: Models, Bayesian Statistics, Correlation, Evaluation Methods
Eric Jones – ProQuest LLC, 2022
The assessment of human performance is not a new phenomenon. We have evidence that people have been required to prove their worth dating back at least to the Epic of Gilgamesh. What has changed, at least on a large scale, is the importance given to quantitative evidence in the evaluation process. For example, many employers have begun subjecting…
Descriptors: Performance Based Assessment, Evaluation Methods, Semiotics, Theories
Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…
Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software
Kylie Anglin – AERA Open, 2024
Given the rapid adoption of machine learning methods by education researchers, and the growing acknowledgment of their inherent risks, there is an urgent need for tailored methodological guidance on how to improve and evaluate the validity of inferences drawn from these methods. Drawing on an integrative literature review and extending a…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Fouché, Ilse – Applied Linguistics, 2023
This article, located in the discipline of academic literacy studies, draws upon the fields of critical realism, design research, and evaluation studies. It reports on the validation of a flexible evaluation design for assessing the impact of academic literacy interventions. The design was validated in two ways. Firstly, through a process of…
Descriptors: Foreign Countries, Intervention, Literacy Education, Feedback (Response)
Binici, Salih; Cuhadar, Ismail – Journal of Educational Measurement, 2022
Validity of performance standards is a key element for the defensibility of standard setting results, and validating performance standards requires collecting multiple pieces of evidence at every step during the standard setting process. This study employs a statistical procedure, latent class analysis, to set performance standards and compares…
Descriptors: Validity, Performance, Standards, Multivariate Analysis
Yingchen Wang – SAGE Open, 2024
Surveys are typical for student evaluation of teaching (SET). Survey research consistently confirms the negative impacts of careless responses on research validity, including low data quality and invalid research inferences. SET literature seldom addresses whether careless responses are present and how to improve. To improve evaluation practices and…
Descriptors: Student Evaluation of Teacher Performance, Responses, Validity, Data Use