Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 22 |
| Since 2017 (last 10 years) | 35 |
| Since 2007 (last 20 years) | 118 |
Descriptor
| Evaluation Methods | 210 |
| Validity | 210 |
| Models | 171 |
| Reliability | 67 |
| Program Evaluation | 35 |
| Foreign Countries | 34 |
| Student Evaluation | 33 |
| Elementary Secondary Education | 26 |
| Higher Education | 25 |
| Statistical Analysis | 25 |
| Educational Assessment | 23 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 28 |
| Elementary Secondary Education | 20 |
| Postsecondary Education | 20 |
| Secondary Education | 14 |
| Elementary Education | 7 |
| High Schools | 7 |
| Middle Schools | 4 |
| Grade 8 | 3 |
| Adult Education | 2 |
| Grade 11 | 2 |
| Grade 12 | 2 |
| More ▼ | |
Audience
| Researchers | 16 |
| Practitioners | 2 |
| Policymakers | 1 |
Location
| Australia | 6 |
| Netherlands | 4 |
| Cyprus | 3 |
| New York | 3 |
| South Korea | 3 |
| Thailand | 3 |
| United States | 3 |
| California | 2 |
| Canada | 2 |
| Connecticut | 2 |
| Florida | 2 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 4 |
| Every Student Succeeds Act… | 3 |
| Education Consolidation… | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Edgar C. Merkle; Oludare Ariyo; Sonja D. Winter; Mauricio Garnier-Villarreal – Grantee Submission, 2023
We review common situations in Bayesian latent variable models where the prior distribution that a researcher specifies differs from the prior distribution used during estimation. These situations can arise from the positive definite requirement on correlation matrices, from sign indeterminacy of factor loadings, and from order constraints on…
Descriptors: Models, Bayesian Statistics, Correlation, Evaluation Methods
Kylie L. Anglin – Annenberg Institute for School Reform at Brown University, 2025
Since 2018, institutions of higher education have been aware of the "enrollment cliff" which refers to expected declines in future enrollment. This paper attempts to describe how prepared institutions in Ohio are for this future by looking at trends leading up to the anticipated decline. Using IPEDS data from 2012-2022, we analyze trends…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Wendy Chan – Asia Pacific Education Review, 2024
As evidence from evaluation and experimental studies continue to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…
Descriptors: Probability, Scores, Causal Models, Statistical Inference
Christina Glasauer; Martin K. Yeh; Lois Anne DeLong; Yu Yan; Yanyan Zhuang – Computer Science Education, 2025
Background and Context: Feedback on one's progress is essential to new programming language learners, particularly in out-of-classroom settings. Though many study materials offer assessment mechanisms, most do not examine the accuracy of the feedback they deliver, nor give evidence on its validity. Objective: We investigate the potential use of a…
Descriptors: Novices, Computer Science Education, Programming, Accuracy
Kylie Anglin – AERA Open, 2024
Given the rapid adoption of machine learning methods by education researchers, and the growing acknowledgment of their inherent risks, there is an urgent need for tailored methodological guidance on how to improve and evaluate the validity of inferences drawn from these methods. Drawing on an integrative literature review and extending a…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Reichardt, Charles S. – American Journal of Evaluation, 2022
Evaluators are often called upon to assess the effects of programs. To assess a program effect, evaluators need a clear understanding of how a program effect is defined. Arguably, the most widely used definition of a program effect is the counterfactual one. According to the counterfactual definition, a program effect is the difference between…
Descriptors: Program Evaluation, Definitions, Causal Models, Evaluation Methods
Hyemin Yoon; HyunJin Kim; Sangjin Kim – Measurement: Interdisciplinary Research and Perspectives, 2024
We have maintained the customer grade system that is being implemented to customers with excellent performance through customer segmentation for years. Currently, financial institutions that operate the customer grade system provide similar services based on the score calculation criteria, but the score calculation criteria vary from the financial…
Descriptors: Classification, Artificial Intelligence, Prediction, Decision Making
Manuel T. Rein; Jeroen K. Vermunt; Kim De Roover; Leonie V. D. E. Vogelsmeier – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Researchers often study dynamic processes of latent variables in everyday life, such as the interplay of positive and negative affect over time. An intuitive approach is to first estimate the measurement model of the latent variables, then compute factor scores, and finally use these factor scores as observed scores in vector autoregressive…
Descriptors: Measurement Techniques, Factor Analysis, Scores, Validity
Amrein-Beardsley, Audrey – Educational Assessment, Evaluation and Accountability, 2023
Until recently, legal challenges to using value-added models (VAMs) throughout the United States (US) for high-stakes teacher evaluative decisions (e.g., merit pay, tenure, and termination) were unsuccessful, especially in the state of Florida. Hence, prior and still, multiple teachers throughout Florida have been terminated or involuntarily…
Descriptors: Teacher Dismissal, Case Studies, Court Litigation, Value Added Models
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Audrey Amrein-Beardsley; Matthew Ryan Lavery; Jessica Holloway; Margarita Pivovarova; Debbie L. Hahs-Vaughn – Education Policy Analysis Archives, 2023
Local education agencies (LEAs) continue to use value-added models (VAMs) for teacher evaluation policies and purposes, often with consequences attached. Although the Every Student Succeeds Act (ESSA) provides more flexibility to LEAs, few have discontinued VAM use, suggesting they interpret VAMs as a valid measure of teacher effectiveness. In…
Descriptors: Value Added Models, Evaluation Methods, Teacher Evaluation, Teacher Effectiveness
Manapat, Patrick D.; Edwards, Michael C. – Educational and Psychological Measurement, 2022
When fitting unidimensional item response theory (IRT) models, the population distribution of the latent trait ([theta]) is often assumed to be normally distributed. However, some psychological theories would suggest a nonnormal [theta]. For example, some clinical traits (e.g., alcoholism, depression) are believed to follow a positively skewed…
Descriptors: Robustness (Statistics), Computational Linguistics, Item Response Theory, Psychological Patterns
Price, Heather E.; Smith, Christian – Field Methods, 2021
To identify the dominant cultural models among parents transmitting faith to their children, we find few methodological guidelines to guide coding and analysis of semi-structured interviews. We thus developed a three-phase procedure for our research team. Phase-one follows Campbell et al. by unitizing on meanings rather than words/pages, including…
Descriptors: Semi Structured Interviews, Parents, Religion, Reliability
Heritage, Margaret; Wylie, Caroline – National Research and Development Center to Improve Education for Secondary English Learners at WestEd, 2021
The Comprehensive Assessment System (CAS) Framework presents a vision for a system of assessments for English Learners in secondary grades that brings assessment closer to the classroom and fully involves teachers in assessment development and validation. The CAS Framework is intended to signal a new and equitable direction and to provoke…
Descriptors: Secondary School Students, English Language Learners, Student Evaluation, Models
Meyer, J. Patrick; Dahlin, Michael – NWEA, 2022
The MAP® Growth™ theory of action describes key features of MAP Growth and its position in a comprehensive assessment system. The basic premise of the theory of action is that all students learn when MAP Growth is situated in a comprehensive assessment system and used for its intended purposes to yield information about student learning and enable…
Descriptors: Achievement Tests, Academic Achievement, Achievement Gains, Student Evaluation

Peer reviewed
Direct link
