Publication Date
In 2025 | 34 |
Since 2024 | 127 |
Since 2021 (last 5 years) | 347 |
Since 2016 (last 10 years) | 661 |
Since 2006 (last 20 years) | 1804 |
Descriptor
Evaluation Methods | 3945 |
Test Validity | 2067 |
Validity | 1463 |
Test Reliability | 987 |
Student Evaluation | 798 |
Foreign Countries | 628 |
Test Construction | 551 |
Reliability | 523 |
Higher Education | 450 |
Measurement Techniques | 417 |
Elementary Secondary Education | 414 |
Author
Fuchs, Lynn S. | 12 |
Baker, Eva L. | 11 |
Cronin, John | 11 |
Marsh, Herbert W. | 11 |
Amrein-Beardsley, Audrey | 9 |
Linn, Robert L. | 9 |
Sireci, Stephen G. | 9 |
Raykov, Tenko | 8 |
Deno, Stanley L. | 7 |
Epstein, Michael H. | 7 |
Matson, Johnny L. | 7 |
Audience
Researchers | 193 |
Practitioners | 121 |
Teachers | 45 |
Administrators | 31 |
Policymakers | 27 |
Students | 15 |
Counselors | 7 |
Media Staff | 4 |
Community | 3 |
Support Staff | 3 |
Parents | 2 |
Location
Australia | 66 |
United Kingdom | 56 |
Canada | 47 |
California | 32 |
Netherlands | 30 |
United States | 30 |
United Kingdom (England) | 26 |
Germany | 23 |
Turkey | 22 |
Taiwan | 21 |
China | 20 |
What Works Clearinghouse Rating
Does not meet standards | 1 |
MacKeen, Jessica; Wright, Tarah; Séguin, Daniel; Cray, Heather – International Journal of Early Childhood Environmental Education, 2022
In this study, we use face and content validity to determine whether a modified game-based testing instrument is appropriate and relevant for quantifying preschool children's emotional, cognitive, and attitudinal affinity with nature. Six environmental psychology experts completed a questionnaire and subsequent interviews with three of them…
Descriptors: Psychometrics, Test Validity, Game Based Learning, Evaluation Methods
Manuel T. Rein; Jeroen K. Vermunt; Kim De Roover; Leonie V. D. E. Vogelsmeier – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Researchers often study dynamic processes of latent variables in everyday life, such as the interplay of positive and negative affect over time. An intuitive approach is to first estimate the measurement model of the latent variables, then compute factor scores, and finally use these factor scores as observed scores in vector autoregressive…
Descriptors: Measurement Techniques, Factor Analysis, Scores, Validity
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Dempsey, Lynn – Child Language Teaching and Therapy, 2021
Planning intervention for narrative comprehension deficits requires a thorough understanding of a child's skill in all component domains. The purpose of this study was to examine the validity of three methods of measuring pre-readers' event knowledge, an important predictor of story comprehension. Thirty-eight typically developing children (12…
Descriptors: Test Validity, Evaluation Methods, Preschool Children, Knowledge Level
Jamaal L. Moore; Zhihui Yi; Jessica M. Hinman; Becky F. Barron; Mark R. Dixon – Journal of Developmental and Physical Disabilities, 2021
The current study examined the convergent validity between the standardized PEAK Comprehensive Assessment (PCA) and the semi-standardized PEAK Pre-assessment (PEAK-PA). Twenty-two participants were administered each tool, and an item by item analysis was conducted to evaluate correlations between tests. The results suggested a strong positive…
Descriptors: Validity, Evaluation Methods, Standardized Tests, Correlation
Amrein-Beardsley, Audrey – Educational Assessment, Evaluation and Accountability, 2023
Until recently, legal challenges to using value-added models (VAMs) throughout the United States (US) for high-stakes teacher evaluative decisions (e.g., merit pay, tenure, and termination) were unsuccessful, especially in the state of Florida. Hence, prior and still, multiple teachers throughout Florida have been terminated or involuntarily…
Descriptors: Teacher Dismissal, Case Studies, Court Litigation, Value Added Models
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Audrey Amrein-Beardsley; Matthew Ryan Lavery; Jessica Holloway; Margarita Pivovarova; Debbie L. Hahs-Vaughn – Education Policy Analysis Archives, 2023
Local education agencies (LEAs) continue to use value-added models (VAMs) for teacher evaluation policies and purposes, often with consequences attached. Although the Every Student Succeeds Act (ESSA) provides more flexibility to LEAs, few have discontinued VAM use, suggesting they interpret VAMs as a valid measure of teacher effectiveness. In…
Descriptors: Value Added Models, Evaluation Methods, Teacher Evaluation, Teacher Effectiveness
Preheim, Michael – ProQuest LLC, 2023
Knowledge assessments in undergraduate mathematics education commonly evaluate response correctness to determine learner proficiency. However, simultaneous evaluation of learner metacognition more accurately assesses the multiple dimensions of knowledge and has been shown to increase assessment validity and reliability. Research into…
Descriptors: Undergraduate Students, Mathematics Education, College Mathematics, Metacognition
Christopher Vatland; Erin E. Barton; Lam Pham; Lise Fox; Mary Louise Hemmeter; Gary Henry – Journal of Positive Behavior Interventions, 2023
In recent years, there has been increased attention regarding systems-level implementation to support the sustained use of evidence-based interventions and supports in authentic early childhood settings. With this comes a need to accurately measure implementation fidelity of the critical features within a framework as well as individual practices.…
Descriptors: Test Construction, Test Validity, Program Implementation, Evidence Based Practice
Mojgan Rashtchi; SeyyedeFateme Ghazi Mir Saeed – Sage Research Methods Cases, 2023
The reason for conducting the present case study was the problems the researchers encountered during data collection for another research project (Primary Study) entitled "The effects of virtual versus traditional flipped classes on EFL learners' grammar knowledge, self-regulation, and autonomy." Two online questionnaires were…
Descriptors: Data Collection, Questionnaires, Barriers, Research Methodology
Nurihan Nasir; Mazlini Adnan; Murugan Rajoo; Anis Oweeda Ismail; Riyan Hidayat – International Electronic Journal of Mathematics Education, 2024
Classroom assessment is essential for tracking students' progress and improving teaching and learning in the classroom. However, the lack of clear documentation to guide teachers in assessing student mastery often hinders effective communication between teachers and stakeholders about the students' progress. This study aimed to develop and test…
Descriptors: Secondary Schools, Evaluation Methods, Electronic Learning, Student Evaluation
Clàudia Roca; Ignasi Ivern; Ignacio Cifre; Olga Bruna – International Journal of Language & Communication Disorders, 2024
Background: In the Spanish and Catalan context, there is currently a lack of standardized, linguistically adapted tools to assess people with communication disorders. This lack is especially evident when it comes to instruments designed to assess functional communication. Aims: The main objective of this study is to adapt the instrument entitled…
Descriptors: Aphasia, Foreign Countries, Communication Disorders, Spanish Speaking
Süleyman Avci; Mustafa Özgenel – International Journal of Psychology and Educational Studies, 2024
The purpose of this study was to adapt the Expectancy Value Scale, Students' Motivation for Homework Scale, Homework Interest Scale, Homework Affective Attitude Scale, Math Homework Purposes Scale into Turkish and to develop the Homework Self Efficacy Scale. 1555 middle school students of 5th and 8th grades participated in the study. The students…
Descriptors: Foreign Countries, Psychometrics, Mathematics Instruction, Homework
Flor de Lis González-Mujico – Education and Information Technologies, 2024
Over the past decade, self-assessment tools have garnered significant attention in the interest of measuring the skillset required by educators and students to function productively and ethically in digitally mediated environments, particularly in relation to education policy implementation. Since stated beliefs do not always align with actual…
Descriptors: Technological Literacy, Evaluation Methods, Test Validity, Test Construction