Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 20 |
Descriptor
Evaluation Utilization | 49 |
Test Validity | 49 |
Test Reliability | 17 |
Elementary Secondary Education | 13 |
Test Use | 12 |
Evaluation Methods | 11 |
Student Evaluation | 10 |
Scores | 9 |
Academic Achievement | 8 |
Educational Assessment | 8 |
Evaluation Problems | 7 |
More ▼ |
Source
Author
Abrami, Philip C. | 2 |
Abedi, Jamal | 1 |
Afflerbach, Peter, Ed. | 1 |
Arzi, Hanna J. | 1 |
Baker, Charles E. | 1 |
Bartel, Kathleen | 1 |
Belur, Vinetha | 1 |
Benson, Jeri | 1 |
Beran, Tanya N. | 1 |
Camara, Wayne J. | 1 |
Carlisle, Joanne | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 6 |
Higher Education | 5 |
Postsecondary Education | 4 |
Elementary Education | 3 |
Grade 1 | 2 |
Early Childhood Education | 1 |
Grade 12 | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
More ▼ |
Audience
Practitioners | 3 |
Teachers | 3 |
Administrators | 1 |
Community | 1 |
Counselors | 1 |
Policymakers | 1 |
Researchers | 1 |
Students | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Kentucky Education Reform Act… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Chitra Sabapathy – Shanlax International Journal of Education, 2024
Background: Mid-semester evaluations are gaining traction as a means to gather evaluation data for formative purposes. However, it is not clear if course coordinators who conduct these evaluations are adequately equipped with evaluative knowledge and skills to guide them through their evaluative processes. Objectives: This study is a…
Descriptors: Evaluation Methods, Instructor Coordinators, Tutors, College Students
Daniella Winter; Yoram Braw – Journal of Attention Disorders, 2022
Background: The current study aimed to validate the utility of previously established validity indicators derived from MOXO-d-CPT's continuous performance test. Method: Healthy simulators feigned impairment after searching online for relevant information, an ecologically valid coaching condition (n = 39). They were compared to ADHD patients (n =…
Descriptors: Foreign Countries, Undergraduate Students, Attention Deficit Hyperactivity Disorder, Computer Simulation
Koretz, Daniel – American Educator, 2018
In "The Testing Charade: Pretending to Make Schools Better", the author's new book from which this article is drawn, the failures of test-based accountability are documented and some of the most egregious misuses and outright abuses of testing are described, along with some of the most serious negative effects. Neither good intentions…
Descriptors: Accountability, Testing, Testing Problems, Test Validity
Sireci, Stephen G. – Assessment in Education: Principles, Policy & Practice, 2016
A misconception exists that validity may refer only to the "interpretation" of test scores and not to the "uses" of those scores. The development and evolution of validity theory illustrate test score interpretation was a primary focus in the earliest days of modern testing, and that validating interpretations derived from test…
Descriptors: Test Validity, Misconceptions, Evaluation Utilization, Data Interpretation
Cizek, Gregory J. – Assessment in Education: Principles, Policy & Practice, 2016
Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…
Descriptors: Scores, Definitions, Evaluation Utilization, Data Interpretation
Moss, Pamela A. – Assessment in Education: Principles, Policy & Practice, 2016
The conventional focus of validity in educational measurement has been on intended interpretations and uses of test scores. Empirical studies of test use by teachers, administrators and policy-makers show that actual interpretations and uses of test scores in context are invariably shaped by local users' questions, which frequently require…
Descriptors: Test Validity, Evaluation Utilization, Educational Assessment, Scores
Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…
Descriptors: Literature Reviews, Classification, Models, Criticism
Klieger, David M.; Belur, Vinetha; Kotloff, Lauren J. – ETS Research Report Series, 2017
This survey study investigated how graduate school admissions committees perceive and use the "GRE"® General Test and "GRE"® Subject Tests after the launch of the "GRE"® revised General Test in August 2011. These perceptions and uses impact the validity of the tests. Prior research about the perceptions and uses of…
Descriptors: Research Reports, College Entrance Examinations, Evaluation Utilization, Graduate Study
Gorur, Radhika – European Educational Research Journal, 2016
PISA is an extremely influential large-scale assessment, and its "policy lessons" are being incorporated in a range of nations all over the world. In this paper I argue that not only is PISA influencing policies and practices, but also that "seeing like PISA" is becoming a widespread phenomenon. Globally, education…
Descriptors: International Assessment, Evaluation Utilization, Test Reliability, Test Validity
Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014
In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…
Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization
Rogers, W. Todd – Canadian Journal of Education, 2014
Principals and teachers do not use large-scale assessment results because the lack of distinct and reliable subtests prevents identifying strengths and weaknesses of students and instruction, the results arrive too late to be used, and principals and teachers need assistance to use the results to improve instruction so as to improve student…
Descriptors: Foreign Countries, Group Testing, Multidimensional Scaling, Evaluation Utilization
Titley, Jonathan E.; D'Amato, Rik Carl; Koehler-Hak, Kathrine M. – Contemporary School Psychology, 2014
The identification of children at-risk for reading problems can be costly and time-consuming. Previous research has indicated that teachers are relatively accurate in assessing children's overall reading ability. This study investigated the accuracy of kindergarten and first grade teacher rating scales in predicting children's reading…
Descriptors: Literacy, Student Evaluation, Achievement Rating, At Risk Students
Feuer, Michael J. – Educational Testing Service, 2011
Few arguments about education are as effective at galvanizing public attention and motivating political action as those that compare the performance of students with their counterparts in other countries and that connect academic achievement to economic performance. Because data from international large-scale assessments (ILSA) have a powerful…
Descriptors: International Assessment, Test Interpretation, Testing Problems, Comparative Testing
Penfield, Randall D. – Educational Researcher, 2010
A growing body of research showing that grade retention serves as an educationally low-quality placement has raised increasing concerns about whether the use of standardized tests in making decisions concerning grade retention conforms to current standards for appropriate and nondiscriminatory test use. This article examines the extent to which…
Descriptors: Test Use, Grade Repetition, Standardized Tests, Learning Readiness
Lane, Suzanne; Zumbo, Bruno D.; Abedi, Jamal; Benson, Jeri; Dossey, John; Elliott, Stephen N.; Kane, Michael; Linn, Robert; Paredes-Ziker, Cindy; Rodriguez, Michael; Schraw, Gregg; Slattery, Jean; Thomas, Veronica; Willhoft, Joe – Applied Measurement in Education, 2009
Given the changing landscape of educational accountability at the local, state, and national levels, and the changes in the uses of the National Assessment of Educational Progress (NAEP), including the evolving uses of NAEP as a policy tool to interpret state assessment and accountability systems, an explicit statement of the current and potential…
Descriptors: National Competency Tests, Academic Achievement, Accountability, Test Validity