Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…
Descriptors: Literature Reviews, Classification, Models, Criticism
Nilsson, Nina L. – Reading & Writing Quarterly, 2013
Over time, criticisms related to the technical rigor of informal reading inventories (IRIs) have led many to question using these assessment instruments for high- or low-stakes purposes. In this article, I examine reliability evidence reported in 11 new and updated IRIs and make comparisons with Spector's earlier analysis that revealed fewer than…
Descriptors: Informal Reading Inventories, Test Reliability, Evaluation Utilization, Content Analysis
Gorur, Radhika – European Educational Research Journal, 2016
PISA is an extremely influential large-scale assessment, and its "policy lessons" are being incorporated in a range of nations all over the world. In this paper I argue that not only is PISA influencing policies and practices, but also that "seeing like PISA" is becoming a widespread phenomenon. Globally, education…
Descriptors: International Assessment, Evaluation Utilization, Test Reliability, Test Validity
Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014
In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…
Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization
Rogers, W. Todd – Canadian Journal of Education, 2014
Principals and teachers do not use large-scale assessment results because the lack of distinct and reliable subtests prevents identifying strengths and weaknesses of students and instruction, the results arrive too late to be used, and principals and teachers need assistance to use the results to improve instruction so as to improve student…
Descriptors: Foreign Countries, Group Testing, Multidimensional Scaling, Evaluation Utilization
Polikoff, Morgan S.; McEachin, Andrew J.; Wrabel, Stephani L.; Duque, Matthew – Educational Researcher, 2014
Forty-two states and the District of Columbia have recently received waivers to the school accountability requirements of the No Child Left Behind Act (NCLB). As the prospects for reauthorizing the Act in the near term are dim, these new accountability systems will be law for at least several years. Drawing on a four-part framework from the…
Descriptors: Accountability, Federal Legislation, Educational Legislation, Educational Policy
Titley, Jonathan E.; D'Amato, Rik Carl; Koehler-Hak, Kathrine M. – Contemporary School Psychology, 2014
The identification of children at-risk for reading problems can be costly and time-consuming. Previous research has indicated that teachers are relatively accurate in assessing children's overall reading ability. This study investigated the accuracy of kindergarten and first grade teacher rating scales in predicting children's reading…
Descriptors: Literacy, Student Evaluation, Achievement Rating, At Risk Students
Black, Glenda L. – Action in Teacher Education, 2014
Assessment is a complex function requiring an understanding of student learning, assessment principles, practices, and purposes of data to implement effective classroom assessment. The purpose of this study was to add to the growing base of knowledge about teachers' engagement with assessment data and their motivation for classroom assessment.…
Descriptors: Motivation, Class Activities, Semi Structured Interviews, Evaluation Utilization
Knoetze, Jan; Vermoter, Carey-Lee – South African Journal of Education, 2007
Psychological methods of assessing intelligence have been criticised because of their limited diagnostic-remedial nature and especially their lack of potential for initiating effective and pragmatic intervention programmes. Similarly, the means through which the results of such methods are communicated in order to make them useful and constructive…
Descriptors: Educational Assessment, Remedial Reading, Psychoeducational Methods, Focus Groups

Murphy, Kevin R.; And Others – Journal of Educational Psychology, 1984
Using 45 undergraduate evaluations of videotaped lectures, this study examined the effects of the purposes of rating on measures of accuracy in observing teacher behavior and in evaluating teacher performance. Results suggest that the purpose affects the way raters process behavioral information without necessarily affecting the general level of…
Descriptors: Behavior Rating Scales, Decision Making, Evaluation Utilization, Higher Education
Hayhoe, Mike – Highway One, 1985
Stresses the importance of devising accurate methods of evaluation rather than teaching material only because it can be easily evaluated. (DF)
Descriptors: Accountability, English Instruction, Evaluation Criteria, Evaluation Methods

Russon, Craig; Koehly, Laura M. – Evaluation and Program Planning, 1995
A scale was developed for measuring the persuasive impact of qualitative and quantitative evaluation reports on decision makers. Using two exploratory (n=192 graduate and undergraduate students) and two confirmatory (n=200 administrators) samples, researchers developed a 28-item Likert-type scale that demonstrated high reliability and validity.…
Descriptors: Administrators, Attention, College Students, Comprehension
Caudell, Lee Sherman – Northwest Education, 1996
Most states have expanded their statewide testing programs to include alternative educational assessments, and two (Kentucky and Maine) have completely abandoned the multiple-choice format. However, over half of states designing alternative assessments are encountering major difficulties related to the high cost of performance-based assessments,…
Descriptors: Accountability, Alternative Assessment, Costs, Educational Assessment

Radocy, Rudolf E. – Music Educators Journal, 1989
Identifies the underlying concepts of student evaluation. Offers suggestions for evaluating musical achievement. Maintains that all evaluations are subjective, and suggests techniques for minimizing subjectivity. Considers various test formats, and discusses objectives for both classroom and performance achievement. (RW)
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation Criteria, Evaluation Problems
Kendall, Ian M. – Psychological Test Bulletin, 1991
A survey identified 352 psychological tests that are used in Australia but produced in other countries. About 89 percent have been standardized for use in Australia, but no local data have been compiled for some high-use tests. Responsibilities of test distributors and users are discussed. (SLD)
Descriptors: Evaluation Utilization, Foreign Countries, International Studies, Local Norms