Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 4 |
Descriptor
Achievement Rating | 4 |
Correlation | 4 |
Program Validation | 4 |
Evaluation Methods | 3 |
Administrator Attitudes | 2 |
Scores | 2 |
Teacher Attitudes | 2 |
Academic Standards | 1 |
Accountability | 1 |
Achievement Tests | 1 |
Administrator Behavior | 1 |
More ▼ |
Source
Applied Measurement in… | 1 |
Educational Assessment,… | 1 |
Leadership and Policy in… | 1 |
Stanford Center for Education… | 1 |
Author
Publication Type
Reports - Research | 4 |
Journal Articles | 3 |
Education Level
Elementary Secondary Education | 1 |
Grade 4 | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Minor, Elizabeth Covay; Porter, Andrew C.; Murphy, Joseph; Goldring, Ellen; Elliott, Stephen N. – Educational Assessment, Evaluation and Accountability, 2017
The Vanderbilt Assessment for Leadership in Education (VAL-ED) is a 360-degree learning-centered behaviors principal evaluation tool that includes ratings from the principal, supervisors, and teachers. The current study assesses the test-retest reliability of the VAL-ED for a sample of seven school districts as part of multiple validity and…
Descriptors: Administrator Evaluation, Principals, Test Reliability, Test Validity
Powers, Donald E.; Escoffery, David S.; Duchnowski, Matthew P. – Applied Measurement in Education, 2015
By far, the most frequently used method of validating (the interpretation and use of) automated essay scores has been to compare them with scores awarded by human raters. Although this practice is questionable, human-machine agreement is still often regarded as the "gold standard." Our objective was to refine this model and apply it to…
Descriptors: Essays, Test Scoring Machines, Program Validation, Criterion Referenced Tests
Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Stanford Center for Education Policy Analysis, 2017
There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…
Descriptors: School Districts, Scores, Statistical Distributions, Database Design
Blitz, Mark H.; Modeste, Marsha – Leadership and Policy in Schools, 2015
The Comprehensive Assessment of Leadership for Learning (CALL) is a multi-source assessment of distributed instructional leadership. As part of the validation of CALL, researchers examined differences between teacher and leader ratings in assessing distributed leadership practices. The authors utilized a t-test for equality of means for the…
Descriptors: Participative Decision Making, Transformational Leadership, Educational Practices, Instructional Leadership