Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 2 |
Descriptor
Educational Testing | 3 |
Standardized Tests | 3 |
Achievement Tests | 2 |
Error of Measurement | 2 |
Generalizability Theory | 2 |
Mathematics Tests | 2 |
Reading Tests | 2 |
Scores | 2 |
Accountability | 1 |
Bayesian Statistics | 1 |
Correlation | 1 |
More ▼ |
Source
Journal of Educational and… | 3 |
Author
Boyd, Donald | 1 |
Ho, Andrew D. | 1 |
Kalogrides, Demetra | 1 |
Lankford, Hamilton | 1 |
Loeb, Susanna | 1 |
Longford, N. T. | 1 |
Reardon, Sean F. | 1 |
Wyckoff, James | 1 |
Publication Type
Journal Articles | 3 |
Reports - Research | 2 |
Reports - Descriptive | 1 |
Education Level
Grade 4 | 2 |
Grade 8 | 2 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Intermediate Grades | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
More ▼ |
Audience
Location
New York | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Measures of Academic Progress | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level. We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that aggregate linkages can be validated both…
Descriptors: Equated Scores, Validity, Methods, School Districts
Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013
Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…
Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

Longford, N. T. – Journal of Educational and Behavioral Statistics, 1994
Presents a model-based approach to rater reliability for essays read by multiple raters. The approach is motivated by generalizability theory, and variation of rater severity and rater inconsistency is considered in the presence of between-examinee variations. Illustrates methods with data from standardized educational tests. (Author/SLD)
Descriptors: Educational Testing, Essay Tests, Generalizability Theory, Interrater Reliability