Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 13 |
Descriptor
Source
Author
| Arya, Diana J. | 1 |
| Boardman, Alison G. | 1 |
| Brumley, Benjamin Pratt | 1 |
| Buckley, Pamela | 1 |
| Carl Westine | 1 |
| Dena Dossett | 1 |
| Early, Diane M. | 1 |
| Ho, Andrew D. | 1 |
| Hofer, S. I. | 1 |
| Jason C. Immekus | 1 |
| Jeffrey C. Valentine | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 11 |
| Journal Articles | 9 |
| Dissertations/Theses -… | 2 |
| Tests/Questionnaires | 2 |
| Numerical/Quantitative Data | 1 |
Education Level
| Junior High Schools | 4 |
| Secondary Education | 4 |
| Early Childhood Education | 3 |
| Middle Schools | 3 |
| Elementary Education | 2 |
| High Schools | 1 |
| Kindergarten | 1 |
| Preschool Education | 1 |
| Primary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Early Childhood Environment… | 1 |
| National Assessment of… | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Karen Blackburn Hoeve – ProQuest LLC, 2021
High stakes test-based accountability systems primarily rely on aggregates and derivatives of scores from tests that were originally developed to measure individual student mastery of content specifications. Current validity models do not explicitly address this use of aggregate scores to measure the performance of teachers, administrators, and…
Descriptors: Accountability, Test Validity, High Stakes Tests, Hierarchical Linear Modeling
Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025
Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…
Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment
Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…
Descriptors: Test Validity, Evaluation Methods, School Districts, Scores
Brumley, Benjamin Pratt – ProQuest LLC, 2019
Children from low-income households are at risk for entering school behind their more economically advantaged peers across major domains of school readiness. The Head Start program represents the federal government's response to these achievement gaps by mandating the use of scientifically based assessments and curricula to provide children with…
Descriptors: School Readiness, Learning Processes, Preschool Children, Measures (Individuals)
Early, Diane M.; Sideris, John; Neitzel, Jennifer; LaForett, Doré R.; Nehler, Chelsea G. – Grantee Submission, 2018
The Early Childhood Environment Rating Scale-Third Edition (ECERS-3) is the latest version of one of the most widely used observational tools for assessing the quality of classrooms serving preschool-aged children. This study was the first assessment of its factor structure and validity, an important step given its widespread use. An ECERS-3…
Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Factor Structure
Buckley, Pamela; Moore, Brooke; Boardman, Alison G.; Arya, Diana J.; Maul, Andrew – American Educational Research Journal, 2017
K-12 intervention studies often include fidelity of implementation (FOI) as a mediating variable, though most do not report the validity of fidelity measures. This article discusses the critical need for validated FOI scales. To illustrate our point, we describe the development and validation of the Implementation Validity Checklist (IVC-R), an…
Descriptors: Intervention, Fidelity, Program Implementation, Test Validity
Muijselaar, Marloes M. L. – Scientific Studies of Reading, 2018
We investigated the dimensionality of inference making in samples of 4- to 9-year-olds (Ns = 416-783) to determine if local and global coherence inferences could be distinguished. In addition, we examined the validity of our experimenter-developed inference measure by comparing with three additional measures of listening comprehension. Multitrait,…
Descriptors: Inferences, Thinking Skills, Young Children, Listening Comprehension
Neugebauer, Sabina Rak – Assessment for Effective Intervention, 2017
While educators and researchers agree on the crucial role of literacy motivation for performance, research on methods for accurately assessing adolescent reading motivation is still uncommon. The most used reading motivation instruments do not attend to the multiple content areas in which adolescents read. The present study examines a new…
Descriptors: Reading Motivation, Content Area Reading, Literacy, Measures (Individuals)
Shinogaya, Keito – Journal of Educational Research, 2018
Preparation is an effective and necessary activity; however, most students do not prepare for future lessons. The present study addressed this problem and examined how learners' motives, beliefs, and perceptions affected their strategy use during preparation for future lessons. Participants were 219 Japanese junior high school students who…
Descriptors: Beliefs, Student Attitudes, Learning Strategies, Foreign Countries
Lichtenberger, A.; Wagner, C.; Hofer, S. I.; Stem, E.; Vaterlaus, A. – Physical Review Physics Education Research, 2017
The kinematics concept test (KCT) is a multiple-choice test designed to evaluate students' conceptual understanding of kinematics at the high school level. The test comprises 49 multiple-choice items about velocity and acceleration, which are based on seven kinematic concepts and which make use of three different representations. In the first part…
Descriptors: Foreign Countries, High School Students, Psychometrics, Multiple Choice Tests
Regional Educational Laboratory Midwest, 2019
These are the appendixes for the report "Children's Knowledge and Skills at Kindergarten Entry in Illinois: Results from the First Statewide Administration of the Kindergarten Individual Development Survey." At least half of states administer or are developing kindergarten entry assessments. In fall 2017 the Illinois State Board of…
Descriptors: Kindergarten, School Readiness, Public Schools, Test Validity
Wei, Youhua; Low, Albert – ETS Research Report Series, 2017
In most large-scale programs of tests that aid in making high-stakes decisions, such as the "TOEIC"® family of products and service, it is not unusual for a significant portion of test takers to retake the test at multiple times.The study reported here used multilevel growth modeling to explore the score change patterns of nearly 20,000…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Peer reviewed
Direct link
