Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 6 |
| Since 2007 (last 20 years) | 8 |
Descriptor
Source
| ETS Research Report Series | 2 |
| Assessment for Effective… | 1 |
| Grantee Submission | 1 |
| Journal of Education for… | 1 |
| ProQuest LLC | 1 |
| Scientific Studies of Reading | 1 |
| Stanford Center for Education… | 1 |
Author
| Bochenek, Jennifer | 2 |
| Burkander, Kri | 1 |
| Cline, Fred | 1 |
| Dena Dossett | 1 |
| Early, Diane M. | 1 |
| Ho, Andrew D. | 1 |
| Jason C. Immekus | 1 |
| Jeffrey C. Valentine | 1 |
| Kalogrides, Demetra | 1 |
| Karen Blackburn Hoeve | 1 |
| Klieger, David | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 7 |
| Journal Articles | 6 |
| Tests/Questionnaires | 2 |
| Dissertations/Theses -… | 1 |
Education Level
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Early Childhood Education | 1 |
| Elementary Education | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Early Childhood Environment… | 1 |
| Graduate Record Examinations | 1 |
| National Assessment of… | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Karen Blackburn Hoeve – ProQuest LLC, 2021
High stakes test-based accountability systems primarily rely on aggregates and derivatives of scores from tests that were originally developed to measure individual student mastery of content specifications. Current validity models do not explicitly address this use of aggregate scores to measure the performance of teachers, administrators, and…
Descriptors: Accountability, Test Validity, High Stakes Tests, Hierarchical Linear Modeling
Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025
Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…
Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment
Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…
Descriptors: Test Validity, Evaluation Methods, School Districts, Scores
Early, Diane M.; Sideris, John; Neitzel, Jennifer; LaForett, Doré R.; Nehler, Chelsea G. – Grantee Submission, 2018
The Early Childhood Environment Rating Scale-Third Edition (ECERS-3) is the latest version of one of the most widely used observational tools for assessing the quality of classrooms serving preschool-aged children. This study was the first assessment of its factor structure and validity, an important step given its widespread use. An ECERS-3…
Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Factor Structure
Muijselaar, Marloes M. L. – Scientific Studies of Reading, 2018
We investigated the dimensionality of inference making in samples of 4- to 9-year-olds (Ns = 416-783) to determine if local and global coherence inferences could be distinguished. In addition, we examined the validity of our experimenter-developed inference measure by comparing with three additional measures of listening comprehension. Multitrait,…
Descriptors: Inferences, Thinking Skills, Young Children, Listening Comprehension
Wei, Youhua; Low, Albert – ETS Research Report Series, 2017
In most large-scale programs of tests that aid in making high-stakes decisions, such as the "TOEIC"® family of products and service, it is not unusual for a significant portion of test takers to retake the test at multiple times.The study reported here used multilevel growth modeling to explore the score change patterns of nearly 20,000…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Ling, Guangming; Bochenek, Jennifer; Burkander, Kri – Journal of Education for Business, 2015
By applying multilevel models with random effects, the authors reviewed and synthesized findings from 30 studies that were published in the last 20 years exploring the relationship between the Educational Testing Service Major Field Test for a Bachelor's Degree in Business (MFTB) and related factors. The results suggest that MFTB scores correlated…
Descriptors: Bachelors Degrees, Institutional Research, Educational Testing, Scores
Young, John W.; Klieger, David; Bochenek, Jennifer; Li, Chen; Cline, Fred – ETS Research Report Series, 2014
Scores from the "GRE"® revised General Test provide important information regarding the verbal and quantitative reasoning abilities and analytical writing skills of applicants to graduate programs. The validity and utility of these scores depend upon the degree to which the scores predict success in graduate and business school in…
Descriptors: Business Schools, Scores, Test Validity, College Entrance Examinations

Direct link
Peer reviewed
