ERIC - Search Results

Showing all 4 results

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale

Peer reviewed

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level. We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that aggregate linkages can be validated both…

Descriptors: Equated Scores, Validity, Methods, School Districts

Measuring Test Measurement Error: A General Approach

Peer reviewed

Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013

Test-based accountability as well as value-added assessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…

Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

When Can Subscores Have Value?

Peer reviewed

Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008

In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…

Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation

Standard Error Estimation of 3PL IRT True Score Equating with an MCMC Method

Peer reviewed

Liu, Yuming; Schulz, E. Matthew; Yu, Lei – Journal of Educational and Behavioral Statistics, 2008

A Markov chain Monte Carlo (MCMC) method and a bootstrap method were compared in the estimation of standard errors of item response theory (IRT) true score equating. Three test form relationships were examined: parallel, tau-equivalent, and congeneric. Data were simulated based on Reading Comprehension and Vocabulary tests of the Iowa Tests of…

Descriptors: Reading Comprehension, Test Format, Markov Processes, Educational Testing