ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	5

Source

Educational Measurement:…

Author

Bakeman, Roger	1
Brennan, Robert L.	1
Castellano, Katherine E.	1
Derek C. Briggs	1
Furtak, Erin Marie	1
Harvill, Leo M.	1
Johnson, Eugene G.	1
Jones, Andrew T.	1
Kolen, Michael J.	1
Kopp, Jason P.	1
Laurie Davis	1
McCaffrey, Daniel F.	1
Ong, Thai Q.	1
Ruiz-Primo, Maria Araceli	1
Sanford R. Student	1
Tong, Ye	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	4
Reports - Evaluative	2
Reports - Descriptive	1

Education Level

Junior High Schools	2
Middle Schools	2
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1

Audience

Location

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Iowa Tests of Basic Skills	1
Iowa Tests of Educational…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Growth across Grades and Common Item Grade Alignment in Vertical Scaling Using the Rasch Model

Peer reviewed

Direct link

Sanford R. Student; Derek C. Briggs; Laurie Davis – Educational Measurement: Issues and Practice, 2025

Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…

Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students

The Invariance Paradox: Using Optimal Test Design to Minimize Bias

Peer reviewed

Direct link

Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020

Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…

Descriptors: Test Construction, Test Bias, Classification, Accuracy

The Accuracy of Aggregate Student Growth Percentiles as Indicators of Educator Performance

Peer reviewed

Direct link

Castellano, Katherine E.; McCaffrey, Daniel F. – Educational Measurement: Issues and Practice, 2017

Mean or median student growth percentiles (MGPs) are a popular measure of educator performance, but they lack rigorous evaluation. This study investigates the error in MGP due to test score measurement error (ME). Using analytic derivations, we find that errors in the commonly used MGP are correlated with average prior latent achievement: Teachers…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Value Added Models, Achievement Gains

Exploring the Utility of Sequential Analysis in Studying Informal Formative Assessment Practices

Peer reviewed

Direct link

Furtak, Erin Marie; Ruiz-Primo, Maria Araceli; Bakeman, Roger – Educational Measurement: Issues and Practice, 2017

Formative assessment is a classroom practice that has received much attention in recent years for its established potential at increasing student learning. A frequent analytic approach for determining the quality of formative assessment practices is to develop a coding scheme and determine frequencies with which the codes are observed; however,…

Descriptors: Sequential Approach, Formative Evaluation, Alternative Assessment, Incidence

Scaling: An Items Module

Peer reviewed

Direct link

Tong, Ye; Kolen, Michael J. – Educational Measurement: Issues and Practice, 2010

"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…

Descriptors: Test Results, Scaling, Measures (Individuals), Raw Scores

Generalizability of Performance Assessments.

Peer reviewed

Brennan, Robert L.; Johnson, Eugene G. – Educational Measurement: Issues and Practice, 1995

The application of generalizability theory to the reliability and error variance estimation for performance assessment scores is discussed. Decision makers concerned with performance assessment need to realize the restrictions that limit generalizability such as limitations that lead to reductions in the number of tasks possible, rater quality,…

Descriptors: Decision Making, Educational Assessment, Error of Measurement, Estimation (Mathematics)

NCME Instructional Module: Standard Error of Measurement.

Peer reviewed

Harvill, Leo M. – Educational Measurement: Issues and Practice, 1991

This paper discusses standard error of measurement (SEM), the amount of variation or spread in the measurement errors for a test, and gives information needed to interpret test scores using SEMs. SEMs at various score levels should be used in calculating score bands rather than a single SEM value. (SLD)

Descriptors: Definitions, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)

Error of Measurement	7
Test Reliability	7
Test Construction	3
Test Validity	3
Accuracy	2
Achievement Gains	2
Alternative Assessment	2
Decision Making	2
Educational Assessment	2
Estimation (Mathematics)	2
Evaluation Criteria	2
Evaluation Methods	2
Item Response Theory	2
Raw Scores	2
Scaling	2
Test Bias	2
Test Interpretation	2
Test Results	2
Classification	1
Classroom Observation…	1
Data Collection	1
Definitions	1
Educational Practices	1
Elementary School Mathematics	1
Elementary School Students	1
More ▼