ERIC - Search Results

Showing 1 to 15 of 24 results

Digital Module 27: Hierarchical Rater Models

Peer reviewed

Casabianca, Jodi M. – Educational Measurement: Issues and Practice, 2021

Module Overview: In this digital ITEMS module, Dr. Jodi M. Casabianca provides a primer on the "hierarchical rater model" (HRM) framework and the recent expansions to the model for analyzing raters and ratings of constructed responses. In the first part of the module, she establishes an understanding of the nature of constructed…

Descriptors: Hierarchical Linear Modeling, Rating Scales, Error of Measurement, Item Response Theory

Growth across Grades and Common Item Grade Alignment in Vertical Scaling Using the Rasch Model

Peer reviewed

Sanford R. Student; Derek C. Briggs; Laurie Davis – Educational Measurement: Issues and Practice, 2025

Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…

Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

It's Not Just Angoff: Misperceptions of Hard and Easy Items in Bookmark-Type Ratings

Peer reviewed

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020

A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…

Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items

The Invariance Paradox: Using Optimal Test Design to Minimize Bias

Peer reviewed

Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020

Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…

Descriptors: Test Construction, Test Bias, Classification, Accuracy

Digital Module 18: Automated Scoring

Peer reviewed

Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…

Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment

A Technical Note on IRT Simulation Studies: Dealing with Truth, Estimates, Observed Data, and Residuals

Peer reviewed

Luecht, Richard; Ackerman, Terry A. – Educational Measurement: Issues and Practice, 2018

Simulation studies are extremely common in the item response theory (IRT) research literature. This article presents a didactic discussion of "truth" and "error" in IRT-based simulation studies. We ultimately recommend that future research focus less on the simple recovery of parameters from a convenient generating IRT model,…

Descriptors: Item Response Theory, Simulation, Ethics, Error of Measurement

Covariate Measurement Error Correction for Student Growth Percentiles Using the SIMEX Method

Peer reviewed

Shang, Yi; VanIwaarden, Adam; Betebenner, Damian W. – Educational Measurement: Issues and Practice, 2015

In this study, we examined the impact of covariate measurement error (ME) on the estimation of quantile regression and student growth percentiles (SGPs), and found that SGPs tend to be overestimated among students with higher prior achievement and underestimated among those with lower prior achievement, a problem we describe as ME endogeneity in…

Descriptors: Error of Measurement, Regression (Statistics), Achievement Gains, Students

The Impact of Measurement Error on the Accuracy of Individual and Aggregate SGP

Peer reviewed

McCaffrey, Daniel F.; Castellano, Katherine E.; Lockwood, J. R. – Educational Measurement: Issues and Practice, 2015

Student growth percentiles (SGPs) express students' current observed scores as percentile ranks in the distribution of scores among students with the same prior-year scores. A common concern about SGPs at the student level, and mean or median SGPs (MGPs) at the aggregate level, is potential bias due to test measurement error (ME). Shang,…

Descriptors: Error of Measurement, Accuracy, Achievement Gains, Students

Do 45% of College Students Lack Critical Thinking Skills? Revisiting a Central Conclusion of "Academically Adrift"

Peer reviewed

Lane, David; Oswald, Frederick L. – Educational Measurement: Issues and Practice, 2016

The educational literature, the popular press, and educated laypeople have all echoed a conclusion from the book "Academically Adrift" by Richard Arum and Josipa Roksa (which has now become received wisdom), namely, that 45% of college students showed no significant gains in critical thinking skills. Similar results were reported by…

Descriptors: College Students, Critical Thinking, Thinking Skills, Statistical Analysis

The Accuracy of Aggregate Student Growth Percentiles as Indicators of Educator Performance

Peer reviewed

Castellano, Katherine E.; McCaffrey, Daniel F. – Educational Measurement: Issues and Practice, 2017

Mean or median student growth percentiles (MGPs) are a popular measure of educator performance, but they lack rigorous evaluation. This study investigates the error in MGP due to test score measurement error (ME). Using analytic derivations, we find that errors in the commonly used MGP are correlated with average prior latent achievement: Teachers…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Value Added Models, Achievement Gains

Uncovering Multivariate Structure in Classroom Observations in the Presence of Rater Errors

Peer reviewed

McCaffrey, Daniel F.; Yuan, Kun; Savitsky, Terrance D.; Lockwood, J. R.; Edelen, Maria O. – Educational Measurement: Issues and Practice, 2015

We examine the factor structure of scores from the CLASS-S protocol obtained from observations of middle school classroom teaching. Factor analysis has been used to support both interpretations of scores from classroom observation protocols, like CLASS-S, and the theories about teaching that underlie them. However, classroom observations contain…

Descriptors: Factor Structure, Multivariate Analysis, Scores, Factor Analysis

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

Exploring the Utility of Sequential Analysis in Studying Informal Formative Assessment Practices

Peer reviewed

Furtak, Erin Marie; Ruiz-Primo, Maria Araceli; Bakeman, Roger – Educational Measurement: Issues and Practice, 2017

Formative assessment is a classroom practice that has received much attention in recent years for its established potential at increasing student learning. A frequent analytic approach for determining the quality of formative assessment practices is to develop a coding scheme and determine frequencies with which the codes are observed; however,…

Descriptors: Sequential Approach, Formative Evaluation, Alternative Assessment, Incidence

The Effect of Ignoring Classroom-Level Variance in Estimating the Generalizability of School Mean Scores

Peer reviewed

Wei, Xin; Haertel, Edward – Educational Measurement: Issues and Practice, 2011

Contemporary educational accountability systems, including state-level systems prescribed under No Child Left Behind as well as those envisioned under the "Race to the Top" comprehensive assessment competition, rely on school-level summaries of student test scores. The precision of these score summaries is almost always evaluated using models that…

Descriptors: Scores, Reliability, Computation, Generalizability Theory
