ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	13

Source

Educational Measurement:…

Publication Type

Journal Articles	15
Reports - Research	15
Information Analyses	2

Education Level

Higher Education	2
Junior High Schools	2
Middle Schools	2
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Demystifying Adequate Growth Percentiles

Peer reviewed

Direct link

Katherine E. Castellano; Daniel F. McCaffrey; Joseph A. Martineau – Educational Measurement: Issues and Practice, 2025

Growth-to-standard models evaluate student growth against the growth needed to reach a future standard or target of interest, such as proficiency. A common growth-to-standard model involves comparing the popular Student Growth Percentile (SGP) to Adequate Growth Percentiles (AGPs). AGPs follow from an involved process based on fitting a series of…

Descriptors: Student Evaluation, Growth Models, Student Educational Objectives, Educational Indicators

Growth across Grades and Common Item Grade Alignment in Vertical Scaling Using the Rasch Model

Peer reviewed

Direct link

Sanford R. Student; Derek C. Briggs; Laurie Davis – Educational Measurement: Issues and Practice, 2025

Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…

Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students

Using the "Joint Standards" to Design Postsecondary Assessments with Evidence of Validity and Reliability: An Approach to CAEP Accreditation

Peer reviewed

Direct link

Wilkerson, Judy R. – Educational Measurement: Issues and Practice, 2020

Validity and reliability are a major focus in teacher education accreditation by the Council for Accreditation of Educator Preparation (CAEP). CAEP requires the use of "accepted research standards," but many faculty and administrators are unsure how to meet this requirement. The Standards of Educational and Psychological Testing…

Descriptors: Test Construction, Test Validity, Test Reliability, Teacher Education Programs

The Invariance Paradox: Using Optimal Test Design to Minimize Bias

Peer reviewed

Direct link

Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020

Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…

Descriptors: Test Construction, Test Bias, Classification, Accuracy

Systematic Comparison of Decision Accuracy of Complex Compensatory Decision Rules Combining Multiple Tests in a Higher Education Context

Peer reviewed

Direct link

Yocarini, Iris E.; Bouwmeester, Samantha; Smeets, Guus; Arends, Lidia R. – Educational Measurement: Issues and Practice, 2018

This real-data-guided simulation study systematically evaluated the decision accuracy of complex decision rules combining multiple tests within different realistic curricula. Specifically, complex decision rules combining conjunctive aspects and compensatory aspects were evaluated. A conjunctive aspect requires a minimum level of performance,…

Descriptors: Comparative Analysis, Decision Making, Accuracy, Higher Education

Rater Certification Tests: A Psychometric Approach

Peer reviewed

Direct link

Attali, Yigal – Educational Measurement: Issues and Practice, 2019

Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…

Descriptors: Evaluators, Certification, High Stakes Tests, Scoring

Digital ITEMS Module 1: Reliability in Classical Test Theory

Peer reviewed

Direct link

Lewis, Charlie; Chajewski, Michael; Rupp, André A. – Educational Measurement: Issues and Practice, 2018

In this ITEMS module, we provide a two-part introduction to the topic of reliability from the perspective of "classical test theory" (CTT). In the first part, which is directed primarily at beginning learners, we review and build on the content presented in the original didactic ITEMS article by Traub and Rowley (1991). Specifically, we…

Descriptors: Test Reliability, Test Theory, Computation, Data Collection

Reliably Assessing Growth with Longitudinal Diagnostic Classification Models

Peer reviewed

Direct link

Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019

Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…

Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests

Impact of Both Local Item Dependencies and Cut-Point Locations on Examinee Classifications

Peer reviewed

Direct link

Rubright, Jonathan D. – Educational Measurement: Issues and Practice, 2018

Performance assessments, scenario-based tasks, and other groups of items carry a risk of violating the local item independence assumption made by unidimensional item response theory (IRT) models. Previous studies have identified negative impacts of ignoring such violations, most notably inflated reliability estimates. Still, the influence of this…

Descriptors: Performance Based Assessment, Item Response Theory, Models, Test Reliability

Using Evidence-Centered Design to Create a Special Educator Observation System

Peer reviewed

Direct link

Johnson, Evelyn S.; Crawford, Angela; Moylan, Laura A.; Zheng, Yuzhu – Educational Measurement: Issues and Practice, 2018

The evidence-centered design framework was used to create a special education teacher observation system, Recognizing Effective Special Education Teachers. Extensive reviews of research informed the domain analysis and modeling stages, and led to the conceptual framework in which effective special education teaching is operationalized as the…

Descriptors: Evidence Based Practice, Special Education Teachers, Observation, Disabilities

The Accuracy of Aggregate Student Growth Percentiles as Indicators of Educator Performance

Peer reviewed

Direct link

Castellano, Katherine E.; McCaffrey, Daniel F. – Educational Measurement: Issues and Practice, 2017

Mean or median student growth percentiles (MGPs) are a popular measure of educator performance, but they lack rigorous evaluation. This study investigates the error in MGP due to test score measurement error (ME). Using analytic derivations, we find that errors in the commonly used MGP are correlated with average prior latent achievement: Teachers…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Value Added Models, Achievement Gains

Exploring the Utility of Sequential Analysis in Studying Informal Formative Assessment Practices

Peer reviewed

Direct link

Furtak, Erin Marie; Ruiz-Primo, Maria Araceli; Bakeman, Roger – Educational Measurement: Issues and Practice, 2017

Formative assessment is a classroom practice that has received much attention in recent years for its established potential at increasing student learning. A frequent analytic approach for determining the quality of formative assessment practices is to develop a coding scheme and determine frequencies with which the codes are observed; however,…

Descriptors: Sequential Approach, Formative Evaluation, Alternative Assessment, Incidence

A Meta-Analysis of Research on the Read Aloud Accommodation

Peer reviewed

Direct link

Buzick, Heather; Stone, Elizabeth – Educational Measurement: Issues and Practice, 2014

Read aloud is a testing accommodation that has been studied by many researchers, and its use on K-12 assessments continues to be debated because of its potential to change the measured construct or unfairly increase test scores. This study is a summary of quantitative research on the read aloud accommodation. Previous studies contributed…

Descriptors: Meta Analysis, Reading Aloud to Others, Educational Research, Statistical Analysis

Survey of the Technical Characteristics of Published Educational Achievement Tests.

Peer reviewed

Hall, Bruce W. – Educational Measurement: Issues and Practice, 1985

A sample (N=37) of currently published achievement tests was surveyed as to the availability of five types of technical data: (1) item selection techniques; (2) standardization; (3) types of norms; (4) types of validating data; and (5) types of reliability data. Recommendations for publishers and cautions for test users are given. (BS)

Descriptors: Achievement Tests, Criterion Referenced Tests, Information Needs, Norm Referenced Tests

Variables in Eliciting Writing Samples.

Peer reviewed

Moran, Mary Ross; And Others – Educational Measurement: Issues and Practice, 1991

Practices identified by experts as critical variables in eliciting writing samples were checked against 12 randomly selected studies using holistic ratings to derive descriptions of inferential statistical results for described samples. The studies often lacked precise information about these variables, limiting understanding of writing evaluation…

Descriptors: Cues, Educational Practices, Examiners, Holistic Evaluation

Test Reliability	15
Test Validity	8
Achievement Gains	4
Error of Measurement	4
Test Construction	4
Decision Making	3
Item Response Theory	3
Models	3
Tests	3
Accuracy	2
Alternative Assessment	2
Classification	2
Comparative Analysis	2
Criterion Referenced Tests	2
Educational Practices	2
Evaluation Criteria	2
Evaluation Methods	2
Growth Models	2
Mathematics Achievement	2
Predictor Variables	2
Psychometrics	2
Scores	2
Simulation	2
Student Evaluation	2
Teacher Evaluation	2
More ▼

Arends, Lidia R.	1
Attali, Yigal	1
Bakeman, Roger	1
Bouwmeester, Samantha	1
Buzick, Heather	1
Castellano, Katherine E.	1
Chajewski, Michael	1
Crawford, Angela	1
Daniel F. McCaffrey	1
Derek C. Briggs	1
Furtak, Erin Marie	1
Hall, Bruce W.	1
Johnson, Evelyn S.	1
Jones, Andrew T.	1
Joseph A. Martineau	1
Katherine E. Castellano	1
Kopp, Jason P.	1
Laurie Davis	1
Lewis, Charlie	1
Madison, Matthew J.	1
McCaffrey, Daniel F.	1
Moran, Mary Ross	1
Moylan, Laura A.	1
Ong, Thai Q.	1
More ▼