ERIC - Search Results

Showing all 12 results

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis
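Note: for readers unfamiliar with the "commonly used expression" of coefficient alpha that the abstract refers to, the sketch below computes the conventional formula for summed scores, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). It is only an illustration of that textbook expression; it does not reproduce the paper's reconceptualization for scaled scores, and the function name and toy data are assumptions made for this example.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Conventional coefficient alpha for summed scores.

    scores: list of examinee rows, each a list of k item scores.
    (Hypothetical helper for illustration; not taken from the paper.)
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("coefficient alpha needs at least two items")
    # Sample variance of each item across examinees.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the summed (total) score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Toy data: three items that rank four examinees identically.
toy_scores = [[1, 1, 1], [0, 0, 0], [1, 1, 1], [0, 0, 0]]
alpha = coefficient_alpha(toy_scores)
print(round(alpha, 3))
```

Because the three toy items are perfectly consistent with one another, alpha reaches its upper bound of 1 here; real item data would normally give a lower value.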

Growth across Grades and Common Item Grade Alignment in Vertical Scaling Using the Rasch Model

Peer reviewed

Student, Sanford R.; Briggs, Derek C.; Davis, Laurie – Educational Measurement: Issues and Practice, 2025

Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…

Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students
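Note: as background for the abstract above, the sketch below shows the Rasch model's item response function and one simple way a linking constant can be estimated from common items, namely the difference in their mean difficulty between two separate calibrations. The mean-difference approach, the function names, and the difficulty values are assumptions chosen for illustration; the abstract does not say which linking method or data the authors used.

```python
import math
from statistics import fmean


def rasch_probability(theta, b):
    """Probability of a correct response under the Rasch model,
    for a person with ability theta and an item with difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def linking_constant(b_base, b_new):
    """Shift that places the new calibration on the base scale.

    b_base, b_new: difficulties of the same common items, estimated
    separately in the base and new calibrations (hypothetical inputs).
    Adding the returned shift to new-scale values expresses them on
    the base scale.
    """
    return fmean(b_base) - fmean(b_new)


# Made-up common-item difficulties from two grade-level calibrations.
base_difficulties = [0.5, 1.0, 1.5]
new_difficulties = [-0.5, 0.0, 0.5]
shift = linking_constant(base_difficulties, new_difficulties)
print(shift)
print(rasch_probability(0.0, 0.0))
```

Which common items are used (upper-grade, lower-grade, or mixed-grade) changes the difficulty values entering this calculation, and therefore the constant that underlies the measured growth between grades.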

A Comparison of Two Alternate Scaling Approaches Employed for Task Analyses in Credentialing Examination Development

Peer reviewed

Fidler, James R.; Risk, Nicole M. – Educational Measurement: Issues and Practice, 2019

Credentialing examination developers rely on task (job) analyses for establishing inventories of task and knowledge areas in which competency is required for safe and successful practice in target occupations. There are many ways in which task-related information may be gathered from practitioner ratings, each with its own advantage and…

Descriptors: Job Analysis, Scaling, Licensing Examinations (Professions), Test Construction

Digital ITEMS Module 03: Nonparametric Item Response Theory

Peer reviewed

Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2018

In this digital ITEMS module, we introduce the framework of nonparametric item response theory (IRT), in particular Mokken scaling, which can be used to evaluate fundamental measurement properties with less strict assumptions than parametric IRT models. We walk through the key distinction between parametric and nonparametric models, introduce the…

Descriptors: Educational Assessment, Nonparametric Statistics, Item Response Theory, Scaling

How Robust Are Cross-Country Comparisons of PISA Scores to the Scaling Model Used?

Peer reviewed

Jerrim, John; Parker, Philip; Choi, Alvaro; Chmielewski, Anna Katyn; Sälzer, Christine; Shure, Nikki – Educational Measurement: Issues and Practice, 2018

The Programme for International Student Assessment (PISA) is an important international study of 15-year-olds' knowledge and skills. New results are released every 3 years, and have a substantial impact upon education policy. Yet, despite its influence, the methodology underpinning PISA has received significant criticism. Much of this criticism has…

Descriptors: Educational Assessment, Comparative Education, Achievement Tests, Foreign Countries

An Instructional Module on Mokken Scale Analysis

Peer reviewed

Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2017

Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) that can be used to evaluate fundamental measurement properties with less strict assumptions than parametric IRT models. This instructional module provides an introduction to MSA as a probabilistic-nonparametric framework in which to explore…

Descriptors: Probability, Nonparametric Statistics, Item Response Theory, Scaling

Within-High-School versus Across-High-School Scaling of Admissions Assessments: Implications for Validity and Diversity Effects

Peer reviewed

Kostal, Jack W.; Sackett, Paul R.; Kuncel, Nathan R.; Walmsley, Philip T.; Stemig, Melissa S. – Educational Measurement: Issues and Practice, 2017

Previous research has established that SAT scores and high school grade point average (HSGPA) differ in their predictive power and in the size of mean differences across racial/ethnic groups. However, the SAT is scaled nationally across all test takers while HSGPA is scaled locally within a school. In this study, the researchers propose that this…

Descriptors: College Entrance Examinations, Scaling, Grade Point Average, Differences

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

Standard-Setting Methods as Measurement Processes

Peer reviewed

Nichols, Paul; Twing, Jon; Mueller, Canda D.; O'Malley, Kimberly – Educational Measurement: Issues and Practice, 2010

Some writers in the measurement literature have been skeptical of the meaningfulness of achievement standards and described the standard-setting process as blatantly arbitrary. We argue that standard setting is more appropriately conceived of as a measurement process similar to student assessment. The construct being measured is the panelists'…

Descriptors: Scaling, Achievement, Standard Setting (Scoring), Measurement

Scaling: An ITEMS Module

Peer reviewed

Tong, Ye; Kolen, Michael J. – Educational Measurement: Issues and Practice, 2010

"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…

Descriptors: Test Results, Scaling, Measures (Individuals), Raw Scores

The Impact of Vertical Scaling Decisions on Growth Interpretations

Peer reviewed

Briggs, Derek C.; Weeks, Jonathan P. – Educational Measurement: Issues and Practice, 2009

Most growth models implicitly assume that test scores have been vertically scaled. What may not be widely appreciated are the different choices that must be made when creating a vertical score scale. In this paper empirical patterns of growth in student achievement are compared as a function of different approaches to creating a vertical scale.…

Descriptors: Scaling, Models, Longitudinal Studies, Academic Achievement

Examining Contextual Effects in a Practice Analysis: An Application of Dual Scaling

Peer reviewed

De Champlain, Andre F.; Cuddy, Monica M.; LaDuca, Tony – Educational Measurement: Issues and Practice, 2007

Practice analyses are routinely used in support of the development of occupational and professional certification and licensure examinations. These analyses usually survey incumbents to obtain importance ratings of (1) specific tasks and (2) knowledge, skill, and ability (KSA) statements deemed by subject matter experts as essential to safe and…

Descriptors: Scaling, Licensing Examinations (Professions), Context Effect, Rating Scales
