ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	13

Descriptor

Equated Scores	17
Statistical Analysis	10
Error of Measurement	7
Probability	5
Test Items	5
Equations (Mathematics)	4
Item Response Theory	4
Simulation	4
Sample Size	3
Sampling	3
Statistical Distributions	3
College Entrance Examinations	2
Computation	2
Evaluation Methods	2
Test Bias	2
Test Format	2
Weighted Scores	2
Achievement Tests	1
Bayesian Statistics	1
Causal Models	1
College Students	1
Data Collection	1
Difficulty Level	1
Educational Testing	1
Error Patterns	1
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	17
Reports - Evaluative	7
Reports - Research	6
Reports - Descriptive	4

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Education	1
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Sweden

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…	1
Measures of Academic Progress	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

Testing Differential Item Functioning without Predefined Anchor Items Using Robust Regression

Peer reviewed

Direct link

Wang, Weimeng; Liu, Yang; Liu, Hongyun – Journal of Educational and Behavioral Statistics, 2022

Differential item functioning (DIF) occurs when the probability of endorsing an item differs across groups for individuals with the same latent trait level. The presence of DIF items may jeopardize the validity of an instrument; therefore, it is crucial to identify DIF items in routine operations of educational assessment. While DIF detection…

Descriptors: Test Bias, Test Items, Equated Scores, Regression (Statistics)

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

A Bayesian Nonparametric Latent Approach for Score Distributions in Test Equating

Peer reviewed

Direct link

Varas, Inés M.; González, Jorge; Quintana, Fernando A. – Journal of Educational and Behavioral Statistics, 2020

Equating is a family of statistical models and methods used to adjust scores on different test forms so that they can be comparable and used interchangeably. Equated scores are obtained estimating the equating transformation function, which maps the scores on the scale of one test form into their equivalents on the scale of other one. All the…

Descriptors: Bayesian Statistics, Nonparametric Statistics, Equated Scores, Statistical Analysis

Kernel Equating Using Propensity Scores for Nonequivalent Groups

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2019

When equating two test forms, the equated scores will be biased if the test groups differ in ability. To adjust for the ability imbalance between nonequivalent groups, a set of common items is often used. When no common items are available, it has been suggested to use covariates correlated with the test scores instead. In this article, we reduce…

Descriptors: Equated Scores, Test Items, Probability, College Entrance Examinations

Lord's Equity Theorem Revisited

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019

Lord's (1980) equity theorem claims observed-score equating to be possible only when two test forms are perfectly reliable or strictly parallel. An analysis of its proof reveals use of an incorrect statistical assumption. The assumption does not invalidate the theorem itself though, which can be shown to follow directly from the discrete nature of…

Descriptors: Equated Scores, Testing Problems, Item Response Theory, Evaluation Methods

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale

Peer reviewed
PDF on ERIC

Download full text

Direct link

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level. We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that aggregate linkages can be validated both…

Descriptors: Equated Scores, Validity, Methods, School Districts

Pseudo-Equivalent Groups and Linking

Peer reviewed

Direct link

Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2015

Adjustment by minimum discriminant information provides an approach to linking test forms in the case of a nonequivalent groups design with no satisfactory common items. This approach employs background information on individual examinees in each administration so that weighted samples of examinees form pseudo-equivalent groups in the sense that…

Descriptors: Equated Scores, Statistical Analysis, Tests, Weighted Scores

Equating without an Anchor for Nonequivalent Groups of Examinees

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015

An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…

Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring

Standard Errors of Equating Differences: Prior Developments, Extensions, and Simulations

Peer reviewed

Direct link

Moses, Tim; Zhang, Wenmin – Journal of Educational and Behavioral Statistics, 2011

The purpose of this article was to extend the use of standard errors for equated score differences (SEEDs) to traditional equating functions. The SEEDs are described in terms of their original proposal for kernel equating functions and extended so that SEEDs for traditional linear and traditional equipercentile equating functions can be computed.…

Descriptors: Equated Scores, Error Patterns, Evaluation Research, Statistical Analysis

Standard Errors of Equating for the Percentile Rank-Based Equipercentile Equating with Log-Linear Presmoothing

Peer reviewed

Direct link

Wang, Tianyou – Journal of Educational and Behavioral Statistics, 2009

Holland and colleagues derived a formula for analytical standard error of equating using the delta-method for the kernel equating method. Extending their derivation, this article derives an analytical standard error of equating procedure for the conventional percentile rank-based equipercentile equating with log-linear smoothing. This procedure is…

Descriptors: Error of Measurement, Equated Scores, Statistical Analysis, Statistical Inference

New Results on the Linear Equating Methods for the Non-Equivalent-Groups Design

Peer reviewed

Direct link

von Davier, Alina A. – Journal of Educational and Behavioral Statistics, 2008

The two most common observed-score equating functions are the linear and equipercentile functions. These are often seen as different methods, but von Davier, Holland, and Thayer showed that any equipercentile equating function can be decomposed into linear and nonlinear parts. They emphasized the dominant role of the linear part of the nonlinear…

Descriptors: Equated Scores, Causal Models, Structural Equation Models, Data Collection

Using the Kernel Method of Test Equating for Estimating the Standard Errors of Population Invariance Measures

Peer reviewed

Direct link

Moses, Tim – Journal of Educational and Behavioral Statistics, 2008

Equating functions are supposed to be population invariant, meaning that the choice of subpopulation used to compute the equating function should not matter. The extent to which equating functions are population invariant is typically assessed in terms of practical difference criteria that do not account for equating functions' sampling…

Descriptors: Equated Scores, Error of Measurement, Sampling, Evaluation Methods

A Unified Approach to Linear Equating for the Nonequivalent Groups Design

Peer reviewed

Direct link

von Davier, Alina A.; Kong, Nan – Journal of Educational and Behavioral Statistics, 2005

This article describes a new, unified framework for linear equating in a non-equivalent groups anchor test (NEAT) design. The authors focus on three methods for linear equating in the NEAT design--Tucker, Levine observed-score, and chain--and develop a common parameterization that shows that each particular equating method is a special case of the…

Descriptors: Equations (Mathematics), Sample Size, Statistical Distributions, Error of Measurement

Item Response Theory True Score Equatings and Their Standard Errors.

Peer reviewed

Ogasawara, Haruhiko – Journal of Educational and Behavioral Statistics, 2001

Provides asymptotic standard errors of the estimates of equated scores from several types of item response theory (IRT) true score equatings. Equating designs considered cover those with internal or external common items and separate or simultaneous estimation. Uses marginal maximum likelihood estimation for the estimation of item parameters. (SLD)

Descriptors: Equated Scores, Error of Measurement, Estimation (Mathematics), Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2

Moses, Tim	2
Wallin, Gabriel	2
Wiberg, Marie	2
van der Linden, Wim J.	2
von Davier, Alina A.	2
Cheng, Philip E.	1
González, Jorge	1
Haberman, Shelby J.	1
Ho, Andrew D.	1
Kalogrides, Demetra	1
Kong, Nan	1
Liou, Michelle	1
Little, Roderick J. A.	1
Liu, Hongyun	1
Liu, Yang	1
Longford, Nicholas T.	1
Ogasawara, Haruhiko	1
Quintana, Fernando A.	1
Reardon, Sean F.	1
Rubin, Donald B.	1
Varas, Inés M.	1
Wang, Tianyou	1
Wang, Weimeng	1
Zhang, Wenmin	1
More ▼