Showing 1 to 15 of 24 results
Peer reviewed
PDF on ERIC (download full text)
Laukaityte, Inga; Wiberg, Marie – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine the effects of differences in group ability and of anchor test form features on equating bias and the standard error of equating (SEE), using both real and simulated data. Chained kernel equating, poststratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
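For orientation, here is a minimal sketch of the kernel equating idea under simplifying assumptions: each form's discrete score distribution is continuized as a mixture of Gaussians, and a form-X score is mapped through F_Y^{-1}(F_X(x)). The score points, probabilities, and bandwidth are invented, and the mean- and variance-preserving adjustments of the full kernel method are omitted.

```python
# Minimal kernel-equating sketch (equivalent-groups design). All data are
# invented; the full method's mean/variance-preserving continuization and
# bandwidth selection are omitted for brevity.
import numpy as np
from scipy.stats import norm

def kernel_cdf(x, scores, probs, h):
    """Continuized CDF: a mixture of Gaussians centered at the score points."""
    return float(np.sum(probs * norm.cdf((x - scores) / h)))

def equate(x, scores_x, probs_x, scores_y, probs_y, h=0.6):
    """Map a form-X score to the form-Y scale via F_Y^{-1}(F_X(x))."""
    p = kernel_cdf(x, scores_x, probs_x, h)
    grid = np.linspace(scores_y.min() - 3, scores_y.max() + 3, 2001)
    cdf_y = np.array([kernel_cdf(g, scores_y, probs_y, h) for g in grid])
    return float(np.interp(p, cdf_y, grid))  # numerical inverse of F_Y

scores = np.arange(11)                        # toy 0-10 score scale
rng = np.random.default_rng(1)
px, py = rng.dirichlet(np.ones(11)), rng.dirichlet(np.ones(11))
print([round(equate(x, scores, px, scores, py), 2) for x in (3, 5, 8)])
```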
Peer reviewed
Direct link
Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing
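As a toy numerical illustration of that definition (not the author's analysis), the snippet below shows anchor-item difficulty drift between two administrations that is negligible for one subpopulation but pronounced for another; all parameter values are invented.

```python
# Toy illustration of subpopulation item parameter drift (SIPD): difficulty
# (b) estimates for three anchor items, by subpopulation and administration.
# All numbers are invented.
import numpy as np

b_time1 = {"group_A": np.array([-0.5, 0.0, 0.6]),
           "group_B": np.array([-0.5, 0.0, 0.6])}
b_time2 = {"group_A": np.array([-0.5, 0.1, 0.6]),   # essentially stable
           "group_B": np.array([-0.1, 0.5, 1.0])}   # marked drift

for group in b_time1:
    drift = b_time2[group] - b_time1[group]
    print(group, "mean drift in b:", round(float(drift.mean()), 2))
# Drift that depends on the subpopulation (A vs. B) is SIPD; ordinary item
# parameter drift would shift all groups' estimates alike.
```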
Peer reviewed
PDF on ERIC (download full text)
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
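As background for the methods named here, this is a rough sketch of IRT true-score equating for dichotomous 2PL items, assuming both forms are already calibrated to a common theta scale; the item parameters are invented, and the mapping is defined only for scores inside the range of the test characteristic curve.

```python
# IRT true-score equating sketch for 2PL items on a common theta scale.
# Item parameters are invented for illustration.
import numpy as np
from scipy.optimize import brentq

def tcc(theta, a, b):
    """Test characteristic curve: expected number-correct score at theta."""
    return float(np.sum(1.0 / (1.0 + np.exp(-a * (theta - b)))))

def true_score_equate(x, ax, bx, ay, by):
    """Find theta with TCC_X(theta) = x, then return TCC_Y(theta)."""
    theta = brentq(lambda t: tcc(t, ax, bx) - x, -8, 8)
    return tcc(theta, ay, by)

ax, bx = np.array([1.0, 1.2, 0.8]), np.array([-0.5, 0.0, 0.7])
ay, by = np.array([0.9, 1.1, 1.0]), np.array([-0.3, 0.2, 0.5])
print(round(true_score_equate(1.5, ax, bx, ay, by), 3))
```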
Peer reviewed
PDF on ERIC (download full text)
Li, Yanmei – ETS Research Report Series, 2012
In a common-item (anchor) equating design, the common items should be evaluated for item parameter drift. Drifted items are often removed. For a test that contains mostly dichotomous items and only a small number of polytomous items, removing some drifted polytomous anchor items may result in anchor sets that no longer resemble mini-versions of…
Descriptors: Scores, Item Response Theory, Equated Scores, Simulation
Moses, Tim; Zhang, Wenmin – Educational Testing Service, 2010
In this paper, the "standard error of equating difference" (SEED) is described in terms of originally proposed kernel equating functions (von Davier, Holland, & Thayer, 2004) and extended to incorporate traditional linear and equipercentile functions. These derivations expand on prior developments of SEEDs and standard errors of equating and…
Descriptors: Equated Scores, Simulation, Testing, Statistical Analysis
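The SEED in the paper is an analytic, delta-method quantity; as a loose stand-in for readers who want a concrete picture, the sketch below approximates the standard error of the difference between a linear and a crude equipercentile equating function by bootstrapping invented score data. This is a substitute illustration, not the authors' derivation.

```python
# Bootstrap stand-in for a SEED: standard deviation, over resamples, of the
# difference between linear and (crude) equipercentile equating functions.
# Data and design are invented.
import numpy as np

def linear_equate(x, sx, sy):
    return sy.mean() + sy.std() * (x - sx.mean()) / sx.std()

def equipercentile_equate(x, sx, sy):
    p = (sx <= x).mean()
    return np.quantile(sy, p)

rng = np.random.default_rng(0)
form_x = rng.binomial(40, 0.60, size=500)   # toy form-X scores
form_y = rng.binomial(40, 0.55, size=500)   # toy form-Y scores
points = np.arange(15, 31)

diffs = []
for _ in range(500):                        # bootstrap replications
    bx = rng.choice(form_x, form_x.size)
    by = rng.choice(form_y, form_y.size)
    diffs.append([linear_equate(x, bx, by) - equipercentile_equate(x, bx, by)
                  for x in points])
print(np.round(np.std(diffs, axis=0), 2))   # bootstrap SEED at each point
```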
Peer reviewed
PDF on ERIC (download full text)
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
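The abstract does not spell out the linking procedure, so purely for orientation here is a minimal sketch of one standard anchor-based scale transformation, the mean-sigma method, applied to invented anchor difficulty estimates.

```python
# Mean-sigma IRT linking sketch: estimate the scale transformation
# theta_Y = A * theta_X + B from anchor-item difficulty (b) estimates
# obtained in the two separate calibrations. Values are invented.
import numpy as np

def mean_sigma(b_anchor_x, b_anchor_y):
    """Return slope A and intercept B of the form-X-to-form-Y transform."""
    A = np.std(b_anchor_y) / np.std(b_anchor_x)
    B = np.mean(b_anchor_y) - A * np.mean(b_anchor_x)
    return A, B

bx = np.array([-1.2, -0.4, 0.1, 0.8, 1.5])  # anchor b's, form-X calibration
by = np.array([-1.0, -0.2, 0.3, 1.1, 1.9])  # same items, form-Y calibration
A, B = mean_sigma(bx, by)
print(f"A = {A:.3f}, B = {B:.3f}")          # then b' = A*b + B, a' = a / A
```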
Peer reviewed
Direct link
Moses, Tim; Zhang, Wenmin – Journal of Educational and Behavioral Statistics, 2011
The purpose of this article was to extend the use of standard errors for equated score differences (SEEDs) to traditional equating functions. The SEEDs are described in terms of their original proposal for kernel equating functions and extended so that SEEDs for traditional linear and traditional equipercentile equating functions can be computed.…
Descriptors: Equated Scores, Error Patterns, Evaluation Research, Statistical Analysis
Duong, Minh Quang – ProQuest LLC, 2011
Testing programs often use multiple test forms of the same test to control item exposure and to ensure test security. Although test forms are constructed to be as similar as possible, they often differ. Test equating techniques are those statistical methods used to adjust scores obtained on different test forms of the same test so that they are…
Descriptors: Equated Scores, Statistical Analysis, Item Response Theory, Evaluation Criteria
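The simplest concrete instance of such a score adjustment is linear equating, which matches the means and standard deviations of the two forms; a minimal sketch with invented summary statistics:

```python
# Linear equating: the form-X score is re-expressed so that the two forms'
# means and standard deviations agree. Summary statistics are invented.
def linear_equate(x, mu_x, sd_x, mu_y, sd_y):
    """Map a form-X score onto the form-Y scale."""
    return mu_y + sd_y * (x - mu_x) / sd_x

print(linear_equate(30, mu_x=28.0, sd_x=6.0, mu_y=31.0, sd_y=5.5))
```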
Peer reviewed
Direct link
Duong, Minh Q.; von Davier, Alina A. – International Journal of Testing, 2012
Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…
Descriptors: Ability Grouping, Difficulty Level, Psychometrics, Statistical Analysis
Tian, Feng – ProQuest LLC, 2011
There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…
Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis
Peer reviewed
Direct link
Wang, Tianyou – Journal of Educational and Behavioral Statistics, 2009
Holland and colleagues derived a formula for the analytical standard error of equating using the delta method for the kernel equating method. Extending their derivation, this article derives an analytical standard error of equating procedure for conventional percentile-rank-based equipercentile equating with log-linear smoothing. This procedure is…
Descriptors: Error of Measurement, Equated Scores, Statistical Analysis, Statistical Inference
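For context, here is a compact sketch of the conventional percentile-rank equipercentile equating that the article's standard-error derivation targets, shown without the log-linear presmoothing step; the score frequencies are invented.

```python
# Percentile-rank equipercentile equating (no smoothing). Frequencies are
# invented; both forms use the same 0-6 integer score scale.
import numpy as np

def percentile_rank(freqs):
    """PR at each integer score, as a proportion: F(x-1) + f(x)/2."""
    p = freqs / freqs.sum()
    return np.cumsum(p) - p / 2.0

def equipercentile(freq_x, freq_y):
    """Equate each form-X integer score to the form-Y scale."""
    pr_x, pr_y = percentile_rank(freq_x), percentile_rank(freq_y)
    scores_y = np.arange(len(freq_y))
    return np.interp(pr_x, pr_y, scores_y)  # invert Y's PR function

fx = np.array([2, 5, 9, 14, 11, 6, 3], float)  # form-X counts at scores 0-6
fy = np.array([1, 4, 8, 12, 13, 8, 4], float)  # form-Y counts at scores 0-6
print(np.round(equipercentile(fx, fy), 2))
```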
Peer reviewed
Direct link
Moses, Tim; Holland, Paul W. – Journal of Educational Measurement, 2009
In this study, we compared 12 statistical strategies proposed for selecting loglinear models for smoothing univariate test score distributions and for enhancing the stability of equipercentile equating functions. The major focus was on evaluating the effects of the selection strategies on equating function accuracy. Selection strategies' influence…
Descriptors: Equated Scores, Selection, Statistical Analysis, Models
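One strategy of the kind compared in these studies is to fit polynomial log-linear models of increasing degree to the score frequencies and keep the degree that minimizes AIC. The sketch below does this with statsmodels' Poisson GLM on invented counts; it illustrates the general approach, not the authors' exact procedure.

```python
# Select a polynomial log-linear smoothing model by AIC. Score counts are
# invented; statsmodels fits each candidate model as a Poisson GLM.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
scores = np.arange(41.0)
counts = rng.poisson(200 * np.exp(-0.5 * ((scores - 24) / 7) ** 2))

s = (scores - scores.mean()) / scores.std()   # center/scale for stability
best = None
for degree in range(1, 6):
    X = sm.add_constant(np.column_stack([s**d for d in range(1, degree + 1)]))
    fit = sm.GLM(counts, X, family=sm.families.Poisson()).fit()
    if best is None or fit.aic < best[1]:
        best = (degree, fit.aic, fit.fittedvalues)  # smoothed frequencies

degree, aic, smoothed = best
print(f"selected degree {degree}, AIC = {aic:.1f}")
```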
Peer reviewed
PDF on ERIC (download full text)
Moses, Tim; Holland, Paul – ETS Research Report Series, 2008
This study addressed two issues in using loglinear models for smoothing univariate test score distributions and for enhancing the stability of equipercentile equating functions. One issue was a comparative assessment of several statistical strategies that have been proposed for selecting one from among several competing model parameterizations. Another…
Descriptors: Equated Scores, Selection, Models, Statistical Analysis
Peer reviewed
Direct link
Goldman, Robert N.; McKenzie, John D. Jr. – Teaching Statistics: An International Journal for Teachers, 2009
We explain how to simulate both univariate and bivariate raw data sets having specified values for common summary statistics. The first example illustrates how to "construct" a data set having prescribed values for the mean and the standard deviation, for a one-sample t test with a specified outcome. The second shows how to create a bivariate data…
Descriptors: Correlation, Equated Scores, Statistical Analysis, Weighted Scores
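The first example's trick can be shown directly: standardize a raw sample, then shift and rescale it so the mean and standard deviation come out exactly as prescribed. The target values below are invented.

```python
# Construct a data set with exactly a prescribed mean and SD by
# standardizing a raw sample and rescaling. Targets are invented.
import numpy as np

def with_mean_sd(x, mean, sd):
    """Shift/scale x to the given mean and SD (population SD, ddof=0;
    use ddof=1 in both places to target the sample SD instead)."""
    z = (x - x.mean()) / x.std()
    return mean + sd * z

rng = np.random.default_rng(3)
sample = with_mean_sd(rng.normal(size=25), mean=100.0, sd=15.0)
print(sample.mean(), sample.std())  # 100.0 and 15.0, up to rounding
```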
Peer reviewed
Direct link
Moses, Tim – Journal of Educational and Behavioral Statistics, 2008
Equating functions are supposed to be population invariant, meaning that the choice of subpopulation used to compute the equating function should not matter. The extent to which equating functions are population invariant is typically assessed in terms of practical difference criteria that do not account for equating functions' sampling…
Descriptors: Equated Scores, Error of Measurement, Sampling, Evaluation Methods
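A typical practical-difference criterion in this literature is the root mean squared difference (RMSD) between subpopulation and total-group equating functions at each score point; the sketch below computes it from invented equated values.

```python
# RMSD population-invariance criterion at each score point, weighting each
# subgroup by its proportion of the population. All values are invented.
import numpy as np

def rmsd(total, subgroups, weights):
    """Weighted RMSD of subgroup equating functions around the total-group one."""
    sq = sum(w * (sub - total) ** 2 for sub, w in zip(subgroups, weights))
    return np.sqrt(sq)

total = np.array([10.2, 15.1, 20.3, 25.0])   # total-group equated scores
sub_a = np.array([10.0, 15.0, 20.5, 25.2])   # subgroup A
sub_b = np.array([10.5, 15.3, 20.1, 24.8])   # subgroup B
print(np.round(rmsd(total, [sub_a, sub_b], [0.6, 0.4]), 3))
```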