Publication Date
In 2025: 0
Since 2024: 0
Since 2021 (last 5 years): 2
Since 2016 (last 10 years): 8
Since 2006 (last 20 years): 74
Source
ETS Research Report Series: 78
Author
von Davier, Alina A.: 15
Kim, Sooyeon: 14
Liu, Jinghua: 10
Moses, Tim: 10
Guo, Hongwen: 8
Livingston, Samuel A.: 8
Holland, Paul: 7
Puhan, Gautam: 6
Walker, Michael E.: 6
Haberman, Shelby J.: 5
Lee, Yi-Hsuan: 5
Publication Type
Journal Articles: 78
Reports - Research: 78
Speeches/Meeting Papers: 4
Numerical/Quantitative Data: 2
Collected Works - General: 1
Tests/Questionnaires: 1
Education Level
Higher Education: 12
Postsecondary Education: 12
High Schools: 2
Secondary Education: 2
Elementary Education: 1
Elementary Secondary Education: 1
Assessments and Surveys
SAT (College Admission Test): 11
ACT Assessment: 2
Advanced Placement…: 2
College Level Examination…: 2
Graduate Record Examinations: 2
Praxis Series: 2
Law School Admission Test: 1
Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021
This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…
Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items
Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2021
Equating the scores from different forms of a test requires collecting data that link the forms. Problems arise when the test forms to be linked are given to groups that are not equivalent and the forms share no common items by which to measure or adjust for this group nonequivalence. We compared three approaches to adjusting for group…
Descriptors: Equated Scores, Weighted Scores, Sampling, Multiple Choice Tests
Lu, Ru; Guo, Hongwen – ETS Research Report Series, 2018
In this paper we compare the newly developed pseudo-equivalent groups (PEG) linking method with the linking methods based on the traditional nonequivalent groups with anchor test (NEAT) design and illustrate how to use the PEG methods under imperfect equating conditions. To do this, we propose a new method that combines the features of PEG…
Descriptors: Equated Scores, Comparative Analysis, Test Items, Background
Manna, Venessa F.; Gu, Lixiong – ETS Research Report Series, 2019
When using the Rasch model, equating with a nonequivalent groups anchor test design is commonly achieved by adjustment of new form item difficulty using an additive equating constant. Using simulated 5-year data, this report compares 4 approaches to calculating the equating constants and the subsequent impact on equating results. The 4 approaches…
Descriptors: Item Response Theory, Test Items, Test Construction, Sample Size
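The additive equating constant the Manna and Gu report refers to, in its simplest (mean/mean) form, is just the difference between the mean anchor-item difficulties on the two forms. A minimal Python sketch; the function name and the difficulty values are hypothetical, not taken from the report:

```python
import statistics

def additive_equating_constant(anchor_b_ref, anchor_b_new):
    """Mean/mean additive equating constant for the Rasch model.

    The constant shifts new-form item difficulties onto the
    reference-form scale: b_adjusted = b_new + constant.
    """
    return statistics.mean(anchor_b_ref) - statistics.mean(anchor_b_new)

# Hypothetical anchor-item difficulties (logits) on each form's own scale.
b_ref = [-0.8, -0.2, 0.1, 0.5, 1.0]
b_new = [-1.0, -0.4, -0.1, 0.3, 0.8]

c = additive_equating_constant(b_ref, b_new)
adjusted = [b + c for b in b_new]
```

Here the new-form difficulties are a uniform shift of the reference-form ones, so the adjusted values land exactly on the reference scale; the report's comparison concerns how differently the 4 ways of computing such a constant behave when the shift is not uniform.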
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2018
Educational assessment data are often collected from a set of test centers across various geographic regions, and therefore the data samples contain clusters. Such cluster-based data may result in clustering effects in variance estimation. However, in many grouped jackknife variance estimation applications, jackknife groups are often formed by a…
Descriptors: Item Response Theory, Scaling, Equated Scores, Cluster Grouping
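The grouped jackknife that Wang, Qian, and Lee discuss deletes one whole group per replicate, so variance contributed by within-group clustering (e.g., within a test center) is retained in the estimate. A minimal sketch under simplifying assumptions (the estimator here is a plain mean; names and data are hypothetical):

```python
import statistics

def grouped_jackknife_variance(values, groups, estimator=statistics.mean):
    """Grouped (delete-a-group) jackknife variance of an estimator.

    values: observations; groups: parallel list of group labels
    (e.g., test centers). Each replicate drops one entire group.
    """
    labels = sorted(set(groups))
    g = len(labels)
    replicates = [
        estimator([v for v, lab in zip(values, groups) if lab != drop])
        for drop in labels
    ]
    mean_rep = statistics.mean(replicates)
    return (g - 1) / g * sum((r - mean_rep) ** 2 for r in replicates)

# Hypothetical scores clustered by test center.
scores = [1, 2, 3, 4, 5, 6]
centers = ["a", "a", "b", "b", "c", "c"]
v = grouped_jackknife_variance(scores, centers)
```

The report's point is about how these groups are formed: if jackknife groups cut across clusters instead of respecting them, the clustering effect leaks out of the variance estimate.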
Lin, Peng; Dorans, Neil; Weeks, Jonathan – ETS Research Report Series, 2016
The nonequivalent groups with anchor test (NEAT) design is frequently used in test score equating or linking. One important assumption of the NEAT design is that the anchor test is a miniversion of the 2 tests to be equated/linked. When the content of the 2 tests is different, it is not possible for the anchor test to be adequately representative…
Descriptors: Equated Scores, Test Length, Test Content, Difficulty Level
Wei, Youhua; Morgan, Rick – ETS Research Report Series, 2016
As an alternative to common-item equating when common items do not function as expected, the single-group growth model (SGGM) scaling uses common examinees or repeaters to link test scores on different forms. The SGGM scaling assumes that, for repeaters taking adjacent administrations, the conditional distribution of scale scores in later…
Descriptors: Equated Scores, Growth Models, Scaling, Computation
Deng, Weiling; Monfils, Lora – ETS Research Report Series, 2017
Using simulated data, this study examined the impact of different levels of stringency of the valid case inclusion criterion on item response theory (IRT)-based true score equating over 5 years in the context of K-12 assessment when growth in student achievement is expected. Findings indicate that the use of the most stringent inclusion criterion…
Descriptors: Item Response Theory, Equated Scores, True Scores, Educational Assessment
Chen, Haiwen; Livingston, Samuel A. – ETS Research Report Series, 2013
This paper presents a new equating method for the nonequivalent groups with anchor test design: poststratification equating based on true anchor scores. The linear version of this method is shown to be equivalent, under certain conditions, to Levine observed score equating, in the same way that the linear version of poststratification equating is…
Descriptors: Equated Scores, Test Items, Methods
Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015
In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…
Descriptors: Test Construction, Equated Scores, Test Items, Sampling
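Jackknifing anchor items, as in the Lu, Haberman, Guo, and Liu report, means recomputing the linking with each anchor item deleted in turn and inspecting the spread of the results. A minimal sketch using a mean-difference linking constant as the recomputed quantity (the report works with full equating functions; names and values here are hypothetical):

```python
import statistics

def leave_one_anchor_out(anchor_ref, anchor_new):
    """Leave-one-anchor-item-out linking constants.

    anchor_ref / anchor_new: per-item difficulty estimates for the same
    anchor items on the reference and new forms. A wide spread in the
    returned constants signals that equating hinges on a few items.
    """
    constants = []
    for i in range(len(anchor_ref)):
        ref = [b for j, b in enumerate(anchor_ref) if j != i]
        new = [b for j, b in enumerate(anchor_new) if j != i]
        constants.append(statistics.mean(ref) - statistics.mean(new))
    return constants

# Hypothetical anchor difficulties: the last item drifts on the new form.
anchor_ref = [0.0, 0.2, 0.4, 1.4]
anchor_new = [0.0, 0.2, 0.4, 0.4]
cs = leave_one_anchor_out(anchor_ref, anchor_new)
```

In this toy example, dropping the drifting fourth item drives the constant to zero while dropping any other item barely moves it, which is exactly the kind of sensitivity the jackknife is meant to expose.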
Livingston, Samuel A.; Chen, Haiwen H. – ETS Research Report Series, 2015
Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…
Descriptors: Scores, Statistical Distributions, Research Reports, Equated Scores
Puhan, Gautam – ETS Research Report Series, 2013
The purpose of this study was to demonstrate that the choice of sample weights when defining the target population under poststratification equating can be a critical factor in determining the accuracy of the equating results under a unique equating scenario, known as "rater comparability scoring and equating." The nature of data…
Descriptors: Scoring, Equated Scores, Sampling, Accuracy
Liu, Jinghua; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2014
Maintaining score interchangeability and scale consistency is crucial for any testing program that administers multiple forms across years. The use of a multiple linking design, which involves equating a new form to multiple old forms and averaging the conversions, has been proposed to control scale drift. However, the use of multiple linking…
Descriptors: Comparative Analysis, Reliability, Test Construction, Equated Scores
Cao, Yi; Lu, Ru; Tao, Wei – ETS Research Report Series, 2014
The local item independence assumption underlying traditional item response theory (IRT) models is often not met for tests composed of testlets. There are 3 major approaches to addressing this issue: (a) ignore the violation and use a dichotomous IRT model (e.g., the 2-parameter logistic [2PL] model), (b) combine the interdependent items to form a…
Descriptors: Item Response Theory, Equated Scores, Test Items, Simulation
Ali, Usama S.; Walker, Michael E. – ETS Research Report Series, 2014
Two methods are currently in use at Educational Testing Service (ETS) for equating observed item difficulty statistics. The first method involves the linear equating of item statistics in an observed sample to reference statistics on the same items. The second method, or the item response curve (IRC) method, involves the summation of conditional…
Descriptors: Difficulty Level, Test Items, Equated Scores, Causal Models