ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	18

Descriptor

Raw Scores	20
Equated Scores	15
Comparative Analysis	9
Test Items	8
Error of Measurement	7
Statistical Analysis	7
Test Format	6
College Entrance Examinations	5
Scaling	5
Scores	5
Accuracy	4
Item Response Theory	4
Multiple Choice Tests	4
Sample Size	4
Computation	3
Correlation	3
Difficulty Level	3
Language Tests	3
Simulation	3
Verbal Ability	3
Bayesian Statistics	2
Critical Reading	2
English (Second Language)	2
Evaluation Criteria	2
Models	2
More ▼

Source

ETS Research Report Series

Publication Type

Journal Articles	20
Reports - Research	20
Tests/Questionnaires	2

Education Level

Higher Education	5
Postsecondary Education	5
High Schools	2
Secondary Education	2

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	5
ACT Assessment	1
College Level Examination…	1
Graduate Record Examinations	1
Praxis Series	1
Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

Effect of Statistically Matching Equating Samples for Common-Item Equating. Research Report. ETS RR-21-02

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021

This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…

Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items

Comparisons among Approaches to Link Tests Using Random Samples Selected under Suboptimal Conditions. Research Report. ETS RR-21-14

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2021

Equating the scores from different forms of a test requires collecting data that link the forms. Problems arise when the test forms to be linked are given to groups that are not equivalent and the forms share no common items by which to measure or adjust for this group nonequivalence. We compared three approaches to adjusting for group…

Descriptors: Equated Scores, Weighted Scores, Sampling, Multiple Choice Tests

The Pseudo-Equivalent Groups Approach as an Alternative to Common-Item Equating. Research Report. ETS RR-18-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Lu, Ru – ETS Research Report Series, 2018

The purpose of this study was to evaluate the effectiveness of linking test scores by using test takers' background data to form pseudo-equivalent groups (PEG) of test takers. Using 4 operational test forms that each included 100 items and were taken by more than 30,000 test takers, we created 2 half-length research forms that had either 20…

Descriptors: Test Items, Item Banks, Difficulty Level, Comparative Analysis

A Comparison of Score Aggregation Methods for Unidimensional Tests on Different Dimensions. Research Report. ETS RR-18-01

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018

In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…

Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests

An Evaluation of the Single-Group Growth Model as an Alternative to Common-Item Equating. Research Report. ETS RR-16-01

Peer reviewed
PDF on ERIC

Download full text

Wei, Youhua; Morgan, Rick – ETS Research Report Series, 2016

As an alternative to common-item equating when common items do not function as expected, the single-group growth model (SGGM) scaling uses common examinees or repeaters to link test scores on different forms. The SGGM scaling assumes that, for repeaters taking adjacent administrations, the conditional distribution of scale scores in later…

Descriptors: Equated Scores, Growth Models, Scaling, Computation

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

Use of Jackknifing to Evaluate Effects of Anchor Item Selection on Equating with the Nonequivalent Groups with Anchor Test (NEAT) Design. Research Report. ETS RR-15-10

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015

In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…

Descriptors: Test Construction, Equated Scores, Test Items, Sampling

A Criterion to Evaluate the Individual Raw-to-Scale Equating Conversions. Research Report. ETS RR-13-05

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Puhan, Gautam; Walker, Michael – ETS Research Report Series, 2013

In this study we investigated when an equating conversion line is problematic in terms of gaps and clumps. We suggest using the conditional standard error of measurement (CSEM) to measure the scale scores that are inappropriate in the overall raw-to-scale transformation.

Descriptors: Equated Scores, Test Items, Evaluation Criteria, Error of Measurement

Demographically Adjusted Groups for Equating Test Scores. Research Report. ETS RR-14-30

Peer reviewed
PDF on ERIC

Download full text

Livingston, Samuel A. – ETS Research Report Series, 2014

In this study, I investigated 2 procedures intended to create test-taker groups of equal ability by poststratifying on a composite variable created from demographic information. In one procedure, the stratifying variable was the composite variable that best predicted the test score. In the other procedure, the stratifying variable was the…

Descriptors: Demography, Equated Scores, Cluster Grouping, Ability Grouping

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

The Stability of the Score Scales for the "SAT Reasoning Test"™ from 2005 to 2010. Research Report. ETS RR-12-15

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Liu, Jinghua; Curley, Edward; Dorans, Neil – ETS Research Report Series, 2012

This study examines the stability of the "SAT Reasoning Test"™ score scales from 2005 to 2010. A 2005 old form (OF) was administered along with a 2010 new form (NF). A new conversion for OF was derived through direct equipercentile equating. A comparison of the newly derived and the original OF conversions showed that Critical Reading…

Descriptors: Aptitude Tests, Cognitive Tests, Thinking Skills, Equated Scores

Does Preequating Work? An Investigation into a Preequated Testlet-Based College Placement Exam Using Postadministration Data. Research Report. ETS RR-12-12

Peer reviewed
PDF on ERIC

Download full text

Gao, Rui; He, Wei; Ruan, Chunyi – ETS Research Report Series, 2012

In this study, we investigated whether preequating results agree with equating results that are based on observed operational data (postequating) for a college placement program. Specifically, we examined the degree to which item response theory (IRT) true score preequating results agreed with those from IRT true score postequating and from…

Descriptors: College Entrance Examinations, Student Placement, Item Response Theory, True Scores

Impossible Scores Resulting in Zero Frequencies in the Anchor Test: Impact on Smoothing and Equating. Research Report. ETS RR-08-10

Peer reviewed
PDF on ERIC

Download full text

Puhan, Gautam; vonDavier, Alina; Gupta, Shaloo – ETS Research Report Series, 2008

Equating under the external anchor design is frequently conducted using scaled scores on the anchor test. However, scaled scores often lead to the unique problem of creating zero frequencies in the score distribution because there may not always be a one-to-one correspondence between raw and scaled scores. For example, raw scores of 17 and 18 may…

Descriptors: Equated Scores, Test Items, Raw Scores, Statistical Analysis

Investigating the Effectiveness of Collateral Information on Small-Sample Equating. Research Report. ETS RR-08-52

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Linvingston, Samuel A.; Lewis, Charles – ETS Research Report Series, 2008

This paper describes an empirical evaluation of a Bayesian procedure for equating scores on test forms taken by small numbers of examinees, using collateral information from the equating of other test forms. In this procedure, a separate Bayesian estimate is derived for the equated score at each raw-score level, making it unnecessary to specify a…

Descriptors: Equated Scores, Statistical Analysis, Sample Size, Bayesian Statistics

Development of Approximations to Population Invariance Indices. Research Report. ETS RR-08-36

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Zhu, Xiaowen – ETS Research Report Series, 2008

The purpose of this paper is to explore methods to approximate population invariance without conducting multiple linkings for subpopulations. Under the single group or equivalent groups design, no linking needs to be performed for the parallel-linear system linking functions. The unequated raw score information can be used as an approximation. For…

Descriptors: Raw Scores, Test Format, Comparative Analysis, Test Construction

Previous Page | Next Page »

Pages: 1 | 2

Kim, Sooyeon	6
Liu, Jinghua	5
Guo, Hongwen	4
Livingston, Samuel A.	3
Lu, Ru	3
Puhan, Gautam	2
Curley, Edward	1
Dorans, Neil	1
Dorans, Neil J.	1
Feng, Yuling	1
Fu, Jianbin	1
Gao, Rui	1
Gupta, Shaloo	1
Haberman, Shelby	1
Haberman, Shelby J.	1
He, Wei	1
Lewis, Charles	1
Liao, Chi-Wen	1
Linvingston, Samuel A.	1
Low, Albert C.	1
Morgan, Rick	1
Moses, Tim	1
Ruan, Chunyi	1
Sinharay, Sandip	1
Tannenbaum, Richard J.	1
More ▼