ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	23

Descriptor

Equated Scores	23
Statistical Analysis	8
Error of Measurement	7
Methods	6
Test Items	6
College Entrance Examinations	5
Comparative Analysis	5
Multiple Choice Tests	4
Raw Scores	4
Statistical Bias	4
Data Collection	3
Differences	3
Difficulty Level	3
Sample Size	3
Sampling	3
Scores	3
Testing	3
Tests	3
Accuracy	2
Data Analysis	2
Item Response Theory	2
Mathematics Tests	2
Reading Tests	2
Scaling	2
Scoring	2
More ▼

Source

Educational Testing Service

Publication Type

Reports - Research	15
Reports - Evaluative	4
Reports - Descriptive	3
Numerical/Quantitative Data	2
Guides - Classroom - Learner	1
Tests/Questionnaires	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 4	1
Grade 8	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	3
Gates MacGinitie Reading Tests	1
National Merit Scholarship…	1
Preliminary Scholastic…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Equating Test Scores (without IRT). Second Edition

Download full text

Livingston, Samuel A. – Educational Testing Service, 2014

This booklet grew out of a half-day class on equating that author Samuel Livingston teaches for new statistical staff at Educational Testing Service (ETS). The class is a nonmathematical introduction to the topic, emphasizing conceptual understanding and practical applications. The class consists of illustrated lectures, interspersed with…

Descriptors: Equated Scores, Scoring, Self Evaluation (Individuals), Scores

The Single Group with Nearly Equivalent Tests (SiGNET) Design for Equating Very Small Volume Multiple-Choice Tests. Research Report. ETS RR-11-31

Download full text

Grant, Mary C. – Educational Testing Service, 2011

The "single group with nearly equivalent tests" (SiGNET) design proposed here was developed to address the problem of equating scores on multiple-choice test forms with very small single-administration samples. In this design, the majority of items in each new test form consist of items from the previous form, and the new items that were…

Descriptors: Multiple Choice Tests, Equated Scores, Test Items

Equating of Subscores and Weighted Averages under the NEAT Design. Research Report. ETS RR-11-01

Download full text

Sinharay, Sandip; Haberman, Shelby – Educational Testing Service, 2011

Recently, the literature has seen increasing interest in subscores for their potential diagnostic values; for example, one study suggested the report of weighted averages of a subscore and the total score, whereas others showed, for various operational and simulated data sets, that weighted averages, as compared to subscores, lead to more accurate…

Descriptors: Equated Scores, Weighted Scores, Tests, Statistical Analysis

Repeater Effects on Score Equating for a Graduate Admissions Exam. Research Report. ETS RR-11-17

Download full text

Yang, Wen-Ling; Bontya, Andrea M.; Moses, Tim P. – Educational Testing Service, 2011

Using self-reported but empirically verified repeater groups, we analyzed vast amounts of real test data across a wide range of administrations from a graduate admissions examination that was administered in a non-English language to investigate repeater effects on score equating using the nonequivalent groups with anchor test (NEAT) design. Both…

Descriptors: Equated Scores, College Entrance Examinations, Graduate Study, Differences

Equating Subscores Using Total Scaled Scores as an Anchor. Research Report. ETS RR-11-07

Download full text

Puhan, Gautam; Liang, Longjuan – Educational Testing Service, 2011

Because the demand for subscores is ever increasing, this study examined two different approaches for equating subscores: (a) equating a subscore on the new form to the same subscore in the old form using internal common items as the anchor to conduct the equating, and (b) equating a subscore on the new form to the same subscore in the old form…

Descriptors: Equated Scores, Scaling, Raw Scores, Methods

Smoothing and Equating Methods Applied to Different Types of Test Score Distributions and Evaluated with Respect to Multiple Equating Criteria. Research Report. ETS RR-11-20

Download full text

Moses, Tim; Liu, Jinghua – Educational Testing Service, 2011

In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…

Descriptors: Equated Scores, Data Analysis, Scores, Methods

Research on Standard Errors of Equating Differences. Research Report. ETS RR-10-25

Download full text

Moses, Tim; Zhang, Wenmin – Educational Testing Service, 2010

In this paper, the "standard error of equating difference" (SEED) is described in terms of originally proposed kernel equating functions (von Davier, Holland, & Thayer, 2004) and extended to incorporate traditional linear and equipercentile functions. These derivations expand on prior developments of SEEDs and standard errors of equating and…

Descriptors: Equated Scores, Simulation, Testing, Statistical Analysis

Use of Continuous Exponential Families to Link Forms via Anchor Tests. Research Report. ETS RR-11-11

Download full text

Haberman, Shelby J.; Yan, Duanli – Educational Testing Service, 2011

Continuous exponential families are applied to linking test forms via an internal anchor. This application combines work on continuous exponential families for single-group designs and work on continuous exponential families for equivalent-group designs. Results are compared to those for kernel and equipercentile equating in the case of chained…

Descriptors: Equated Scores, Statistical Analysis, Language Tests, Mathematics Tests

Sources of Score Scale Inconsistency. Research Report. ETS RR-11-10

Download full text

Haberman, Shelby J.; Dorans, Neil J. – Educational Testing Service, 2011

For testing programs that administer multiple forms within a year and across years, score equating is used to ensure that scores can be used interchangeably. In an ideal world, samples sizes are large and representative of populations that hardly change over time, and very reliable alternate test forms are built with nearly identical psychometric…

Descriptors: Scores, Reliability, Equated Scores, Test Construction

Can Smoothing Help When Equating with Unrepresentative Small Samples? Research Report. ETS RR-11-09

Download full text

Puhan, Gautam – Educational Testing Service, 2011

The study evaluated the effectiveness of log-linear presmoothing (Holland & Thayer, 1987) on the accuracy of small sample chained equipercentile equatings under two conditions (i.e., using small samples that differed randomly in ability from the target population "versus" using small samples that were distinctly different from the…

Descriptors: Equated Scores, Data Analysis, Accuracy, Sample Size

Principles and Practices of Test Score Equating. Research Report. ETS RR-10-29

Download full text

Dorans, Neil J.; Moses, Tim P.; Eignor, Daniel R. – Educational Testing Service, 2010

Score equating is essential for any testing program that continually produces new editions of a test and for which the expectation is that scores from these editions have the same meaning over time. Particularly in testing programs that help make high-stakes decisions, it is extremely important that test equating be done carefully and accurately.…

Descriptors: Equated Scores, Methods, Data Collection, Data Processing

The Use of Two Anchors in Nonequivalent Groups with Anchor Test (NEAT) Equating. Research Report. ETS RR-10-23

Download full text

Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Educational Testing Service, 2010

In the equating literature, a recurring concern is that equating functions that utilize a single anchor to account for examinee groups' nonequivalence are biased when the groups are extremely different and/or when the anchor only weakly measures what the tests measure. Several proposals have been made to address this equating bias by incorporating…

Descriptors: Equated Scores, Data Collection, Statistical Analysis, Differences

Chained versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data. Research Report. ETS RR-10-06

Download full text

Puhan, Gautam – Educational Testing Service, 2010

This study used real data to construct testing conditions for comparing results of chained linear, Tucker, and Levine-observed score equatings. The comparisons were made under conditions where the new- and old-form samples were similar in ability and when they differed in ability. The length of the anchor test was also varied to enable examination…

Descriptors: Equated Scores, Comparative Analysis, Statistical Analysis, Statistical Bias

Construction of Chained True Score Equipercentile Equatings under the Kernel Equating (KE) Framework and Their Relationship to Levine True Score Equating. Research Report. ETS RR-09-24

Download full text

Chen, Haiwen; Holland, Paul – Educational Testing Service, 2009

In this paper, we develop a new chained equipercentile equating procedure for the nonequivalent groups with anchor test (NEAT) design under the assumptions of the classical test theory model. This new equating is named chained true score equipercentile equating. We also apply the kernel equating framework to this equating design, resulting in a…

Descriptors: True Scores, Equated Scores, Test Theory, Methods

Assessing the Falsifiability of Extreme Linking. Research Report. ETS RR-11-04

Download full text

Middleton, Kyndra; Dorans, Neil J. – Educational Testing Service, 2011

Extreme linkings are performed in settings in which neither equivalent groups nor anchor material is available to link scores on two assessments. Examples of extreme linkages include links between scores on tests administered in different languages or between scores on tests administered across disability groups. The strength of interpretation…

Descriptors: Equated Scores, Testing, Difficulty Level, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2

Dorans, Neil J.	4
Puhan, Gautam	4
Sinharay, Sandip	4
Haberman, Shelby J.	3
Moses, Tim	3
Grant, Mary C.	2
Holland, Paul W.	2
Liang, Longjuan	2
Liu, Jinghua	2
Livingston, Samuel A.	2
Moses, Tim P.	2
Bontya, Andrea M.	1
Chen, Haiwen	1
Curley, Edward	1
Damiano, Michele	1
Deng, Weiling	1
Eignor, Daniel R.	1
Feigenbaum, Miriam	1
Haberman, Shelby	1
Holland, Paul	1
Kim, Sooyeon	1
Lewis, Charles	1
Middleton, Kyndra	1
Ricker, Kathryn L.	1
Tan, Xuan	1
More ▼