Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 23 |
Descriptor
Source
Educational Testing Service | 23 |
Author
Dorans, Neil J. | 4 |
Puhan, Gautam | 4 |
Sinharay, Sandip | 4 |
Haberman, Shelby J. | 3 |
Moses, Tim | 3 |
Grant, Mary C. | 2 |
Holland, Paul W. | 2 |
Liang, Longjuan | 2 |
Liu, Jinghua | 2 |
Livingston, Samuel A. | 2 |
Moses, Tim P. | 2 |
More ▼ |
Publication Type
Reports - Research | 15 |
Reports - Evaluative | 4 |
Reports - Descriptive | 3 |
Numerical/Quantitative Data | 2 |
Guides - Classroom - Learner | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 3 |
Gates MacGinitie Reading Tests | 1 |
National Merit Scholarship… | 1 |
Preliminary Scholastic… | 1 |
What Works Clearinghouse Rating
Livingston, Samuel A. – Educational Testing Service, 2014
This booklet grew out of a half-day class on equating that author Samuel Livingston teaches for new statistical staff at Educational Testing Service (ETS). The class is a nonmathematical introduction to the topic, emphasizing conceptual understanding and practical applications. The class consists of illustrated lectures, interspersed with…
Descriptors: Equated Scores, Scoring, Self Evaluation (Individuals), Scores
Grant, Mary C. – Educational Testing Service, 2011
The "single group with nearly equivalent tests" (SiGNET) design proposed here was developed to address the problem of equating scores on multiple-choice test forms with very small single-administration samples. In this design, the majority of items in each new test form consist of items from the previous form, and the new items that were…
Descriptors: Multiple Choice Tests, Equated Scores, Test Items
Sinharay, Sandip; Haberman, Shelby – Educational Testing Service, 2011
Recently, the literature has seen increasing interest in subscores for their potential diagnostic values; for example, one study suggested the report of weighted averages of a subscore and the total score, whereas others showed, for various operational and simulated data sets, that weighted averages, as compared to subscores, lead to more accurate…
Descriptors: Equated Scores, Weighted Scores, Tests, Statistical Analysis
Yang, Wen-Ling; Bontya, Andrea M.; Moses, Tim P. – Educational Testing Service, 2011
Using self-reported but empirically verified repeater groups, we analyzed vast amounts of real test data across a wide range of administrations from a graduate admissions examination that was administered in a non-English language to investigate repeater effects on score equating using the nonequivalent groups with anchor test (NEAT) design. Both…
Descriptors: Equated Scores, College Entrance Examinations, Graduate Study, Differences
Puhan, Gautam; Liang, Longjuan – Educational Testing Service, 2011
Because the demand for subscores is ever increasing, this study examined two different approaches for equating subscores: (a) equating a subscore on the new form to the same subscore in the old form using internal common items as the anchor to conduct the equating, and (b) equating a subscore on the new form to the same subscore in the old form…
Descriptors: Equated Scores, Scaling, Raw Scores, Methods
Moses, Tim; Liu, Jinghua – Educational Testing Service, 2011
In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…
Descriptors: Equated Scores, Data Analysis, Scores, Methods
Moses, Tim; Zhang, Wenmin – Educational Testing Service, 2010
In this paper, the "standard error of equating difference" (SEED) is described in terms of originally proposed kernel equating functions (von Davier, Holland, & Thayer, 2004) and extended to incorporate traditional linear and equipercentile functions. These derivations expand on prior developments of SEEDs and standard errors of equating and…
Descriptors: Equated Scores, Simulation, Testing, Statistical Analysis
Use of Continuous Exponential Families to Link Forms via Anchor Tests. Research Report. ETS RR-11-11
Haberman, Shelby J.; Yan, Duanli – Educational Testing Service, 2011
Continuous exponential families are applied to linking test forms via an internal anchor. This application combines work on continuous exponential families for single-group designs and work on continuous exponential families for equivalent-group designs. Results are compared to those for kernel and equipercentile equating in the case of chained…
Descriptors: Equated Scores, Statistical Analysis, Language Tests, Mathematics Tests
Haberman, Shelby J.; Dorans, Neil J. – Educational Testing Service, 2011
For testing programs that administer multiple forms within a year and across years, score equating is used to ensure that scores can be used interchangeably. In an ideal world, samples sizes are large and representative of populations that hardly change over time, and very reliable alternate test forms are built with nearly identical psychometric…
Descriptors: Scores, Reliability, Equated Scores, Test Construction
Puhan, Gautam – Educational Testing Service, 2011
The study evaluated the effectiveness of log-linear presmoothing (Holland & Thayer, 1987) on the accuracy of small sample chained equipercentile equatings under two conditions (i.e., using small samples that differed randomly in ability from the target population "versus" using small samples that were distinctly different from the…
Descriptors: Equated Scores, Data Analysis, Accuracy, Sample Size
Dorans, Neil J.; Moses, Tim P.; Eignor, Daniel R. – Educational Testing Service, 2010
Score equating is essential for any testing program that continually produces new editions of a test and for which the expectation is that scores from these editions have the same meaning over time. Particularly in testing programs that help make high-stakes decisions, it is extremely important that test equating be done carefully and accurately.…
Descriptors: Equated Scores, Methods, Data Collection, Data Processing
Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Educational Testing Service, 2010
In the equating literature, a recurring concern is that equating functions that utilize a single anchor to account for examinee groups' nonequivalence are biased when the groups are extremely different and/or when the anchor only weakly measures what the tests measure. Several proposals have been made to address this equating bias by incorporating…
Descriptors: Equated Scores, Data Collection, Statistical Analysis, Differences
Puhan, Gautam – Educational Testing Service, 2010
This study used real data to construct testing conditions for comparing results of chained linear, Tucker, and Levine-observed score equatings. The comparisons were made under conditions where the new- and old-form samples were similar in ability and when they differed in ability. The length of the anchor test was also varied to enable examination…
Descriptors: Equated Scores, Comparative Analysis, Statistical Analysis, Statistical Bias
Chen, Haiwen; Holland, Paul – Educational Testing Service, 2009
In this paper, we develop a new chained equipercentile equating procedure for the nonequivalent groups with anchor test (NEAT) design under the assumptions of the classical test theory model. This new equating is named chained true score equipercentile equating. We also apply the kernel equating framework to this equating design, resulting in a…
Descriptors: True Scores, Equated Scores, Test Theory, Methods
Middleton, Kyndra; Dorans, Neil J. – Educational Testing Service, 2011
Extreme linkings are performed in settings in which neither equivalent groups nor anchor material is available to link scores on two assessments. Examples of extreme linkages include links between scores on tests administered in different languages or between scores on tests administered across disability groups. The strength of interpretation…
Descriptors: Equated Scores, Testing, Difficulty Level, Test Reliability
Previous Page | Next Page ยป
Pages: 1 | 2