Klieger, David M.; Kotloff, Lauren J.; Belur, Vinetha; Schramm-Possinger, Megan E.; Holtzman, Steven L.; Bunde, Hezekiah – ETS Research Report Series, 2022
Intended consequences of giving applicants the option to select which test scores to report include potentially reducing measurement error and inequity in applicants' prior test familiarity. Our first study determined whether score choice options resulted in unintended consequences for lower performing subgroups by detrimentally increasing score…
Descriptors: College Entrance Examinations, Graduate Study, Scores, High Stakes Tests
Wang, Lin – ETS Research Report Series, 2019
Rearranging response options in different versions of a test of multiple-choice items can be an effective strategy against cheating on the test. This study investigated if rearranging response options would affect item performance and test score comparability. A study test was assembled as the base version from which 3 variant versions were…
Descriptors: Multiple Choice Tests, Test Items, Test Format, Scores
Lu, Ru; Guo, Hongwen – ETS Research Report Series, 2018
In this paper we compare the newly developed pseudo-equivalent groups (PEG) linking method with the linking methods based on the traditional nonequivalent groups with anchor test (NEAT) design and illustrate how to use the PEG methods under imperfect equating conditions. To do this, we proposed a new method that combines the features of PEG…
Descriptors: Equated Scores, Comparative Analysis, Test Items, Background
Bruno, James V.; Cahill, Aoife; Gyawali, Binod – ETS Research Report Series, 2016
We present an annotation scheme for classifying differences in the outputs of syntactic constituency parsers when a gold standard is unavailable or undesired, as in the case of texts written by nonnative speakers of English. We discuss its automated implementation and the results of a case study that uses the scheme to choose a parser best suited…
Descriptors: Documentation, Classification, Differences, Syntax
von Davier, Alina A.; Chen, Haiwen – ETS Research Report Series, 2013
In the framework of the observed-score equating methods for the nonequivalent groups with anchor test design, there are 3 fundamentally different ways of using the information provided by the anchor scores to equate the scores of a new form to those of an old form. One method uses the anchor scores as a conditioning variable, such as the Tucker…
Descriptors: Equated Scores, Item Response Theory, True Scores, Methods
Burstein, Jill; Flor, Michael; Tetreault, Joel; Madnani, Nitin; Holtzman, Steven – ETS Research Report Series, 2012
This annotation study is designed to help us gain an increased understanding of paraphrase strategies used by native and nonnative English speakers and how these strategies might affect test takers' essay scores. Toward that end, this study aims to examine and analyze the paraphrase and the types of linguistic modifications used in paraphrase in…
Descriptors: Essay Tests, Scores, Native Speakers, English (Second Language)
Gao, Rui; He, Wei; Ruan, Chunyi – ETS Research Report Series, 2012
In this study, we investigated whether preequating results agree with equating results that are based on observed operational data (postequating) for a college placement program. Specifically, we examined the degree to which item response theory (IRT) true score preequating results agreed with those from IRT true score postequating and from…
Descriptors: College Entrance Examinations, Student Placement, Item Response Theory, True Scores
Paek, Insu – ETS Research Report Series, 2009
Three statistical testing procedures well-known in the maximum likelihood approach are the Wald, likelihood ratio (LR), and score tests. Although well-known, the application of these three testing procedures in the logistic regression method to investigate differential item functioning (DIF) has not been rigorously made yet. Employing a variety of…
Descriptors: Test Bias, Statistical Analysis, Regression (Statistics), Maximum Likelihood Statistics
Moses, Tim – ETS Research Report Series, 2008
Nine statistical strategies for selecting equating functions in an equivalent groups design were evaluated. The strategies of interest were likelihood ratio chi-square tests, regression tests, Kolmogorov-Smirnov tests, and significance tests for equated score differences. The most accurate strategies in the study were the likelihood ratio tests…
Descriptors: Equated Scores, Statistical Analysis, Statistical Significance, Regression (Statistics)
Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007
This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…
Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis
Puhan, Gautam; Larkin, Kevin C.; Rupp, Stacie L. – ETS Research Report Series, 2006
This study examined population invariance of equating functions over subgroups defined by ethnicity on a teacher certification test. Investigating subgroup equating invariance was important because the total group who took this test consists of two subgroups (i.e., Hispanic and non-Hispanic) and the Hispanic group is a distinctively more able…
Descriptors: Equated Scores, Licensing Examinations (Professions), Teacher Certification, Ethnicity