NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)2
Since 2006 (last 20 years)10
Audience
Researchers3
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 25 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Dorans, Neil J. – ETS Research Report Series, 2018
A distinction is made between scores as measures of a construct and predictions of a criterion or outcome variable. The interpretation attached to predictions of criteria, such as job performance or college grade point average (GPA), differs from that attached to scores that are measures of a construct, such as reading proficiency or knowledge…
Descriptors: Job Performance, Scores, Data Interpretation, Statistical Distributions
Peer reviewed Peer reviewed
Direct linkDirect link
Kostal, Jack W.; Sackett, Paul R.; Kuncel, Nathan R.; Walmsley, Philip T.; Stemig, Melissa S. – Educational Measurement: Issues and Practice, 2017
Previous research has established that SAT scores and high school grade point average (HSGPA) differ in their predictive power and in the size of mean differences across racial/ethnic groups. However, the SAT is scaled nationally across all test takers while HSGPA is scaled locally within a school. In this study, the researchers propose that this…
Descriptors: College Entrance Examinations, Scaling, Grade Point Average, Differences
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014
Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for themost part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…
Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Guo, Hongwen; Liu, Jinghua; Curley, Edward; Dorans, Neil – ETS Research Report Series, 2012
This study examines the stability of the "SAT Reasoning Test"™ score scales from 2005 to 2010. A 2005 old form (OF) was administered along with a 2010 new form (NF). A new conversion for OF was derived through direct equipercentile equating. A comparison of the newly derived and the original OF conversions showed that Critical Reading…
Descriptors: Aptitude Tests, Cognitive Tests, Thinking Skills, Equated Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Gao, Rui; He, Wei; Ruan, Chunyi – ETS Research Report Series, 2012
In this study, we investigated whether preequating results agree with equating results that are based on observed operational data (postequating) for a college placement program. Specifically, we examined the degree to which item response theory (IRT) true score preequating results agreed with those from IRT true score postequating and from…
Descriptors: College Entrance Examinations, Student Placement, Item Response Theory, True Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Liu, Jinghua; Curley, Edward; Low, Albert – ETS Research Report Series, 2009
This study examines the stability of the SAT® scale from 1994 to 2001. A 1994 form and a 2001 form were readministered in a 2005 SAT administration, and the 1994 form was equated to the 2001 form. The new conversion was compared to the old conversion. Both the verbal and math sections exhibit a similar degree of scale drift, but in opposite…
Descriptors: College Entrance Examinations, Scaling, Verbal Tests, Mathematics Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Shaw, Emily J.; Mattern, Krista D. – College Board, 2009
This study examined the relationship between students' self-reported high school grade point average (HSGPA) from the SAT Questionnaire and their HSGPA provided by the colleges and universities they attend. The purpose of this research was to offer updated information on the relatedness of self-reported (by the student) and school-reported (by the…
Descriptors: High School Students, Grade Point Average, Accuracy, Aptitude Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Haberman, Shelby J.; Guo, Hongwen; Liu, Jinghua; Dorans, Neil J. – ETS Research Report Series, 2008
This study uses historical data to explore the consistency of SAT® I: Reasoning Test score conversions and to examine trends in scaled score means. During the period from April 1995 to December 2003, both Verbal (V) and Math (M) means display substantial seasonality, and a slight increasing trend for both is observed. SAT Math means increase more…
Descriptors: College Entrance Examinations, Thinking Skills, Logical Thinking, Scaling
Peer reviewed Peer reviewed
Petersen, Nancy S.; And Others – Journal of Educational Statistics, 1983
Three methods of test equating (linear, equipercentile, and item response theory) were investigated with respect to the issue of scale drift. Results indicate that all three models work well in limited settings but that the item response theory approach provided the most stable results overall. (JKS)
Descriptors: College Entrance Examinations, Comparative Analysis, Equated Scores, Item Analysis
Cook, Linda L.; And Others – 1988
Scaling is carried out in an effort to increase the comparability of scores obtained on different tests. This study explored the relationships between College Board Achievement Test scores and potential scaling covariates for various subgroups of the test-taking population with the goal of providing several alternatives to traditionally used…
Descriptors: Achievement Tests, College Entrance Examinations, Comparative Analysis, Correlation
Marco, Gary L. – 1984
Using raw-to-scaled-score conversions derived from test-score equating to link item-parameter estimates from the one-parameter (Rasch) and three-parameter logistic models, this study evaluated an indirect method for converting item response theory estimates to a common scale. Data were taken from Petersen's Scholastic Aptitude Test (SAT) scale…
Descriptors: College Entrance Examinations, Equated Scores, Estimation (Mathematics), Latent Trait Theory
Eignor, Daniel R. – 1993
Procedures used to establish the comparability of scores derived from the College Board Admissions Testing Program (ATP) computer adaptive Scholastic Aptitude Test (SAT) prototype and the paper-and-pencil SAT are described in this report. Both the prototype, which is made up of Verbal and Mathematics computer adaptive tests (CATs), and a form of…
Descriptors: Adaptive Testing, College Entrance Examinations, Comparative Analysis, Computer Assisted Testing
Lewis, Charles – 1982
The nonparametric approach to test theory discussed here has its roots in the early work of Guttman, Lazarsfeld, and Meredith; and more recently in the work of Cliff and in Tatsuoka and Tatsuoka. Mokken's extensive treatment of this subject concentrated on defining, constructing, and testing unidimensional scales, based on responses to dichotomous…
Descriptors: Computer Oriented Programs, Estimation (Mathematics), Item Analysis, Latent Trait Theory
Previous Page | Next Page »
Pages: 1  |  2