ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	10

Source

ETS Research Report Series	6
College Entrance Examination…	3
Educational Measurement:…	2
Measurement:…	2
College Board	1
Journal of Educational…	1

Publication Type

Reports - Research	16
Journal Articles	11
Reports - Evaluative	5
Speeches/Meeting Papers	4
Opinion Papers	2
Reports - Descriptive	1

Education Level

Higher Education	11
Postsecondary Education	11
High Schools	5
Secondary Education	5
Elementary Secondary Education	2

Audience

Researchers

Location

United Kingdom (England)	2
United Kingdom (Wales)	2
United States	2
Australia	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	25
ACT Assessment	4
Graduate Record Examinations	3
Advanced Placement…	2
College Board Achievement…	2
College Level Examination…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Providing a Context for Interpreting Predictions of Job Performance. Research Report. ETS RR-18-38

Peer reviewed
PDF on ERIC

Download full text

Dorans, Neil J. – ETS Research Report Series, 2018

A distinction is made between scores as measures of a construct and predictions of a criterion or outcome variable. The interpretation attached to predictions of criteria, such as job performance or college grade point average (GPA), differs from that attached to scores that are measures of a construct, such as reading proficiency or knowledge…

Descriptors: Job Performance, Scores, Data Interpretation, Statistical Distributions

Within-High-School versus Across-High-School Scaling of Admissions Assessments: Implications for Validity and Diversity Effects

Peer reviewed

Direct link

Kostal, Jack W.; Sackett, Paul R.; Kuncel, Nathan R.; Walmsley, Philip T.; Stemig, Melissa S. – Educational Measurement: Issues and Practice, 2017

Previous research has established that SAT scores and high school grade point average (HSGPA) differ in their predictive power and in the size of mean differences across racial/ethnic groups. However, the SAT is scaled nationally across all test takers while HSGPA is scaled locally within a school. In this study, the researchers propose that this…

Descriptors: College Entrance Examinations, Scaling, Grade Point Average, Differences

Estimating Item Difficulty with Comparative Judgments. Research Report. ETS RR-14-39

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014

Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for themost part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…

Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations

The Stability of the Score Scales for the "SAT Reasoning Test"™ from 2005 to 2010. Research Report. ETS RR-12-15

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Liu, Jinghua; Curley, Edward; Dorans, Neil – ETS Research Report Series, 2012

This study examines the stability of the "SAT Reasoning Test"™ score scales from 2005 to 2010. A 2005 old form (OF) was administered along with a 2010 new form (NF). A new conversion for OF was derived through direct equipercentile equating. A comparison of the newly derived and the original OF conversions showed that Critical Reading…

Descriptors: Aptitude Tests, Cognitive Tests, Thinking Skills, Equated Scores

Does Preequating Work? An Investigation into a Preequated Testlet-Based College Placement Exam Using Postadministration Data. Research Report. ETS RR-12-12

Peer reviewed
PDF on ERIC

Download full text

Gao, Rui; He, Wei; Ruan, Chunyi – ETS Research Report Series, 2012

In this study, we investigated whether preequating results agree with equating results that are based on observed operational data (postequating) for a college placement program. Specifically, we examined the degree to which item response theory (IRT) true score preequating results agreed with those from IRT true score postequating and from…

Descriptors: College Entrance Examinations, Student Placement, Item Response Theory, True Scores

A Scale Drift Study. Research Report. ETS RR-09-43

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Curley, Edward; Low, Albert – ETS Research Report Series, 2009

This study examines the stability of the SAT® scale from 1994 to 2001. A 1994 form and a 2001 form were readministered in a 2005 SAT administration, and the 1994 form was equated to the 2001 form. The new conversion was compared to the old conversion. Both the verbal and math sections exhibit a similar degree of scale drift, but in opposite…

Descriptors: College Entrance Examinations, Scaling, Verbal Tests, Mathematics Tests

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

Examining the Accuracy of Self-Reported High School Grade Point Average. Research Report No. 2009-5

Download full text

Shaw, Emily J.; Mattern, Krista D. – College Board, 2009

This study examined the relationship between students' self-reported high school grade point average (HSGPA) from the SAT Questionnaire and their HSGPA provided by the colleges and universities they attend. The purpose of this research was to offer updated information on the relatedness of self-reported (by the student) and school-reported (by the…

Descriptors: High School Students, Grade Point Average, Accuracy, Aptitude Tests

Consistency of SAT® I: Reasoning Test Score Conversions. Research Report. ETS RR-08-67

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Guo, Hongwen; Liu, Jinghua; Dorans, Neil J. – ETS Research Report Series, 2008

This study uses historical data to explore the consistency of SAT® I: Reasoning Test score conversions and to examine trends in scaled score means. During the period from April 1995 to December 2003, both Verbal (V) and Math (M) means display substantial seasonality, and a slight increasing trend for both is observed. SAT Math means increase more…

Descriptors: College Entrance Examinations, Thinking Skills, Logical Thinking, Scaling

IRT versus Conventional Equating Methods: A Comparative Study of Scale Stability.

Peer reviewed

Petersen, Nancy S.; And Others – Journal of Educational Statistics, 1983

Three methods of test equating (linear, equipercentile, and item response theory) were investigated with respect to the issue of scale drift. Results indicate that all three models work well in limited settings but that the item response theory approach provided the most stable results overall. (JKS)

Descriptors: College Entrance Examinations, Comparative Analysis, Equated Scores, Item Analysis

Achievement Test Scaling.

Download full text

Cook, Linda L.; And Others – 1988

Scaling is carried out in an effort to increase the comparability of scores obtained on different tests. This study explored the relationships between College Board Achievement Test scores and potential scaling covariates for various subgroups of the test-taking population with the goal of providing several alternatives to traditionally used…

Descriptors: Achievement Tests, College Entrance Examinations, Comparative Analysis, Correlation

An Evaluation of an Indirect Method of Transforming Item Parameter Estimates from Item Response Theory to a Common Scale.

Marco, Gary L. – 1984

Using raw-to-scaled-score conversions derived from test-score equating to link item-parameter estimates from the one-parameter (Rasch) and three-parameter logistic models, this study evaluated an indirect method for converting item response theory estimates to a common scale. Data were taken from Petersen's Scholastic Aptitude Test (SAT) scale…

Descriptors: College Entrance Examinations, Equated Scores, Estimation (Mathematics), Latent Trait Theory

Deriving Comparable Scores for Computer Adaptive and Conventional Tests: An Example Using the SAT.

Download full text

Eignor, Daniel R. – 1993

Procedures used to establish the comparability of scores derived from the College Board Admissions Testing Program (ATP) computer adaptive Scholastic Aptitude Test (SAT) prototype and the paper-and-pencil SAT are described in this report. Both the prototype, which is made up of Verbal and Mathematics computer adaptive tests (CATs), and a form of…

Descriptors: Adaptive Testing, College Entrance Examinations, Comparative Analysis, Computer Assisted Testing

Developments in Nonparametric Ability Estimation.

Lewis, Charles – 1982

The nonparametric approach to test theory discussed here has its roots in the early work of Guttman, Lazarsfeld, and Meredith; and more recently in the work of Cliff and in Tatsuoka and Tatsuoka. Mokken's extensive treatment of this subject concentrated on defining, constructing, and testing unidimensional scales, based on responses to dichotomous…

Descriptors: Computer Oriented Programs, Estimation (Mathematics), Item Analysis, Latent Trait Theory

Previous Page | Next Page »

Pages: 1 | 2

Scaling	25
College Entrance Examinations	19
Equated Scores	16
Comparative Analysis	12
Scores	10
Mathematics Tests	7
Latent Trait Theory	6
Statistical Analysis	6
Correlation	5
High School Students	5
Item Analysis	5
Measurement Techniques	5
Psychometrics	5
Standardized Tests	5
Test Construction	5
Test Items	5
Achievement Tests	4
Aptitude Tests	4
Classification	4
Mathematical Models	4
Research Methodology	4
Statistical Studies	4
Verbal Tests	4
Estimation (Mathematics)	3
High Schools	3
More ▼

Dorans, Neil J.	6
Liu, Jinghua	3
Marco, Gary L.	3
Curley, Edward	2
Eignor, Daniel R.	2
Guo, Hongwen	2
Angoff, William H.	1
Attali, Yigal	1
Boldt, Robert F.	1
Cook, Linda L.	1
Dorans, Neil	1
Gao, Rui	1
Haberman, Shelby J.	1
He, Wei	1
Jackson, Carol	1
Kostal, Jack W.	1
Kuncel, Nathan R.	1
Lewis, Charles	1
Low, Albert	1
Mattern, Krista D.	1
Newton, Paul E.	1
Petersen, Nancy S.	1
Ruan, Chunyi	1
Sackett, Paul R.	1
More ▼