ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	15

Descriptor

College Entrance Examinations	20
Equated Scores	13
Comparative Analysis	10
Test Items	10
Statistical Analysis	8
Mathematics Tests	6
Test Construction	6
Critical Reading	5
Difficulty Level	5
Error of Measurement	5
Raw Scores	5
Scores	5
Verbal Ability	5
Gender Differences	4
Verbal Tests	4
Evaluation Methods	3
High School Students	3
Scaling	3
Test Bias	3
Test Format	3
Testing Programs	3
Accuracy	2
Aptitude Tests	2
Cognitive Tests	2
Correlation	2
More ▼

Source

ETS Research Report Series	9
College Board	2
College Entrance Examination…	2
Educational Testing Service	2
Journal of Educational…	2
Educational Measurement:…	1
Educational and Psychological…	1

Author

Liu, Jinghua	20
Dorans, Neil J.	7
Feigenbaum, Miriam	7
Curley, Edward	6
Guo, Hongwen	4
Sinharay, Sandip	3
Dorans, Neil	2
Holland, Paul W.	2
Allspach, Jill R.	1
Burton, Nancy	1
Cahn, Miriam F.	1
Carey, Jill	1
Cook, Linda	1
Deng, Weiling	1
Dion, Gloria	1
Haberman, Shelby J.	1
Harvey, Anne	1
Holland, Paul	1
Jackson, Carol	1
Klag, Patricia	1
Low, Albert	1
Low, Albert C.	1
Moses, Tim	1
Oh, Hyeon-Joo	1
Schuppan, Fred	1
More ▼

Publication Type

Reports - Research	19
Journal Articles	13
Tests/Questionnaires	2
Numerical/Quantitative Data	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Higher Education	13
Postsecondary Education	13
High Schools	4
Secondary Education	3
Elementary Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	19
Praxis Series	2

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

A Comparison of Raw-to-Scale Conversion Consistency between Single- and Multiple-Linking Using a Nonequivalent Groups Anchor Test Design. Research Report. ETS RR-14-13

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2014

Maintaining score interchangeability and scale consistency is crucial for any testing programs that administer multiple forms across years. The use of a multiple linking design, which involves equating a new form to multiple old forms and averaging the conversions, has been proposed to control scale drift. However, the use of multiple linking…

Descriptors: Comparative Analysis, Reliability, Test Construction, Equated Scores

Assessing a Critical Aspect of Construct Continuity when Test Specifications Change or Test Forms Deviate from Specifications

Peer reviewed

Direct link

Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013

We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…

Descriptors: Tests, Test Construction, Test Format, Change

Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014

The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data.This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis

Multiple Linking in Equating and Random Scale Drift. Research Report. ETS RR-11-46

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Liu, Jinghua; Dorans, Neil; Feigenbaum, Miriam – ETS Research Report Series, 2011

Maintaining score stability is crucial for an ongoing testing program that administers several tests per year over many years. One way to stall the drift of the score scale is to use an equating design with multiple links. In this study, we use the operational and experimental SAT® data collected from 44 administrations to investigate the effect…

Descriptors: Equated Scores, College Entrance Examinations, Reliability, Testing Programs

Test Score Equating Using a Mini-Version Anchor and a Midi Anchor: A Case Study Using SAT[R] Data

Peer reviewed

Direct link

Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Curley, Edward; Feigenbaum, Miriam – Journal of Educational Measurement, 2011

This study explores an anchor that is different from the traditional miniature anchor in test score equating. In contrast to a traditional "mini" anchor that has the same spread of item difficulties as the tests to be equated, the studied anchor, referred to as a "midi" anchor (Sinharay & Holland), has a smaller spread of…

Descriptors: Equated Scores, Case Studies, College Entrance Examinations, Test Items

Observed Score Equating Using a Mini-Version Anchor and an Anchor with Less Spread of Difficulty: A Comparison Study

Peer reviewed

Direct link

Liu, Jinghua; Sinharay, Sandip; Holland, Paul; Feigenbaum, Miriam; Curley, Edward – Educational and Psychological Measurement, 2011

Two different types of anchors are investigated in this study: a mini-version anchor and an anchor that has a less spread of difficulty than the tests to be equated. The latter is referred to as a midi anchor. The impact of these two different types of anchors on observed score equating are evaluated and compared with respect to systematic error…

Descriptors: Equated Scores, Test Items, Difficulty Level, Statistical Bias

Constructed-Response DIF Evaluations for Mixed-Format Tests. Research Report. ETS RR-13-33

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013

In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis

The Stability of the Score Scales for the "SAT Reasoning Test"™ from 2005 to 2010. Research Report. ETS RR-12-15

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Liu, Jinghua; Curley, Edward; Dorans, Neil – ETS Research Report Series, 2012

This study examines the stability of the "SAT Reasoning Test"™ score scales from 2005 to 2010. A 2005 old form (OF) was administered along with a 2010 new form (NF). A new conversion for OF was derived through direct equipercentile equating. A comparison of the newly derived and the original OF conversions showed that Critical Reading…

Descriptors: Aptitude Tests, Cognitive Tests, Thinking Skills, Equated Scores

A Scale Drift Study. Research Report. ETS RR-09-43

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Curley, Edward; Low, Albert – ETS Research Report Series, 2009

This study examines the stability of the SAT® scale from 1994 to 2001. A 1994 form and a 2001 form were readministered in a 2005 SAT administration, and the 1994 form was equated to the 2001 form. The new conversion was compared to the old conversion. Both the verbal and math sections exhibit a similar degree of scale drift, but in opposite…

Descriptors: College Entrance Examinations, Scaling, Verbal Tests, Mathematics Tests

The Effects of Different Types of Anchor Tests on Observed Score Equating. Research Report. ETS RR-09-41

Download full text

Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward – Educational Testing Service, 2009

This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating were evaluated and compared with respect to systematic…

Descriptors: Equated Scores, Test Items, Difficulty Level, Error of Measurement

Score Equity Assessment:Development of a Prototype Analysis Using SAT[R] Mathematics Test Data Across Several Administrations. Research Report. ETS RR-09-08

Download full text

Dorans, Neil J.; Liu, Jinghua – Educational Testing Service, 2009

The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…

Descriptors: Testing Programs, Mathematics Tests, Quality Control, Psychometrics

Development of Approximations to Population Invariance Indices. Research Report. ETS RR-08-36

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Zhu, Xiaowen – ETS Research Report Series, 2008

The purpose of this paper is to explore methods to approximate population invariance without conducting multiple linkings for subpopulations. Under the single group or equivalent groups design, no linking needs to be performed for the parallel-linear system linking functions. The unequated raw score information can be used as an approximation. For…

Descriptors: Raw Scores, Test Format, Comparative Analysis, Test Construction

Consistency of SAT® I: Reasoning Test Score Conversions. Research Report. ETS RR-08-67

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Guo, Hongwen; Liu, Jinghua; Dorans, Neil J. – ETS Research Report Series, 2008

This study uses historical data to explore the consistency of SAT® I: Reasoning Test score conversions and to examine trends in scaled score means. During the period from April 1995 to December 2003, both Verbal (V) and Math (M) means display substantial seasonality, and a slight increasing trend for both is observed. SAT Math means increase more…

Descriptors: College Entrance Examinations, Thinking Skills, Logical Thinking, Scaling

An Exploration of Kernel Equating Using SAT® Data: Equating to a Similar Population and to a Distant Population. Research Report. ETS RR-07-17

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Low, Albert C. – ETS Research Report Series, 2007

This study applied kernel equating (KE) in two scenarios: equating to a very similar population and equating to a very different population, referred to as a distant population, using SAT® data. The KE results were compared to the results obtained from analogous classical equating methods in both scenarios. The results indicate that KE results are…

Descriptors: College Entrance Examinations, Equated Scores, Comparative Analysis, Evaluation Methods

SAT[R] Program Calculator Use Survey.

Download full text

Dion, Gloria; Harvey, Anne; Jackson, Carol; Klag, Patricia; Liu, Jinghua; Wright, Craig – 2000

The Educational Testing Service studied the status of calculator use in high school classrooms. Responses were received from 4,568 high schools. Thirty-eight percent of those who were mailed a survey participated, and 12% of those who received e-mail invitations responded. Results indicate that the prevailing policy in U.S. high schools is to…

Descriptors: Calculators, College Entrance Examinations, Educational Policy, High School Students

Previous Page | Next Page »

Pages: 1 | 2