ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	15

Descriptor

Equated Scores	15
Test Items	9
Difficulty Level	7
College Entrance Examinations	5
Comparative Analysis	5
Item Response Theory	4
Statistical Analysis	4
Correlation	3
Error of Measurement	3
Test Bias	3
Tests	3
Accuracy	2
English (Second Language)	2
Evaluation	2
Language Proficiency	2
Licensing Examinations…	2
Multiple Choice Tests	2
Prediction	2
Psychometrics	2
Simulation	2
Statistical Bias	2
Bias	1
Case Studies	1
Data	1
Data Analysis	1
More ▼

Source

Educational Testing Service	4
Journal of Educational…	4
ETS Research Report Series	3
Educational Measurement:…	2
Educational and Psychological…	1
Psychometrika	1

Author

Sinharay, Sandip	15
Holland, Paul W.	7
Curley, Edward	3
Feigenbaum, Miriam	3
Holland, Paul	3
Liu, Jinghua	3
Dorans, Neil J.	2
Han, Ning	2
Liang, Longjuan	2
von Davier, Alina A.	2
Haberman, Shelby	1
Haberman, Shelby J.	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	9
Reports - Descriptive	3
Reports - Evaluative	3
Numerical/Quantitative Data	2

Education Level

Elementary Education	1
High Schools	1
Secondary Education	1

Audience

Location

United States

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	4
National Merit Scholarship…	1
Preliminary Scholastic…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

On the Choice of Anchor Tests in Equating

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

Equating of Augmented Subscores

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Journal of Educational Measurement, 2011

Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008b) suggested reporting an augmented subscore that is a linear combination of a subscore and the total score. Sinharay and Haberman (2008) and Sinharay (2010) showed that augmented subscores often lead to more accurate…

Descriptors: Diagnostic Tests, Psychometrics, Testing, Equated Scores

Equating of Subscores and Weighted Averages under the NEAT Design. Research Report. ETS RR-11-01

Download full text

Sinharay, Sandip; Haberman, Shelby – Educational Testing Service, 2011

Recently, the literature has seen increasing interest in subscores for their potential diagnostic values; for example, one study suggested the report of weighted averages of a subscore and the total score, whereas others showed, for various operational and simulated data sets, that weighted averages, as compared to subscores, lead to more accurate…

Descriptors: Equated Scores, Weighted Scores, Tests, Statistical Analysis

Test Score Equating Using a Mini-Version Anchor and a Midi Anchor: A Case Study Using SAT[R] Data

Peer reviewed

Direct link

Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Curley, Edward; Feigenbaum, Miriam – Journal of Educational Measurement, 2011

This study explores an anchor that is different from the traditional miniature anchor in test score equating. In contrast to a traditional "mini" anchor that has the same spread of item difficulties as the tests to be equated, the studied anchor, referred to as a "midi" anchor (Sinharay & Holland), has a smaller spread of…

Descriptors: Equated Scores, Case Studies, College Entrance Examinations, Test Items

The Missing Data Assumptions of the NEAT Design and Their Implications for Test Equating

Peer reviewed

Direct link

Sinharay, Sandip; Holland, Paul W. – Psychometrika, 2010

The Non-Equivalent groups with Anchor Test (NEAT) design involves "missing data" that are "missing by design." Three nonlinear observed score equating methods used with a NEAT design are the "frequency estimation equipercentile equating" (FEEE), the "chain equipercentile equating" (CEE), and the "item-response-theory observed-score-equating" (IRT…

Descriptors: Equated Scores, Item Response Theory, Tests, Data Analysis

Observed Score Equating Using a Mini-Version Anchor and an Anchor with Less Spread of Difficulty: A Comparison Study

Peer reviewed

Direct link

Liu, Jinghua; Sinharay, Sandip; Holland, Paul; Feigenbaum, Miriam; Curley, Edward – Educational and Psychological Measurement, 2011

Two different types of anchors are investigated in this study: a mini-version anchor and an anchor that has a less spread of difficulty than the tests to be equated. The latter is referred to as a midi anchor. The impact of these two different types of anchors on observed score equating are evaluated and compared with respect to systematic error…

Descriptors: Equated Scores, Test Items, Difficulty Level, Statistical Bias

A New Approach to Comparing Several Equating Methods in the Context of the NEAT Design

Peer reviewed

Direct link

Sinharay, Sandip; Holland, Paul W. – Journal of Educational Measurement, 2010

The nonequivalent groups with anchor test (NEAT) design involves missing data that are missing by design. Three equating methods that can be used with a NEAT design are the frequency estimation equipercentile equating method, the chain equipercentile equating method, and the item-response-theory observed-score-equating method. We suggest an…

Descriptors: Equated Scores, Item Response Theory, Comparative Analysis, Evaluation

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

An Approach to Evaluating the Missing Data Assumptions of the Chain and Post-Stratification Equating Methods for the NEAT Design

Peer reviewed

Direct link

Holland, Paul W.; Sinharay, Sandip; von Davier, Alina A.; Han, Ning – Journal of Educational Measurement, 2008

Two important types of observed score equating (OSE) methods for the non-equivalent groups with Anchor Test (NEAT) design are chain equating (CE) and post-stratification equating (PSE). CE and PSE reflect two distinctly different ways of using the information provided by the anchor test for computing OSE functions. Both types of methods include…

Descriptors: Equated Scores, Prediction, Comparative Analysis

The Effects of Different Types of Anchor Tests on Observed Score Equating. Research Report. ETS RR-09-41

Download full text

Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward – Educational Testing Service, 2009

This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating were evaluated and compared with respect to systematic…

Descriptors: Equated Scores, Test Items, Difficulty Level, Error of Measurement

First Language of Examinees and Its Relationship to Equating. Research Report. ETS RR-09-05

Download full text

Liang, Longjuan; Dorans, Neil J.; Sinharay, Sandip – Educational Testing Service, 2009

To ensure fairness, it is important to better understand the relationship of language proficiency with the standard procedures of psychometric analysis. This paper examines how equating results are affected by an increase in the proportion of examinees who report that English is not their first language, using the analysis samples for a…

Descriptors: Equated Scores, English (Second Language), Reading Tests, Mathematics Tests

The Missing Data Assumptions of the Nonequivalent Groups with Anchor Test (NEAT) Design and Their Implications for Test Equating. Research Report. ETS RR-09-16

Download full text

Sinharay, Sandip; Holland, Paul W. – Educational Testing Service, 2008

The nonequivalent groups with anchor test (NEAT) design involves missing data that are missing by design. Three popular equating methods that can be used with a NEAT design are the poststratification equating method, the chain equipercentile equating method, and the item-response-theory observed-score-equating method. These three methods each…

Descriptors: Equated Scores, Test Items, Item Response Theory, Data

The Correlation between the Scores of a Test and an Anchor Test. Research Report. ETS RR-06-04

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006

It is a widely held belief that an anchor test used in equating should be a miniature version (or "minitest") of the tests to be equated; that is, the anchor test should be proportionally representative of the two tests in content and statistical characteristics. This paper examines the scientific foundation of this belief, especially…

Descriptors: Test Items, Equated Scores, Correlation, Tests

Choice of Anchor Test in Equating. Research Report. ETS RR-06-35

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006

It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics of the tests being equated. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…

Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level

Testing the Untestable Assumptions of the Chain and Poststratification Equating Methods for the NEAT Design. Research Report. ETS RR-06-17

Peer reviewed
PDF on ERIC

Download full text

Holland, Paul W.; von Davier, Alina A.; Sinharay, Sandip; Han, Ning – ETS Research Report Series, 2006

This paper focuses on the Non-Equivalent Groups with Anchor Test (NEAT) design for test equating and on two classes of observed--score equating (OSE) methods--chain equating (CE) and poststratification equating (PSE). These two classes of methods reflect two distinctly different ways of using the information provided by the anchor test for…

Descriptors: Equated Scores, Test Items, Statistical Analysis, Comparative Analysis