ERIC - Search Results

Showing all 6 results

Statistical Procedures to Evaluate Quality of Scale Anchoring. Research Report. ETS RR-11-02

Haberman, Shelby J.; Sinharay, Sandip; Lee, Yi-Hsuan – Educational Testing Service, 2011

Providing information to test takers and test score users about the abilities of test takers at different score levels has been a persistent problem in educational and psychological measurement (Carroll, 1993). Scale anchoring (Beaton & Allen, 1992), a technique that describes what students at different points on a score scale know and can do,…

Descriptors: Statistical Analysis, Scores, Regression (Statistics), Item Response Theory

Unfair Treatment vs. Confirmation Bias? Comments on Santelices and Wilson. Research Report. ETS RR-10-20

Dorans, Neil J. – Educational Testing Service, 2010

Santelices and Wilson (2010) claimed to have addressed technical criticisms of Freedle (2003) presented in Dorans (2004a) and elsewhere. Santelices and Wilson's abstract claimed that their study confirmed that SAT[R] verbal items do function differently for African American and White subgroups. In this commentary, I demonstrate that the…

Descriptors: College Entrance Examinations, Verbal Tests, Test Bias, Test Items

Computer-Adaptive Testing for Students with Disabilities: A Review of the Literature. Research Report. ETS RR-11-32

Stone, Elizabeth; Davey, Tim – Educational Testing Service, 2011

There has been an increased interest in developing computer-adaptive testing (CAT) and multistage assessments for K-12 accountability assessments. The move to adaptive testing has been met with some resistance by those in the field of special education who express concern about routing of students with divergent profiles (e.g., some students with…

Descriptors: Disabilities, Adaptive Testing, Accountability, Computer Assisted Testing

The Effects of Different Types of Anchor Tests on Observed Score Equating. Research Report. ETS RR-09-41

Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward – Educational Testing Service, 2009

This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating was evaluated and compared with respect to systematic…

Descriptors: Equated Scores, Test Items, Difficulty Level, Error of Measurement

The Missing Data Assumptions of the Nonequivalent Groups with Anchor Test (NEAT) Design and Their Implications for Test Equating. Research Report. ETS RR-09-16

Sinharay, Sandip; Holland, Paul W. – Educational Testing Service, 2008

The nonequivalent groups with anchor test (NEAT) design involves missing data that are missing by design. Three popular equating methods that can be used with a NEAT design are the poststratification equating method, the chain equipercentile equating method, and the item-response-theory observed-score-equating method. These three methods each…

Descriptors: Equated Scores, Test Items, Item Response Theory, Data

Exploring Item Characteristics That Are Related to the Difficulty of TOEFL Dialogue Items. Research Reports. RR-79. RR-04-11

Kostin, Irene – Educational Testing Service, 2004

The purpose of this study is to explore the relationship between a set of item characteristics and the difficulty of TOEFL[R] dialogue items. Identifying characteristics that are related to item difficulty has the potential to improve the efficiency of the item-writing process. The study employed 365 TOEFL dialogue items, which were coded on 49…

Descriptors: Statistical Analysis, Difficulty Level, Language Tests, English (Second Language)