ERIC - Search Results

Showing all 4 results

A Comparison of Score Aggregation Methods for Unidimensional Tests on Different Dimensions. Research Report. ETS RR-18-01

Peer reviewed
PDF on ERIC

Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018

In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…

Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests

Differential Item Functioning: Its Consequences. Research Report. ETS RR-10-01

Peer reviewed
PDF on ERIC

Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2010

This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…

Descriptors: Test Bias, Item Response Theory, Test Items, Scores

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed
PDF on ERIC

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

Analysis of Discourse Features and Verification of Scoring Levels for Independent and Integrated Prototype Written Tasks for the New TOEFL®. TOEFL® Monograph Series. MS-30. ETS RM-05-13

Peer reviewed
PDF on ERIC

Cumming, Alister; Kantor, Robert; Baba, Kyoko; Eouanzoui, Keanre; Erdosy, Usman; James, Mark – ETS Research Report Series, 2006

We assessed whether and how the discourse written for prototype integrated tasks (involving writing in response to print or audio source texts) field tested for the new TOEFL® differs from the discourse written for independent essays (i.e., the TOEFL essay). We selected 216 compositions written for 6 tasks by 36 examinees in a field…

Descriptors: Discourse Analysis, Essays, Scores, Language Proficiency