Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 4 |
Descriptor
Scores | 4 |
Test Length | 4 |
Comparative Analysis | 3 |
Item Response Theory | 3 |
Models | 2 |
Simulation | 2 |
Statistical Analysis | 2 |
Test Items | 2 |
Test Reliability | 2 |
Computer Assisted Testing | 1 |
Correlation | 1 |
More ▼ |
Source
ETS Research Report Series | 4 |
Author
Baba, Kyoko | 1 |
Cumming, Alister | 1 |
Eouanzoui, Keanre | 1 |
Erdosy, Usman | 1 |
Feng, Yuling | 1 |
Fu, Jianbin | 1 |
James, Mark | 1 |
Kantor, Robert | 1 |
Lee, Yi-Hsuan | 1 |
Patsula, Liane | 1 |
Rizavi, Saba | 1 |
More ▼ |
Publication Type
Journal Articles | 4 |
Reports - Research | 4 |
Education Level
Audience
Location
Canada | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018
In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…
Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2010
This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…
Descriptors: Test Bias, Item Response Theory, Test Items, Scores
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Cumming, Alister; Kantor, Robert; Baba, Kyoko; Eouanzoui, Keanre; Erdosy, Usman; James, Mark – ETS Research Report Series, 2006
We assessed whether and how the discourse written for prototype integrated tasks (involving writing in response to print or audio source texts) field tested for the new TOEFL® differs from the discourse written for independent essays (i.e., the TOEFL essay). We selected 216 compositions written for 6 tasks by 36 examinees in a field…
Descriptors: Discourse Analysis, Essays, Scores, Language Proficiency