Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 14 |
Source
ETS Research Report Series | 18 |
Author
Attali, Yigal | 3 |
Chang, Hua-Hua | 2 |
Kim, Sooyeon | 2 |
Sinharay, Sandip | 2 |
Ali, Usama S. | 1 |
Barkaoui, Khaled | 1 |
Bonett, John | 1 |
Boughton, Keith A. | 1 |
Breland, Hunter | 1 |
Brenneman, Meghan | 1 |
Breyer, F. Jay | 1 |
Publication Type
Journal Articles | 18 |
Reports - Research | 18 |
Tests/Questionnaires | 5 |
Education Level
Higher Education | 5 |
Postsecondary Education | 5 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 1 | 1 |
Grade 3 | 1 |
Kindergarten | 1 |
Primary Education | 1 |
Assessments and Surveys
Test of English as a Foreign Language | 8 |
Praxis Series | 2 |
Early Childhood Longitudinal… | 1 |
Graduate Record Examinations | 1 |
Kyle, Kristopher; Choe, Ann Tai; Eguchi, Masaki; LaFlair, Geoff; Ziegler, Nicole – ETS Research Report Series, 2021
A key piece of a validity argument for a language assessment tool is clear overlap between assessment tasks and the target language use (TLU) domain (i.e., the domain description inference). The TOEFL 2000 Spoken and Written Academic Language (T2K-SWAL) corpus, which represents a variety of academic registers and disciplines in traditional…
Descriptors: Comparative Analysis, Second Language Learning, English (Second Language), Language Tests
Papageorgiou, Spiros; Wu, Sha; Hsieh, Ching-Ni; Tannenbaum, Richard J.; Cheng, Mengmeng – ETS Research Report Series, 2019
The past decade has seen an emerging interest in mapping (aligning or linking) test scores to language proficiency levels of external performance scales or frameworks, such as the Common European Framework of Reference (CEFR), as well as locally developed frameworks, such as China's Standards of English Language Ability (CSE). Such alignment is…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014
The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…
Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…
Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)
Chen, Jing; Sheehan, Kathleen M. – ETS Research Report Series, 2015
The "TOEFL"® family of assessments includes the "TOEFL"® Primary"™, "TOEFL Junior"®, and "TOEFL iBT"® tests. The linguistic complexity of stimulus passages in the reading sections of the TOEFL family of assessments is expected to differ across the test levels. This study evaluates the linguistic…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Reading Comprehension
Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014
Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…
Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items
Barkaoui, Khaled – ETS Research Report Series, 2015
This study aimed to describe the writing activities that test takers engage in when responding to the writing tasks in the "TOEFL iBT"® test and to examine the effects of task type and test-taker English language proficiency (ELP) and keyboarding skills on the frequency and distribution of these activities. Each of 22 test…
Descriptors: Second Language Learning, Language Tests, English (Second Language), Writing Instruction
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of the argument and issue tasks that form the Analytical Writing measure of the "GRE"® General Test. For each of these tasks, this study explored the value added of reporting 4 trait scores for each of these 2 tasks over the total e-rater score.…
Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar
Steinberg, Jonathan; Brenneman, Meghan; Castellano, Karen; Lin, Peng; Miller, Susanne – ETS Research Report Series, 2014
Test providers are increasingly moving toward exclusively administering assessments by computer. Computerized testing is becoming more desirable for test takers because of increased opportunities to test, faster turnaround of individual scores, or perhaps other factors, offering potential benefits for those who may be struggling to pass licensure…
Descriptors: Comparative Analysis, Achievement Gap, Academic Achievement, Test Format
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Zhang, Jinming; Chang, Hua-Hua – ETS Research Report Series, 2005
This paper compares the use of multiple pools versus a single pool with respect to test security against large-scale item sharing among some examinees in a computer-based test, under the assumption that a randomized item selection method is used. It characterizes the conditions under which employing multiple pools is better than using a single…
Descriptors: Comparative Analysis, Test Items, Item Banks, Computer Assisted Testing
Rock, Donald A. – ETS Research Report Series, 2007
This paper presents a strategy for measuring cognitive gains in reading during the early school years. It is argued that accurate estimates of gain scores and their appropriate interpretation requires the use of adaptive tests with multiple criterion referenced points that mark learning milestones. It is further argued that two different measures…
Descriptors: Scores, Cognitive Development, Computation, Test Interpretation
Attali, Yigal – ETS Research Report Series, 2007
This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…
Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)
Yu, Lei; Livingston, Samuel A.; Larkin, Kevin C.; Bonett, John – ETS Research Report Series, 2004
This study compared essay scores from paper-based and computer-based versions of a writing test for prospective teachers. Scores for essays in the paper-based version averaged nearly half a standard deviation higher than those in the computer-based version, after applying a statistical control for demographic differences between the groups of…
Descriptors: Essays, Writing (Composition), Computer Assisted Testing, Technology Uses in Education