ERIC - Search Results

Showing 1 to 15 of 18 results

Integrated Listening/Speaking Skill Assessment: The Role of Ambiguity Tolerance, Cognitive/Metacognitive Strategy Use, and Foreign Language Anxiety

Peer reviewed

Sadeghi, Karim; Bakhshi, Neda – International Journal of Language Testing, 2025

Assessing language skills in an integrative form has drawn the attention of assessment experts in recent years. While some research data exists on integrative listening/reading-to-write assessment, there is comparatively little research literature on listening-to-speak integrated assessment. Also, little attention has been devoted to the role of…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Computer Assisted Testing

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

Using Spoken Language Technology for Generating Feedback to Prepare for the TOEFL iBT® Test: A User Perception Study

Peer reviewed

Gu, Lin; Davis, Larry; Tao, Jacob; Zechner, Klaus – Assessment in Education: Principles, Policy & Practice, 2021

Recent technology advancements have increased the prospects for automated spoken language technology to provide feedback on speaking performance. In this study we examined user perceptions of using an automated feedback system for preparing for the TOEFL iBT® test. Test takers and language teachers evaluated three types of machine-generated…

Descriptors: Audio Equipment, Test Preparation, Feedback (Response), Scores

The Effect of Training and Rater Differences on Oral Proficiency Assessment

Peer reviewed

Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019

As a result of the fact that judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error because of rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…

Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

A Cross-Linguistic Investigation of the Effect of Raters' Accent Familiarity on Speaking Assessment

Peer reviewed

Huang, Becky; Alegre, Analucia; Eisenberg, Ann – Language Assessment Quarterly, 2016

The project aimed to examine the effect of raters' familiarity with accents on their judgments of non-native speech. Participants included three groups of raters who were either from Spanish Heritage, Spanish Non-Heritage, or Chinese Heritage backgrounds (n = 16 in each group) using Winke & Gass's (2013) definition of a heritage learner as…

Descriptors: Contrastive Linguistics, Evaluators, Chinese, Spanish

A Study on the Impact of Fatigue on Human Raters When Scoring Speaking Responses

Peer reviewed

Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014

The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…

Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score. The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

The Role of Lexical Properties and Cohesive Devices in Text Integration and Their Effect on Human Ratings of Speaking Proficiency

Peer reviewed

Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014

There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…

Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)

TOEFL11: A Corpus of Non-Native English. Research Report. ETS RR-13-24

Peer reviewed

Blanchard, Daniel; Tetreault, Joel; Higgins, Derrick; Cahill, Aoife; Chodorow, Martin – ETS Research Report Series, 2013

This report presents work on the development of a new corpus of non-native English writing. It will be useful for the task of native language identification, as well as grammatical error detection and correction, and automatic essay scoring. In this report, the corpus is described in detail.

Descriptors: Language Tests, Second Language Learning, English (Second Language), Writing Tests

Evaluation of the "e-rater"® Scoring Engine for the "TOEFL"® Independent and Integrated Prompts. Research Report. ETS RR-12-06

Peer reviewed

Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012

Scoring models for the "e-rater"® system were built and evaluated for the "TOEFL"® exam's independent and integrated writing prompts. Prompt-specific and generic scoring models were built, and evaluation statistics, such as weighted kappas, Pearson correlations, standardized differences in mean scores, and correlations with…

Descriptors: Scoring, Prompting, Evaluators, Computer Software

TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

Peer reviewed

Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela – Language Testing, 2012

Scores assigned by trained raters and by an automated scoring system (SpeechRater™) on the speaking section of the TOEFL iBT™ were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

Descriptors: Undergraduate Students, Speech Communication, Rating Scales, Scoring

Rater Expertise in a Second Language Speaking Assessment: The Influence of Training and Experience

Davis, Lawrence Edward – ProQuest LLC, 2012

Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…

Descriptors: Evaluators, Expertise, Scores, Second Language Learning

Developing Analytic Rating Guides for "TOEFL iBT"® Integrated Speaking Tasks. "TOEFL iBT"® Research Report, TOEFL iBT-20. ETS Research Report. RR-13-13

Peer reviewed

Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013

Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…

Descriptors: Oral Language, Language Proficiency, Scaling, Scores

Analytic Scoring of TOEFL® CBT Essays: Scores from Humans and "E-rater"®. TOEFL® Research Reports. RR-81. ETS RR-08-01

Peer reviewed

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – ETS Research Report Series, 2008

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multitrait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (CBT) writing assessment. Data analyzed in the study were analytic and holistic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scoring
