ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	18
Since 2006 (last 20 years)	36

Descriptor

Evaluators	39
English (Second Language)	37
Language Tests	36
Second Language Learning	36
Scoring	21
Computer Assisted Testing	18
Language Proficiency	14
Scores	14
Interrater Reliability	13
Oral Language	12
Foreign Countries	11
Speech Communication	10
Correlation	9
Pronunciation	9
Rating Scales	9
Native Language	8
Second Language Instruction	8
Statistical Analysis	8
Evaluation Criteria	7
Comparative Analysis	6
Computer Software	6
Essays	6
Writing Evaluation	6
Accuracy	5
College Students	5
More ▼

Publication Type

Journal Articles	36
Reports - Research	36
Tests/Questionnaires	10
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	14
Postsecondary Education	13
High Schools	1
Secondary Education	1

Audience

Location

Iran	4
Australia	1
Europe	1
Germany	1
India	1
Japan (Tokyo)	1
New Zealand	1
Switzerland	1
Thailand	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	39
International English…	5
Test of English for…	3
Foreign Language Classroom…	1
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing 1 to 15 of 39 results Save | Export

Revisiting Raters' Accent Familiarity in Speaking Tests: Evidence That Presentation Mode Interacts with Accent Familiarity to Variably Affect Comprehensibility Ratings

Peer reviewed

Direct link

Michael D. Carey; Stefan Szocs – Language Testing, 2024

This controlled experimental study investigated the interaction of variables associated with rating the pronunciation component of high-stakes English-language-speaking tests such as IELTS and TOEFL iBT. One hundred experienced raters who were all either familiar or unfamiliar with Brazilian-accented English or Papua New Guinean Tok Pisin-accented…

Descriptors: Dialects, Pronunciation, Suprasegmentals, Familiarity

Towards More Valid Scoring Criteria for Integrated Reading-Writing and Listening-Writing Summary Tasks

Peer reviewed

Direct link

Chan, Sathena; May, Lyn – Language Testing, 2023

Despite the increased use of integrated tasks in high-stakes academic writing assessment, research on rating criteria which reflect the unique construct of integrated summary writing skills is comparatively rare. Using a mixed-method approach of expert judgement, text analysis, and statistical analysis, this study examines writing features that…

Descriptors: Scoring, Writing Evaluation, Reading Tests, Listening Skills

Applying Cognitive Theory to the Human Essay Rating Process

Peer reviewed

Direct link

Finn, Bridgid; Arslan, Burcu; Walsh, Matthew – Applied Measurement in Education, 2020

To score an essay response, raters draw on previously trained skills and knowledge about the underlying rubric and score criterion. Cognitive processes such as remembering, forgetting, and skill decay likely influence rater performance. To investigate how forgetting influences scoring, we evaluated raters' scoring accuracy on TOEFL and GRE essays.…

Descriptors: Epistemology, Essay Tests, Evaluators, Cognitive Processes

Using Statistical Transformation Methods to Explore Speech Perception Scale Lengths

Peer reviewed
PDF on ERIC

Download full text

Kermad, Alyssa; Bogorevich, Valeria – Language Teaching Research Quarterly, 2022

The practice of second language (L2) speech perception has traditionally relied on equal-interval perceptual scales and novice listeners' (NLs) impressionistic judgments of constructs such as accentedness and comprehensibility (Munro & Derwing, 2011). However, issues have surfaced with respect to how well NLs can use these scales, whether they…

Descriptors: Speech Communication, Second Language Learning, Intelligibility, Rating Scales

In Conversation with John Read on Language Testing and Assessment

Peer reviewed

Direct link

Pang, Alvin – RELC Journal: A Journal of Language Teaching and Research, 2019

John Read is about to retire as Professor in Applied Language Studies at the University of Auckland. He previously taught applied linguistics, Teaching English to Speakers of Other Languages (TESOL) and English for Academic Purposes (EAP) at Victoria University of Wellington, the SEAMEO Regional Language Centre, the University of Texas El Paso,…

Descriptors: Language Tests, Testing, English (Second Language), Second Language Learning

Rater Dominance in Discussion as a Resolution Method

Peer reviewed
PDF on ERIC

Download full text

Ahmadi, Alireza – Taiwan Journal of TESOL, 2020

Rater subjectivity has long been an intriguing topic. The use of discussion as a resolution method is a practical way to reduce this subjectivity. However, the efficacy of discussion depends on whether different raters get equally engaged in it or one rater tends to dominate others. This study investigated whether and how rater dominance occurs in…

Descriptors: Evaluators, Interrater Reliability, Discussion, Discourse Analysis

Integrated Listening/Speaking Skill Assessment: The Role of Ambiguity Tolerance, Cognitive/Metacognitive Strategy Use, and Foreign Language Anxiety

Peer reviewed
PDF on ERIC

Download full text

Karim Sadeghi; Neda Bakhshi – International Journal of Language Testing, 2025

Assessing language skills in an integrative form has drawn the attention of assessment experts in recent years. While some research data exists on integrative listening/reading-to-write assessment, there is comparatively little research literature on listening-to-speak integrated assessment. Also, little attention has been devoted to the role of…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Computer Assisted Testing

The Effects of Task Complexity on Comprehensibility in Second Language Speech

Peer reviewed

Direct link

Choi, Jin Soo – Applied Language Learning, 2021

This study examined the impact of the manipulated task complexity (Robinson 2001a, 2001b, 2007, 2011; Robinson & Gilabert, 2007) on second language (L2) speech comprehensibility. I examined whether manipulated task complexity (a) impacts L2 speech comprehensibility, (b) aligns with L2 speakers' perception of task difficulty (cognitive…

Descriptors: Task Analysis, Second Language Learning, Second Language Instruction, Pronunciation

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Download full text

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

For a Greater Good: Bias Analysis in Writing Assessment

Peer reviewed

Direct link

Ahmadi Shirazi, Masoumeh – SAGE Open, 2019

Threats to construct validity should be reduced to a minimum. If true, sources of bias, namely raters, items, tests as well as gender, age, race, language background, culture, and socio-economic status need to be spotted and removed. This study investigates raters' experience, language background, and the choice of essay prompt as potential…

Descriptors: Foreign Countries, Language Tests, Test Bias, Essay Tests

Using Spoken Language Technology for Generating Feedback to Prepare for the TOEFL iBT® Test: A User Perception Study

Peer reviewed

Direct link

Gu, Lin; Davis, Larry; Tao, Jacob; Zechner, Klaus – Assessment in Education: Principles, Policy & Practice, 2021

Recent technology advancements have increased the prospects for automated spoken language technology to provide feedback on speaking performance. In this study we examined user perceptions of using an automated feedback system for preparing for the TOEFL iBT® test. Test takers and language teachers evaluated three types of machine-generated…

Descriptors: Audio Equipment, Test Preparation, Feedback (Response), Scores

Mapping the CU-TEP to the Common European Framework of Reference (CEFTR)

Peer reviewed
PDF on ERIC

Download full text

Wudthayagorn, Jirada – LEARN Journal: Language Education and Acquisition Research Network, 2018

The purpose of this study was to map the Chulalongkorn University Test of English Proficiency, or the CU-TEP, to the Common European Framework of Reference (CEFR) by employing a standard setting methodology. Thirteen experts judged 120 items of the CU-TEP using the Yes/No Angoff technique. The experts decided whether or not a borderline student at…

Descriptors: Guidelines, Rating Scales, English (Second Language), Language Tests

Effects of Strength of Accent on an L2 Interactive Lecture Listening Comprehension Test

Peer reviewed

Direct link

Ockey, Gary J.; Papageorgiou, Spiros; French, Robert – International Journal of Listening, 2016

This article reports on a study which aimed to determine the effect of strength of accent on listening comprehension of interactive lectures. Test takers (N = 21,726) listened to an interactive lecture given by one of nine speakers and responded to six comprehension items. The test taker responses were analyzed with the Rasch computer program…

Descriptors: Pronunciation, Listening Comprehension, Lecture Method, Computer Software

The Effect of Training and Rater Differences on Oral Proficiency Assessment

Peer reviewed

Direct link

Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019

As a result of the fact that judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error because of rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…

Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3

ETS Research Report Series	10
Language Testing	8
Language Assessment Quarterly	3
Applied Language Learning	1
Applied Measurement in…	1
Assessment in Education:…	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
JALT CALL Journal	1
Journal of Pan-Pacific…	1
LEARN Journal: Language…	1
Language Learning	1
Language Teaching Research…	1
ProQuest LLC	1
RELC Journal: A Journal of…	1
SAGE Open	1
TESL-EJ	1
Taiwan Journal of TESOL	1
More ▼

Xi, Xiaoming	4
Bridgeman, Brent	2
Davis, Larry	2
Kang, Okim	2
Kermad, Alyssa	2
Mollaun, Pam	2
Mollaun, Pamela	2
Zechner, Klaus	2
Ahmadi Shirazi, Masoumeh	1
Ahmadi, Alireza	1
Alegre, Analucia	1
Allen, Laura K.	1
Angoff, William H.	1
Arslan, Burcu	1
Attali, Yigal	1
Bejar, Isaac I.	1
Blanchard, Daniel	1
Bogorevich, Valeria	1
Brown, Annie	1
Cahill, Aoife	1
Casabianca, Jodi M.	1
Chan, Sathena	1
Chodorow, Martin	1
Choi, Jin Soo	1
Clevinger, Amanda	1
More ▼