ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	11
Since 2007 (last 20 years)	21

Descriptor

Computer Assisted Testing	23
Evaluators	23
Scores	23
English (Second Language)	16
Language Tests	16
Second Language Learning	14
Scoring	13
Correlation	9
Comparative Analysis	7
Foreign Countries	7
Interrater Reliability	7
Language Proficiency	7
Oral Language	7
Computer Software	5
Essays	5
Language Teachers	5
Native Language	5
Speech Communication	5
Essay Tests	4
Evaluation Criteria	4
Multiple Regression Analysis	4
Prompting	4
Rating Scales	4
Scoring Rubrics	4
Second Language Instruction	4
More ▼

Source

ETS Research Report Series	5
ProQuest LLC	3
Language Assessment Quarterly	2
Language Testing	2
Applied Linguistics	1
Assessment in Education:…	1
Contemporary Educational…	1
Education Journal	1
English Language Teaching	1
Innovation in Language…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Effective Teaching	1
More ▼

Publication Type

Reports - Research	20
Journal Articles	19
Tests/Questionnaires	5
Dissertations/Theses -…	3

Education Level

Higher Education	5
Postsecondary Education	5
Secondary Education	2
Elementary Education	1
High Schools	1

Audience

Location

China	2
Hong Kong	1
Iran	1
Japan	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	9
International English…	2
Foreign Language Classroom…	1
Graduate Record Examinations	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation

Peer reviewed

Direct link

Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023

Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…

Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems

Application of an Automated Essay Scoring Engine to English Writing Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Direct link

Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…

Descriptors: Computer Assisted Testing, Essays, Scoring, Scores

Mitigating Gender and L1 Biases in Automated English Speaking Assessment

Direct link

Alexander James Kwako – ProQuest LLC, 2023

Automated assessment using Natural Language Processing (NLP) has the potential to make English speaking assessments more reliable, authentic, and accessible. Yet without careful examination, NLP may exacerbate social prejudices based on gender or native language (L1). Current NLP-based assessments are prone to such biases, yet research and…

Descriptors: Gender Bias, Natural Language Processing, Native Language, Computational Linguistics

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Direct link

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

Human versus Computer Partner in the Paired Oral Discussion Test

Peer reviewed

Direct link

Ockey, Gary J.; Chukharev-Hudilainen, Evgeny – Applied Linguistics, 2021

A challenge of large-scale oral communication assessments is to feasibly assess a broad construct that includes interactional competence. One possible approach in addressing this challenge is to use a spoken dialog system (SDS), with the computer acting as a peer to elicit a ratable speech sample. With this aim, an SDS was built and four trained…

Descriptors: Oral Language, Grammar, Language Fluency, Language Tests

Home-Grown Automated Essay Scoring in the Literature Classroom: A Solution for Managing the Crowd?

Peer reviewed
PDF on ERIC

Download full text

Uzun, Kutay – Contemporary Educational Technology, 2018

Managing crowded classes in terms of classroom assessment is a difficult task due to the amount of time which needs to be devoted to providing feedback to student products. In this respect, the present study aimed to develop an automated essay scoring environment as a potential means to overcome this problem. Secondarily, the study aimed to test…

Descriptors: Computer Assisted Testing, Essays, Scoring, English Literature

The Influence of Rater Effects in Training Sets on the Psychometric Quality of Automated Scoring for Writing Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018

Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…

Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring

Integrated Listening/Speaking Skill Assessment: The Role of Ambiguity Tolerance, Cognitive/Metacognitive Strategy Use, and Foreign Language Anxiety

Peer reviewed
PDF on ERIC

Download full text

Karim Sadeghi; Neda Bakhshi – International Journal of Language Testing, 2025

Assessing language skills in an integrative form has drawn the attention of assessment experts in recent years. While some research data exists on integrative listening/reading-to-write assessment, there is comparatively little research literature on listening-to-speak integrated assessment. Also, little attention has been devoted to the role of…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Computer Assisted Testing

Using Spoken Language Technology for Generating Feedback to Prepare for the TOEFL iBT® Test: A User Perception Study

Peer reviewed

Direct link

Gu, Lin; Davis, Larry; Tao, Jacob; Zechner, Klaus – Assessment in Education: Principles, Policy & Practice, 2021

Recent technology advancements have increased the prospects for automated spoken language technology to provide feedback on speaking performance. In this study we examined user perceptions of using an automated feedback system for preparing for the TOEFL iBT® test. Test takers and language teachers evaluated three types of machine-generated…

Descriptors: Audio Equipment, Test Preparation, Feedback (Response), Scores

Comparison of Automatic and Expert Teachers' Rating of Computerized English Listening-Speaking Test

Peer reviewed
PDF on ERIC

Download full text

Linlin, Cao – English Language Teaching, 2020

Through Many-Facet Rasch analysis, this study explores the rating differences between 1 computer automatic rater and 5 expert teacher raters on scoring 119 students in a computerized English listening-speaking test. Results indicate that both automatic and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…

Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

"How Scripted Is This Going to Be?" Raters' Views of Authenticity in Speaking-Performance Tests

Peer reviewed

Direct link

Burton, John Dylan – Language Assessment Quarterly, 2020

An assumption underlying speaking tests is that scores reflect the ability to produce online, non-rehearsed speech. Speech produced in testing situations may, however, be less spontaneous if extensive test preparation takes place, resulting in memorized or rehearsed responses. If raters detect these patterns, they may conceptualize speech as…

Descriptors: Language Tests, Oral Language, Scores, Speech Communication

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

The Role of Lexical Properties and Cohesive Devices in Text Integration and Their Effect on Human Ratings of Speaking Proficiency

Peer reviewed

Direct link

Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014

There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…

Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)

TOEFL11: A Corpus of Non-Native English. Research Report. ETS RR-13-24

Peer reviewed
PDF on ERIC

Download full text

Blanchard, Daniel; Tetreault, Joel; Higgins, Derrick; Cahill, Aoife; Chodorow, Martin – ETS Research Report Series, 2013

This report presents work on the development of a new corpus of non-native English writing. It will be useful for the task of native language identification, as well as grammatical error detection and correction, and automatic essay scoring. In this report, the corpus is described in detail.

Descriptors: Language Tests, Second Language Learning, English (Second Language), Writing Tests

Previous Page | Next Page »

Pages: 1 | 2

Davis, Larry	2
Alexander James Kwako	1
Attali, Yigal	1
Blanchard, Daniel	1
Bond, Trevor	1
Breyer, F. Jay	1
Burton, John Dylan	1
Cahill, Aoife	1
Casabianca, Jodi M.	1
Chan, Kinnie Kin Yee	1
Chao, Szu-Fu	1
Chodorow, Martin	1
Choi, Ikkyu	1
Chukharev-Hudilainen, Evgeny	1
Clevinger, Amanda	1
Coniam, David	1
Crossley, Scott	1
Davis, Lawrence Edward	1
Donoghue, John R.	1
Engelhard, George, Jr.	1
Foltz, Peter	1
Glew, David	1
Gu, Lin	1
Higgins, Derrick	1
More ▼