ERIC - Search Results

Publication Date

In 2025	2
Since 2024	4
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	23

Descriptor

Computer Assisted Testing	23
Evaluators	23
Foreign Countries	23
Second Language Learning	15
English (Second Language)	14
Language Tests	11
Comparative Analysis	10
Scoring	10
Correlation	7
Essays	7
Language Proficiency	7
Scores	7
Computer Software	6
Interrater Reliability	6
Second Language Instruction	6
College Students	5
Rating Scales	5
Writing Evaluation	5
Speech Communication	4
Undergraduate Students	4
Accuracy	3
Cues	3
Evaluation Methods	3
Grading	3
Learning Processes	3
More ▼

Publication Type

Journal Articles	22
Reports - Research	20
Tests/Questionnaires	4
Dissertations/Theses -…	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	11
Postsecondary Education	11
Secondary Education	4
High Schools	2
Elementary Education	1
Elementary Secondary Education	1
Grade 11	1
Grade 5	1
Intermediate Grades	1
Middle Schools	1

Audience

Location

China	6
Hong Kong	4
Germany	2
Taiwan	2
United Kingdom	2
Australia	1
California	1
China (Beijing)	1
Cyprus	1
Europe	1
Iran	1
Japan	1
Singapore	1
Switzerland	1
Turkey	1
United States	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

International English…	3
Test of English as a Foreign…	2
Foreign Language Classroom…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed
PDF on ERIC

Download full text

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Direct link

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

Examining Severity and Centrality Effects in TestDaF Writing and Speaking Assessments: An Extended Bayesian Many-Facet Rasch Analysis

Peer reviewed

Direct link

Eckes, Thomas; Jin, Kuan-Yu – International Journal of Testing, 2021

Severity and centrality are two main kinds of rater effects posing threats to the validity and fairness of performance assessments. Adopting Jin and Wang's (2018) extended facets modeling approach, we separately estimated the magnitude of rater severity and centrality effects in the web-based TestDaF (Test of German as a Foreign Language) writing…

Descriptors: Language Tests, German, Second Languages, Writing Tests

Assessing L2 English Speaking Using Automated Scoring Technology: Examining Automarker Reliability

Peer reviewed

Direct link

Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021

Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software

Silver Linings: Rethinking Assessment Pedagogy under the Pandemic

Peer reviewed

Direct link

Amrane-Cooper, Linda; Hatzipanagos, Stylianos; Tait, Alan – European Journal of Open, Distance and E-Learning, 2023

In 2020, because of the COVID-19 pandemic the higher education sector, in the United Kingdom and internationally, transitioned to online assessment, at a speed and scale which might have been unimaginable under normal circumstances. The priority in the sector was to ensure that fundamental principles of assessment, including integrity, were…

Descriptors: Pandemics, COVID-19, Educational Change, Integrity

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Direct link

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

Integrated Listening/Speaking Skill Assessment: The Role of Ambiguity Tolerance, Cognitive/Metacognitive Strategy Use, and Foreign Language Anxiety

Peer reviewed
PDF on ERIC

Download full text

Karim Sadeghi; Neda Bakhshi – International Journal of Language Testing, 2025

Assessing language skills in an integrative form has drawn the attention of assessment experts in recent years. While some research data exists on integrative listening/reading-to-write assessment, there is comparatively little research literature on listening-to-speak integrated assessment. Also, little attention has been devoted to the role of…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Computer Assisted Testing

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Download full text

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

Comparison of Automatic and Expert Teachers' Rating of Computerized English Listening-Speaking Test

Peer reviewed
PDF on ERIC

Download full text

Linlin, Cao – English Language Teaching, 2020

Through Many-Facet Rasch analysis, this study explores the rating differences between 1 computer automatic rater and 5 expert teacher raters on scoring 119 students in a computerized English listening-speaking test. Results indicate that both automatic and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…

Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

Variations in Rating Scale Functioning in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Direct link

Li, Shuai; Taguchi, Naoko; Xiao, Feng – Language Assessment Quarterly, 2019

Adopting Linacre's guidelines for evaluating rating scale effectiveness, we examined whether and how a six-point rating scale functioned differently across raters, speech acts, and second language (L2) proficiency levels. We developed a 12-item Computerized Oral Discourse Completion Task (CODCT) for assessing the production of requests, refusals,…

Descriptors: Speech Acts, Rating Scales, Guidelines, Evaluators

"How Scripted Is This Going to Be?" Raters' Views of Authenticity in Speaking-Performance Tests

Peer reviewed

Direct link

Burton, John Dylan – Language Assessment Quarterly, 2020

An assumption underlying speaking tests is that scores reflect the ability to produce online, non-rehearsed speech. Speech produced in testing situations may, however, be less spontaneous if extensive test preparation takes place, resulting in memorized or rehearsed responses. If raters detect these patterns, they may conceptualize speech as…

Descriptors: Language Tests, Oral Language, Scores, Speech Communication

Language Testing in China: Past and Future

Peer reviewed
PDF on ERIC

Download full text

Li, Xuelian – English Language Teaching, 2019

Based on the articles written by mainland Chinese scholars published in the most influential Chinese and international journals, the present article analyzed the language testing research, compared the tendencies of seven categories between 2000-2009 and 2010-2019, and put forward future research directions by referring to international hot…

Descriptors: Language Tests, Testing, Educational History, Futures (of Society)

A Comparative Picture of the Ease of Use and Acceptance of Onscreen Marking by Markers across Subject Areas

Peer reviewed

Direct link

Coniam, David; Yan, Zi – British Journal of Educational Technology, 2016

Onscreen marking (OSM) has been used for the majority of Hong Kong public examinations since 2012. The current study compares marker reactions to OSM, ie, perceived ease of use and acceptance of OSM, against the backdrop of virtually all subject areas being marked on screen. The data were collected from three major sources: (1) survey data…

Descriptors: Foreign Countries, Computer Assisted Testing, Usability, Adoption (Ideas)

A Mixed Methods Approach to the Assessor's Targeting Behavior during Online Peer Assessment: Effects of Anonymity and Underlying Reasons

Peer reviewed

Direct link

Yu, Fu-Yun; Sung, Shannon – Interactive Learning Environments, 2016

This study examined the effects of identity revelation and concealment on the number of times students' work was assessed in an online peer assessment context. It also examined the underlying reasons guiding the assessor's targeting behavior. Two fifth-grade classes participated. The one-group pretest-posttest experimental research design coupled…

Descriptors: Foreign Countries, Elementary School Students, Grade 5, Student Evaluation

Previous Page | Next Page »

Pages: 1 | 2

Language Assessment Quarterly	3
ETS Research Report Series	2
English Language Teaching	2
Advances in Physiology…	1
Assessment in Education:…	1
British Journal of…	1
CALICO Journal	1
Education Journal	1
Educational Research and…	1
European Journal of Open,…	1
Innovation in Language…	1
Interactive Learning…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Language Testing	1
ProQuest LLC	1
Turkish Online Journal of…	1
More ▼

Coniam, David	4
Kunnan, Antony John	2
Ahmet Can Uyar	1
Amanda Huee-Ping Wong	1
Amrane-Cooper, Linda	1
Breyer, F. Jay	1
Burton, John Dylan	1
Casabianca, Jodi M.	1
Dilek Büyükahiska	1
Eckes, Thomas	1
Galaczi, Evelina	1
Han, Chao	1
Hatzipanagos, Stylianos	1
Hoang, Giang Thi Linh	1
Hsu, Huei-Lien	1
Ivan Cherh Chiet Low	1
Jin, Kuan-Yu	1
Jones, Edmund	1
Karim Sadeghi	1
Keller, Stefan	1
Krüger, Maleika	1
Köller, Olaf	1
Laxton, Victoria	1
Lee, Chun-Yi	1
Li, Shuai	1
More ▼