ERIC - Search Results

Publication Date

In 2025	9
Since 2024	30
Since 2021 (last 5 years)	64
Since 2016 (last 10 years)	77
Since 2006 (last 20 years)	80

Descriptor

Accuracy	80
Computational Linguistics	80
Computer Software	80
English (Second Language)	34
Second Language Learning	34
Comparative Analysis	31
Artificial Intelligence	29
Foreign Countries	29
Second Language Instruction	28
Teaching Methods	22
Translation	17
Writing Evaluation	16
Undergraduate Students	15
Language Usage	14
Scores	14
Evaluators	13
Feedback (Response)	12
Natural Language Processing	12
Student Attitudes	12
Writing Instruction	12
Classification	11
Essays	11
Language Processing	11
Models	11
Computer Assisted Testing	10
More ▼

Publication Type

Reports - Research	67
Journal Articles	63
Dissertations/Theses -…	9
Tests/Questionnaires	9
Speeches/Meeting Papers	5
Reports - Descriptive	3
Information Analyses	2

Education Level

Higher Education	34
Postsecondary Education	34
Elementary Education	5
Secondary Education	3
Early Childhood Education	2
Elementary Secondary Education	2
High Schools	2
Kindergarten	2
Primary Education	2
Grade 10	1
Grade 11	1
Grade 9	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Location

China	6
Japan	4
Germany	2
Iran	2
United Kingdom	2
Yemen	2
Canada	1
Georgia	1
Hong Kong	1
Indonesia	1
Saudi Arabia	1
Singapore	1
South Korea	1
South Korea (Seoul)	1
Spain	1
Taiwan	1
Ukraine	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Flesch Kincaid Grade Level…	1
Force Concept Inventory	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 80 results Save | Export

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Direct link

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Can Large Language Models Replace Humans in Systematic Reviews? Evaluating GPT-4's Efficacy in Screening and Extracting Data from Peer-Reviewed and Grey Literature in Multiple Languages

Peer reviewed

Direct link

Qusai Khraisha; Sophie Put; Johanna Kappenberg; Azza Warraitch; Kristin Hadfield – Research Synthesis Methods, 2024

Systematic reviews are vital for guiding practice, research and policy, although they are often slow and labour-intensive. Large language models (LLMs) could speed up and automate systematic reviews, but their performance in such tasks has yet to be comprehensively evaluated against humans, and no study has tested Generative Pre-Trained…

Descriptors: Peer Evaluation, Research Reports, Artificial Intelligence, Computer Software

An Expression of Fear Realized in the Form of Sentences in the "Stranger Things" Movie

Peer reviewed
PDF on ERIC

Download full text

Arbain – Indonesian Journal of English Language Teaching and Applied Linguistics, 2023

This study aims to investigate the types and functions of expressions of fear realized in the form of sentences. With a special context in horror movies, the researcher attempted to reveal the types and functions of fear expressions such as directive, commissive, expressive, assertive, and declarative. This research focuses on the subtitles of the…

Descriptors: Films, Speech Acts, Accuracy, Fear

Claude, ChatGPT, Copilot, and Gemini Performance versus Students in Different Topics of Neuroscience

Peer reviewed

Direct link

Volodymyr Mavrych; Ahmed Yaqinuddin; Olena Bolgova – Advances in Physiology Education, 2025

Despite extensive studies on large language models and their capability to respond to questions from various licensed exams, there has been limited focus on employing chatbots for specific subjects within the medical curriculum, specifically medical neuroscience. This research compared the performances of Claude 3.5 Sonnet (Anthropic), GPT-3.5 and…

Descriptors: Artificial Intelligence, Computer Software, Neurosciences, Medical Education

Assessment of Large Language Models' Performances and Hallucinations for Chinese Postgraduate Medical Entrance Examination

Peer reviewed

Direct link

Hongfei Ye; Jian Xu; Danqing Huang; Meng Xie; Jinming Guo; Junrui Yang; Haiwei Bao; Mingzhi Zhang; Ce Zheng – Discover Education, 2025

This study evaluates Large language models (LLMs)' performance on Chinese Postgraduate Medical Entrance Examination (CPGMEE) as well as the hallucinations produced by LLMs and investigate their implications for medical education. We curated 10 trials of mock CPGMEE to evaluate the performances of 4 LLMs (GPT-4.0, ChatGPT, QWen 2.1 and Ernie 4.0).…

Descriptors: College Entrance Examinations, Foreign Countries, Computational Linguistics, Graduate Medical Education

Research on Intelligent Translation System of Spoken English Based on Cyclic Neural Network Model

Peer reviewed

Direct link

Jie Zhang – International Journal of Information and Communication Technology Education, 2024

This paper explores the development of an intelligent translation system for spoken English using Recurrent Neural Network (RNN) models. The fundamental principles of RNNs and their advantages in processing sequential data, particularly in handling time-dependent natural language data, are discussed. The methodology for constructing the…

Descriptors: Oral Language, Translation, Computational Linguistics, Computer Software

Empowering Responsible Use of Large Language Models

Direct link

Xuandong Zhao – ProQuest LLC, 2024

The rapid advancement of powerful Large Language Models (LLMs), such as ChatGPT and Llama, has revolutionized the world by bringing new creative possibilities and enhancing productivity. However, these advancements also pose significant challenges and risks, including the potential for misuse in the form of fake news, academic dishonesty,…

Descriptors: Computational Linguistics, Intellectual Property, Artificial Intelligence, Productivity

Machine Translation: Turkish-English Bilingual Speakers' Accuracy Detection of Evidentiality and Preference of MT

Peer reviewed

Direct link

Sümeyra Tosun – Cognitive Research: Principles and Implications, 2024

Machine translation (MT) is the automated process of translating text between different languages, encompassing a wide range of language pairs. This study focuses on non-professional bilingual speakers of Turkish and English, aiming to assess their ability to discern accuracy in machine translations and their preferences regarding MT. A particular…

Descriptors: Bilingualism, Turkish, English (Second Language), Second Language Learning

Graders of the Future: Comparing the Consistency and Accuracy of GPT4 and Pre-Service Teachers in Physics Essay Question Assessments

Peer reviewed
PDF on ERIC

Download full text

Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025

As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…

Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Direct link

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Examining the Effect of Assessment Construct Characteristics on Machine Learning Scoring of Scientific Argumentation

Peer reviewed

Direct link

Kevin C. Haudek; Xiaoming Zhai – International Journal of Artificial Intelligence in Education, 2024

Argumentation, a key scientific practice presented in the "Framework for K-12 Science Education," requires students to construct and critique arguments, but timely evaluation of arguments in large-scale classrooms is challenging. Recent work has shown the potential of automated scoring systems for open response assessments, leveraging…

Descriptors: Accuracy, Persuasive Discourse, Artificial Intelligence, Learning Management Systems

Using AI-Based Detectors to Control AI-Assisted Plagiarism in ESL Writing: "The Terminator versus the Machines"

Peer reviewed

Direct link

Ibrahim, Karim – Language Testing in Asia, 2023

The release of ChatGPT marked the beginning of a new era of AI-assisted plagiarism that disrupts traditional assessment practices in ESL composition. In the face of this challenge, educators are left with little guidance in controlling AI-assisted plagiarism, especially when conventional methods fail to detect AI-generated texts. One approach to…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Artificial Intelligence

ChatGPT versus a Customized AI Chatbot (Anatbuddy) for Anatomy Education: A Comparative Pilot Study

Peer reviewed

Direct link

Gautham Arun; Vivek Perumal; Francis Paul John Bato Urias; Yan En Ler; Bryan Wen Tao Tan; Ranganath Vallabhajosyula; Emmanuel Tan; Olivia Ng; Kian Bee Ng; Sreenivasulu Reddy Mogali – Anatomical Sciences Education, 2024

Large Language Models (LLMs) have the potential to improve education by personalizing learning. However, ChatGPT-generated content has been criticized for sometimes producing false, biased, and/or hallucinatory information. To evaluate AI's ability to return clear and accurate anatomy information, this study generated a custom interactive and…

Descriptors: Artificial Intelligence, Teaching Methods, Computational Linguistics, Anatomy

Large Language Models as AI-Powered Educational Assistants: Comparing GPT-4 and Gemini for Writing Teaching Cases

Peer reviewed

Direct link

Guido Lang; Tamilla Triantoro; Jason H. Sharp – Journal of Information Systems Education, 2024

This study explores the potential of large language models (LLMs), specifically GPT-4 and Gemini, in generating teaching cases for information systems courses. A unique prompt for writing three different types of teaching cases such as a descriptive case, a normative case, and a project-based case on the same IS topic (i.e., the introduction of…

Descriptors: Computational Linguistics, Computer Software, Artificial Intelligence, Readability Formulas

The Intent of ChatGPT Usage and Its Robustness in Medical Proficiency Exams: A Systematic Review

Peer reviewed

Direct link

Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024

Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…

Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

ProQuest LLC	9
Grantee Submission	5
Computer Assisted Language…	3
Eurasian Journal of Applied…	3
Journal of Language and…	3
Research-publishing.net	3
Advances in Physiology…	2
Australian Journal of Applied…	2
Discover Education	2
Journal of Speech, Language,…	2
Language Learning & Technology	2
Language Teaching Research	2
Physical Review Physics…	2
Advanced Education	1
Anatomical Sciences Education	1
Arab World English Journal	1
Cognitive Research:…	1
Cognitive Science	1
Computers in the Schools	1
Discourse Processes: A…	1
English Language Teaching	1
English Teaching	1
IEEE Transactions on Learning…	1
Indonesian Journal of English…	1
Innovations in Education and…	1
More ▼

McNamara, Danielle S.	2
Olney, Andrew M.	2
Yarbro, Jeffrey T.	2
Abdalla, Mohamed	1
Ahmed Yaqinuddin	1
Ahn, Soojin	1
Alberto, Paul A.	1
Alex J. Mechaber	1
Amanda Huee-Ping Wong	1
Amparo Lázaro-Ibarrola	1
Arbain	1
Arehart, Kathryn H.	1
Ariamanesh, Ali A.	1
Awadh, Awadh Nasser Munassar	1
Ayaka Sugawara	1
Azza Warraitch	1
Barati, Hossein	1
Bavendiek, Ulrike	1
Bedrick, Steven	1
Bin Dahmash, Nada	1
Blake, John	1
Botarleanu, Robert-Mihai	1
Bowles, Anita R.	1
Bram Bulté	1
Brian E. Clauser	1
More ▼