ERIC - Search Results

Showing 1 to 15 of 18 results

The Guts of Assessment: A Digital Architecture for Machine Learning and Analogue Judgement

Peer reviewed

Mark Johnson; Rafiq Saleh – Interactive Learning Environments, 2024

Educational assessment is inherently uncertain, where physiological, psychological and social factors play an important role in establishing judgements which are assumed to be "absolute". AI and other algorithmic approaches to grading of student work strip-out uncertainty, leading to a lack of inspectability in machine judgement and…

Descriptors: Artificial Intelligence, Evaluation Methods, Technology Uses in Education, Man Machine Systems

Towards the Automatic Risk of Bias Assessment on Randomized Controlled Trials: A Comparison of RobotReviewer and Humans

Peer reviewed

Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024

RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…

Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics

Validation and Implementation of Customer Classification System Using Machine Learning

Peer reviewed

Hyemin Yoon; HyunJin Kim; Sangjin Kim – Measurement: Interdisciplinary Research and Perspectives, 2024

We have maintained the customer grade system that is being implemented to customers with excellent performance through customer segmentation for years. Currently, financial institutions that operate the customer grade system provide similar services based on the score calculation criteria, but the score calculation criteria vary from the financial…

Descriptors: Classification, Artificial Intelligence, Prediction, Decision Making

Automatic Classification of Chinese Programming MOOC Reviews Using Fine-Tuned BERTs and GPT-Augmented Data

Peer reviewed

Xieling Chen; Haoran Xie; Di Zou; Lingling Xu; Fu Lee Wang – Educational Technology & Society, 2025

In massive open online course (MOOC) environments, computer-based analysis of course reviews enables instructors and course designers to develop intervention strategies and improve instruction to support learners' learning. This study aimed to automatically and effectively identify learners' concerned topics within their written reviews. First, we…

Descriptors: Classification, MOOCs, Teaching Skills, Artificial Intelligence

Examining the Effect of Assessment Construct Characteristics on Machine Learning Scoring of Scientific Argumentation

Peer reviewed

Kevin C. Haudek; Xiaoming Zhai – International Journal of Artificial Intelligence in Education, 2024

Argumentation, a key scientific practice presented in the "Framework for K-12 Science Education," requires students to construct and critique arguments, but timely evaluation of arguments in large-scale classrooms is challenging. Recent work has shown the potential of automated scoring systems for open response assessments, leveraging…

Descriptors: Accuracy, Persuasive Discourse, Artificial Intelligence, Learning Management Systems

Exploring the Effectiveness of Vocabulary Proficiency Diagnosis Using Linguistic Concept and Skill Modeling

Peer reviewed

Ma, Boxuan; Hettiarachchi, Gayan Prasad; Fukui, Sora; Ando, Yuji – International Educational Data Mining Society, 2023

Vocabulary proficiency diagnosis plays an important role in the field of language learning, which aims to identify the level of vocabulary knowledge of a learner through his or her learning process periodically, and can be used to provide personalized materials and feedback in language-learning applications. Traditional approaches are widely…

Descriptors: Vocabulary Development, Second Language Instruction, Second Language Learning, Language Proficiency

Evaluating Large Language Models in Analysing Classroom Dialogue

Peer reviewed

Yun Long; Haifeng Luo; Yu Zhang – npj Science of Learning, 2024

This study explores the use of Large Language Models (LLMs), specifically GPT-4, in analysing classroom dialogue--a key task for teaching diagnosis and quality improvement. Traditional qualitative methods are both knowledge- and labour-intensive. This research investigates the potential of LLMs to streamline and enhance this process. Using…

Descriptors: Classroom Communication, Computational Linguistics, Chinese, Mathematics Instruction

Strategies of Improving Information Literacy of College Foreign Language Teachers under the Background of Artificial Intelligence

Peer reviewed

Han Yu; Xinguo Li; Brett Bligh – International Journal of Web-Based Learning and Teaching Technologies, 2024

In the era of information technology, foreign language teachers should not only master the professional knowledge of foreign languages, but also master the theoretical knowledge and application skills of modern education technology, that is, have certain information literacy. This article studies the strategies to improve the information literacy…

Descriptors: Information Literacy, College Faculty, Second Language Learning, Second Language Instruction

Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial

Peer reviewed

Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024

In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…

Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis

Intelligent Tutoring for Surgical Decision Making: A Planning-Based Approach

Peer reviewed

Vannaprathip, Narumol; Haddawy, Peter; Schultheis, Holger; Suebnukarn, Siriwan – International Journal of Artificial Intelligence in Education, 2022

Virtual reality simulation has had a significant impact on training of psychomotor surgical skills, yet there is still a lack of work on its use to teach surgical decision making. This is particularly noteworthy given the recognized importance of decision making in achieving positive surgical outcomes. With the objective of filling this gap, we…

Descriptors: Intelligent Tutoring Systems, Decision Making, Surgery, Teaching Methods

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

A Test of Genetic Algorithms in Relevance Feedback.

Peer reviewed

Lopez-Pujalte, Cristina; Guerrero Bote, Vicente P.; Moya Anegon, Felix de – Information Processing & Management, 2002

Discussion of information retrieval, query optimization techniques, and relevance feedback focuses on genetic algorithms, which are derived from artificial intelligence techniques. Describes an evaluation of different genetic algorithms using a residual collection method and compares results with the Ide dec-hi method (Salton and Buckley, 1990…

Descriptors: Algorithms, Artificial Intelligence, Comparative Analysis, Evaluation Methods

Directly Comparing Computer and Human Performance in Language Understanding and Visual Reasoning.

Baker, Eva L.; And Others – 1988

Evaluation models are being developed for assessing artificial intelligence (AI) systems in terms of similar performance by groups of people. Natural language understanding and vision systems are the areas of concentration. In simplest terms, the goal is to norm a given natural language system's performance on a sample of people. The specific…

Descriptors: Artificial Intelligence, Comparative Analysis, Computer Assisted Testing, Computer Science

An Historical Perspective and a Model for Evaluation of Intelligent Tutoring Systems.

Peer reviewed

Seidel, Robert J.; Park, Ok-Choon – Journal of Educational Computing Research, 1994

Examines changes which have occurred in the development and evaluation of intelligent tutoring systems (ITSs), speculates on future directions, and proposes a conceptual model for the development of ITSs. Topics discussed include the effectiveness of ITSs; the need for multidisciplinary efforts; and internal and external evaluation. (Contains 78…

Descriptors: Artificial Intelligence, Comparative Analysis, Computer Assisted Instruction, Evaluation Methods
