Publication Date
In 2025 | 1 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 12 |
Since 2016 (last 10 years) | 12 |
Since 2006 (last 20 years) | 13 |
Descriptor
Artificial Intelligence | 18 |
Comparative Analysis | 18 |
Evaluation Methods | 18 |
Computer Software | 9 |
Models | 6 |
Decision Making | 5 |
Evaluators | 5 |
Scores | 5 |
Classification | 4 |
Man Machine Systems | 4 |
Second Language Instruction | 4 |
More ▼ |
Source
Author
Ando, Yuji | 1 |
Baker, Eva L. | 1 |
Brett Bligh | 1 |
Butler, Frances A. | 1 |
Chang Xu | 1 |
Di Zou | 1 |
Fu Lee Wang | 1 |
Fukui, Sora | 1 |
Guerrero Bote, Vicente P. | 1 |
Haddawy, Peter | 1 |
Haifeng Luo | 1 |
More ▼ |
Publication Type
Journal Articles | 13 |
Reports - Research | 12 |
Reports - Descriptive | 3 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 3 |
Collected Works - Proceedings | 1 |
Information Analyses | 1 |
Opinion Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Location
South Korea | 2 |
Asia | 1 |
Australia | 1 |
Brazil | 1 |
China | 1 |
Connecticut | 1 |
Denmark | 1 |
Egypt | 1 |
Estonia | 1 |
Florida | 1 |
Germany | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Comprehensive Tests of Basic… | 1 |
What Works Clearinghouse Rating
Mark Johnson; Rafiq Saleh – Interactive Learning Environments, 2024
Educational assessment is inherently uncertain, where physiological, psychological and social factors play an important role in establishing judgements which are assumed to be "absolute". AI and other algorithmic approaches to grading of student work strip-out uncertainty, leading to a lack of inspectability in machine judgement and…
Descriptors: Artificial Intelligence, Evaluation Methods, Technology Uses in Education, Man Machine Systems
Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024
RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…
Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics
Hyemin Yoon; HyunJin Kim; Sangjin Kim – Measurement: Interdisciplinary Research and Perspectives, 2024
We have maintained the customer grade system that is being implemented to customers with excellent performance through customer segmentation for years. Currently, financial institutions that operate the customer grade system provide similar services based on the score calculation criteria, but the score calculation criteria vary from the financial…
Descriptors: Classification, Artificial Intelligence, Prediction, Decision Making
Xieling Chen; Haoran Xie; Di Zou; Lingling Xu; Fu Lee Wang – Educational Technology & Society, 2025
In massive open online course (MOOC) environments, computer-based analysis of course reviews enables instructors and course designers to develop intervention strategies and improve instruction to support learners' learning. This study aimed to automatically and effectively identify learners' concerned topics within their written reviews. First, we…
Descriptors: Classification, MOOCs, Teaching Skills, Artificial Intelligence
Kevin C. Haudek; Xiaoming Zhai – International Journal of Artificial Intelligence in Education, 2024
Argumentation, a key scientific practice presented in the "Framework for K-12 Science Education," requires students to construct and critique arguments, but timely evaluation of arguments in large-scale classrooms is challenging. Recent work has shown the potential of automated scoring systems for open response assessments, leveraging…
Descriptors: Accuracy, Persuasive Discourse, Artificial Intelligence, Learning Management Systems
Ma, Boxuan; Hettiarachchi, Gayan Prasad; Fukui, Sora; Ando, Yuji – International Educational Data Mining Society, 2023
Vocabulary proficiency diagnosis plays an important role in the field of language learning, which aims to identify the level of vocabulary knowledge of a learner through his or her learning process periodically, and can be used to provide personalized materials and feedback in language-learning applications. Traditional approaches are widely…
Descriptors: Vocabulary Development, Second Language Instruction, Second Language Learning, Language Proficiency
Yun Long; Haifeng Luo; Yu Zhang – npj Science of Learning, 2024
This study explores the use of Large Language Models (LLMs), specifically GPT-4, in analysing classroom dialogue--a key task for teaching diagnosis and quality improvement. Traditional qualitative methods are both knowledge- and labour-intensive. This research investigates the potential of LLMs to streamline and enhance this process. Using…
Descriptors: Classroom Communication, Computational Linguistics, Chinese, Mathematics Instruction
Han Yu; Xinguo Li; Brett Bligh – International Journal of Web-Based Learning and Teaching Technologies, 2024
In the era of information technology, foreign language teachers should not only master the professional knowledge of foreign languages, but also master the theoretical knowledge and application skills of modern education technology, that is, have certain information literacy. This article studies the strategies to improve the information literacy…
Descriptors: Information Literacy, College Faculty, Second Language Learning, Second Language Instruction
Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…
Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis
Vannaprathip, Narumol; Haddawy, Peter; Schultheis, Holger; Suebnukarn, Siriwan – International Journal of Artificial Intelligence in Education, 2022
Virtual reality simulation has had a significant impact on training of psychomotor surgical skills, yet there is still a lack of work on its use to teach surgical decision making. This is particularly noteworthy given the recognized importance of decision making in achieving positive surgical outcomes. With the objective of filling this gap, we…
Descriptors: Intelligent Tutoring Systems, Decision Making, Surgery, Teaching Methods
Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023
Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…
Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Lopez-Pujalte, Cristina; Guerrero Bote, Vicente P.; Moya Anegon, Felix de – Information Processing & Management, 2002
Discussion of information retrieval, query optimization techniques, and relevance feedback focuses on genetic algorithms, which are derived from artificial intelligence techniques. Describes an evaluation of different genetic algorithms using a residual collection method and compares results with the Ide dec-hi method (Salton and Buckley, 1990…
Descriptors: Algorithms, Artificial Intelligence, Comparative Analysis, Evaluation Methods
Baker, Eva L.; And Others – 1988
Evaluation models are being developed for assessing artificial intelligence (AI) systems in terms of similar performance by groups of people. Natural language understanding and vision systems are the areas of concentration. In simplest terms, the goal is to norm a given natural language system's performance on a sample of people. The specific…
Descriptors: Artificial Intelligence, Comparative Analysis, Computer Assisted Testing, Computer Science

Seidel, Robert J.; Park, Ok-Choon – Journal of Educational Computing Research, 1994
Examines changes which have occurred in the development and evaluation of intelligent tutoring systems (ITSs), speculates on future directions, and proposes a conceptual model for the development of ITSs. Topics discussed include the effectiveness of ITSs; the need for multidisciplinary efforts; and internal and external evaluation. (Contains 78…
Descriptors: Artificial Intelligence, Comparative Analysis, Computer Assisted Instruction, Evaluation Methods
Previous Page | Next Page »
Pages: 1 | 2