Priti Oli; Rabin Banjade; Jeevan Chapagain; Vasile Rus – Grantee Submission, 2024
Assessing students' answers, and in particular natural language answers, is a crucial challenge in the field of education. Advances in transformer-based models, such as Large Language Models (LLMs), have led to significant progress in various natural language tasks. Nevertheless, amidst the growing trend of evaluating LLMs across diverse tasks,…
Descriptors: Student Evaluation, Computer Assisted Testing, Artificial Intelligence, Comprehension
Andreea Dutulescu; Stefan Ruseti; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024
Assessing the difficulty of reading comprehension questions is crucial to educational methodologies and language understanding technologies. Traditional methods of assessing question difficulty frequently rely on human judgments or shallow metrics, often failing to accurately capture the intricate cognitive demands of answering a question. This…
Descriptors: Difficulty Level, Reading Tests, Test Items, Reading Comprehension
Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023
Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…
Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy
Sylla, Khalifa; Babou, Birahim; Ouya, Samuel – International Association for Development of the Information Society, 2022
This paper deals with a solution allowing digital universities to extend the functionalities of their distance learning platforms to offer a secure solution for the dematerialization of assessments. Currently we are witnessing the rise of digital universities; this is the case in Africa, particularly in Senegal. We are witnessing strong growth in…
Descriptors: Foreign Countries, Virtual Universities, Computer Assisted Testing, Educational Technology
Botarleanu, Robert-Mihai; Dascalu, Mihai; Allen, Laura K.; Crossley, Scott Andrew; McNamara, Danielle S. – Grantee Submission, 2021
Text summarization is an effective reading comprehension strategy. However, summary evaluation is complex and must account for various factors including the summary and the reference text. This study examines a corpus of approximately 3,000 summaries based on 87 reference texts, with each summary being manually scored on a 4-point Likert scale.…
Descriptors: Computer Assisted Testing, Scoring, Natural Language Processing, Computer Software
Baral, Sami; Botelho, Anthony; Santhanam, Abhishek; Gurung, Ashish; Cheng, Li; Heffernan, Neil – International Educational Data Mining Society, 2023
Teachers often rely on a range of open-ended problems to assess students' understanding of mathematical concepts. Beyond traditional conceptions of student open-ended work, commonly in the form of textual short-answer or essay responses, figures, tables, number lines, graphs, and pictographs are other examples of open-ended…
Descriptors: Mathematics Instruction, Mathematical Concepts, Problem Solving, Test Format
Derar Serhan; Natalie Welcome – International Society for Technology, Education, and Science, 2023
Recently, institutions have increased their online course offerings as well as their online degrees. With this significant growth in online offerings, assessment integrity becomes a concern. In response to this concern, many institutions have adopted the use of online proctoring services. The aim of using these online proctoring services is to…
Descriptors: Computer Assisted Testing, Privacy, Student Attitudes, Ethics
Matayoshi, Jeffrey; Uzun, Hasan; Cosyn, Eric – International Educational Data Mining Society, 2022
Knowledge space theory (KST) is a mathematical framework for modeling and assessing student knowledge. While KST has successfully served as the foundation of several learning systems, recent advancements in machine learning provide an opportunity to improve on purely KST-based approaches to assessing student knowledge. As such, in this work we…
Descriptors: Knowledge Level, Mathematical Models, Learning Experience, Comparative Analysis
Doris Zahner; Jeffrey T. Steedle; James Soland; Catherine Welch; Qi Qin; Kathryn Thompson; Richard Phelps – Online Submission, 2023
The "Standards for Educational and Psychological Testing" have served as a cornerstone for best practices in assessment. As the field evolves, so must these standards, with regular revisions ensuring they reflect current knowledge and practice. The National Council on Measurement in Education (NCME) conducted a survey to gather feedback…
Descriptors: Standards, Educational Assessment, Psychological Testing, Best Practices
Samarakou, Maria; Fylladitakis, Emmanouil D.; Prentakis, Pantelis; Athineos, Spyros – International Association for Development of the Information Society, 2014
In laboratory courses, the assessment of exercises and assignments is typically treated as a simple, quantifiable process. This approach, however, rarely includes qualitative factors, especially if the grading is performed automatically by the system, and provides little to no feedback for the students to reflect on their work. The role of…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Evaluation Methods, Engineering Education
Madhyastha, Tara M.; Tanimoto, Steven – International Working Group on Educational Data Mining, 2009
Most of the emphasis on mining online assessment logs has been to identify content-specific errors. However, the pattern of general "consistency" is domain independent, strongly related to performance, and can itself be a target of educational data mining. We demonstrate that simple consistency indicators are related to student outcomes,…
Descriptors: Web Based Instruction, Computer Assisted Testing, Computer Software, Computer Science Education
Brammer, Robert – 1997
The ways in which experience and academic training influence the types of questions therapists ask of their clients were studied. Psychologists and psychology students of varying degrees of expertise (n=138) were provided with an artificial intelligence program that simulated a date rape client's responses to typed questions and were asked to conduct a…
Descriptors: Artificial Intelligence, Clinical Experience, Computer Assisted Testing, Knowledge Level
Baker, Eva L.; And Others – 1988
Evaluation models are being developed for assessing artificial intelligence (AI) systems in terms of similar performance by groups of people. Natural language understanding and vision systems are the areas of concentration. In simplest terms, the goal is to norm a given natural language system's performance on a sample of people. The specific…
Descriptors: Artificial Intelligence, Comparative Analysis, Computer Assisted Testing, Computer Science
Perkins, Kyle; And Others – 1994
This paper reports the results of using a three-layer backpropagation artificial neural network to predict item difficulty in a reading comprehension test. Two network structures were developed, one with and one without a sigmoid function in the output processing unit. The data set, which consisted of a table of coded test items and corresponding…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Expert Systems, Item Analysis
Tatsuoka, Kikumi K.; Tatsuoka, Maurice M. – 1986
The rule space model permits measurement of cognitive skill acquisition, diagnosis of cognitive errors, and detection of the strengths and weaknesses of knowledge possessed by individuals. Two ways to classify an individual into his or her most plausible latent state of knowledge include: (1) hypothesis testing--Bayes' decision rules for minimum…
Descriptors: Artificial Intelligence, Bayesian Statistics, Cognitive Development, Computer Assisted Testing