ERIC - Search Results

Publication Date

In 2025	0
Since 2024	5
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	8

Descriptor

Artificial Intelligence	8
Computer Assisted Testing	8
Natural Language Processing	3
Scoring	3
Academic Achievement	2
Algorithms	2
Automation	2
Computer Software	2
Difficulty Level	2
Evaluation Methods	2
Reading Comprehension	2
Semantics	2
Student Evaluation	2
Test Items	2
Test Validity	2
Accuracy	1
Coding	1
Comprehension	1
Computation	1
Computational Linguistics	1
Computer Games	1
Computer Interfaces	1
Correlation	1
Creative Thinking	1
Creativity Tests	1
More ▼

Source

Grantee Submission

Publication Type

Reports - Research	7
Speeches/Meeting Papers	3
Journal Articles	1
Reports - Descriptive	1

Education Level

Early Childhood Education	1
Elementary Education	1
Preschool Education	1

Audience

Location

Pennsylvania (Pittsburgh)

Laws, Policies, & Programs

Assessments and Surveys

Torrance Tests of Creative…

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Automated Assessment of Students' Code Comprehension Using LLMs

Peer reviewed

Priti Oli; Rabin Banjade; Jeevan Chapagain; Vasile Rus – Grantee Submission, 2024

Assessing students' answers and in particular natural language answers is a crucial challenge in the field of education. Advances in transformer-based models such as Large Language Models (LLMs), have led to significant progress in various natural language tasks. Nevertheless, amidst the growing trend of evaluating LLMs across diverse tasks,…

Descriptors: Student Evaluation, Computer Assisted Testing, Artificial Intelligence, Comprehension

How Hard Can This Question Be? An Exploratory Analysis of Features Assessing Question Difficulty Using LLMs

Peer reviewed

Andreea Dutulescu; Stefan Ruseti; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024

Assessing the difficulty of reading comprehension questions is crucial to educational methodologies and language understanding technologies. Traditional methods of assessing question difficulty rely frequently on human judgments or shallow metrics, often failing to accurately capture the intricate cognitive demands of answering a question. This…

Descriptors: Difficulty Level, Reading Tests, Test Items, Reading Comprehension

Automated Pipeline for Multi-Lingual Automated Essay Scoring with ReaderBench

Peer reviewed

Direct link

Stefan Ruseti; Ionut Paraschiv; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024

Automated Essay Scoring (AES) is a well-studied problem in Natural Language Processing applied in education. Solutions vary from handcrafted linguistic features to large Transformer-based models, implying a significant effort in feature extraction and model implementation. We introduce a novel Automated Machine Learning (AutoML) pipeline…

Descriptors: Computer Assisted Testing, Scoring, Automation, Essays

Artificial Intelligence-Based Assessment in Education

Peer reviewed
PDF on ERIC

Download full text

Direct link

Ying Fang; Rod D. Roscoe; Danielle S. McNamara – Grantee Submission, 2023

Artificial Intelligence (AI) based assessments are commonly used in a variety of settings including business, healthcare, policing, manufacturing, and education. In education, AI-based assessments undergird intelligent tutoring systems as well as many tools used to evaluate students and, in turn, guide learning and instruction. This chapter…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Student Evaluation, Evaluation Methods

Toward Argument-Based Fairness with an Application to AI-Enhanced Educational Assessments

Peer reviewed
PDF on ERIC

Download full text

Direct link

A. Corinne Huggins-Manley; Brandon M. Booth; Sidney K. D'Mello – Grantee Submission, 2022

The field of educational measurement places validity and fairness as central concepts of assessment quality (AERA, APA, NCME, 2014). Prior research has proposed embedding fairness arguments within argument-based validity processes, particularly when fairness is conceived as comparability in assessment properties across groups (Chapelle, 2021; Xi,…

Descriptors: Educational Assessment, Persuasive Discourse, Validity, Artificial Intelligence

Incorporating Evidence-Based Gamification and Machine Learning to Assess Preschool Executive Function: A Feasibility Study

Peer reviewed

Direct link

Cassondra M. Eng; Aria Tsegai-Moore; Anna V. Fisher – Grantee Submission, 2024

Computerized assessments and digital games have become more prevalent in childhood, necessitating a systematic investigation of the effects of gamified executive function assessments on performance and engagement. This study examined the feasibility of incorporating gamification and a machine learning algorithm that adapts task difficulty to…

Descriptors: Preschool Children, Preschool Curriculum, Preschool Education, Preschool Tests

Automated Summary Scoring with Readerbench

Peer reviewed
PDF on ERIC

Download full text

Direct link

Botarleanu, Robert-Mihai; Dascalu, Mihai; Allen, Laura K.; Crossley, Scott Andrew; McNamara, Danielle S. – Grantee Submission, 2021

Text summarization is an effective reading comprehension strategy. However, summary evaluation is complex and must account for various factors including the summary and the reference text. This study examines a corpus of approximately 3,000 summaries based on 87 reference texts, with each summary being manually scored on a 4-point Likert scale.…

Descriptors: Computer Assisted Testing, Scoring, Natural Language Processing, Computer Software

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Direct link

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Danielle S. McNamara	3
Mihai Dascalu	2
Stefan Ruseti	2
A. Corinne Huggins-Manley	1
Allen, Laura K.	1
Andreea Dutulescu	1
Anna V. Fisher	1
Aria Tsegai-Moore	1
Botarleanu, Robert-Mihai	1
Brandon M. Booth	1
Cassondra M. Eng	1
Crossley, Scott Andrew	1
Dascalu, Mihai	1
Denis Dumas	1
Ionut Paraschiv	1
Jeevan Chapagain	1
Kelly Berthiaume	1
McNamara, Danielle S.	1
Peter Organisciak	1
Priti Oli	1
Rabin Banjade	1
Rod D. Roscoe	1
Selcuk Acar	1
Sidney K. D'Mello	1
Vasile Rus	1
More ▼