ERIC - Search Results

Showing 1 to 15 of 17 results

Exploring the Effects of Small Item Pools on Examinee Achievement Estimates for Computer-Adaptive Tests: A Simulation Study

Peer reviewed

Beyza Aksu Dunya; Stefanie Wind – International Journal of Testing, 2025

We explored the practicality of relatively small item pools in the context of low-stakes Computer-Adaptive Testing (CAT), such as CAT procedures that might be used for quick diagnostic or screening exams. We used a basic CAT algorithm without content balancing and exposure control restrictions to reflect low stakes testing scenarios. We examined…

Descriptors: Item Banks, Adaptive Testing, Computer Assisted Testing, Achievement

GRAD-AI: An Automated Grading Tool for Code Assessment and Feedback in Programming Course

Peer reviewed

Ishaya Gambo; Faith-Jane Abegunde; Omobola Gambo; Roseline Oluwaseun Ogundokun; Akinbowale Natheniel Babatunde; Cheng-Chi Lee – Education and Information Technologies, 2025

The current educational system relies heavily on manual grading, posing challenges such as delayed feedback and grading inaccuracies. Automated grading tools (AGTs) offer solutions but come with limitations. To address this, "GRAD-AI" is introduced, an advanced AGT that combines automation with teacher involvement for precise grading,…

Descriptors: Automation, Grading, Artificial Intelligence, Computer Assisted Testing

Using GPT-4 to Augment Imbalanced Data for Automatic Scoring

Peer reviewed

Luyang Fang; Gyeonggeon Lee; Xiaoming Zhai – Journal of Educational Measurement, 2025

Machine learning-based automatic scoring faces challenges with imbalanced student responses across scoring categories. To address this, we introduce a novel text data augmentation framework that leverages GPT-4, a generative large language model specifically tailored for imbalanced datasets in automatic scoring. Our experimental dataset consisted…

Descriptors: Computer Assisted Testing, Artificial Intelligence, Automation, Scoring

Two-Phase Content-Balancing CD-CAT Online Item Calibration

Peer reviewed

Jing Huang; Yuxiao Zhang; Jason W. Morphew; Jayson M. Nissen; Ben Van Dusen; Hua Hua Chang – Journal of Educational Measurement, 2025

Online calibration estimates new item parameters alongside previously calibrated items, supporting efficient item replenishment. However, most existing online calibration procedures for Cognitive Diagnostic Computerized Adaptive Testing (CD-CAT) lack mechanisms to ensure content balance during live testing. This limitation can lead to uneven…

Descriptors: Adaptive Testing, Computer Assisted Testing, Cognitive Measurement, Test Items

Automated Scoring of Figural Tests of Creativity with Computer Vision

Peer reviewed

Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025

In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…

Descriptors: Scoring, Computer Assisted Testing, Models, Correlation

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Automatic Prompt Engineering for Automatic Scoring

Peer reviewed

Mingfeng Xue; Yunting Liu; Xingyao Xiao; Mark Wilson – Journal of Educational Measurement, 2025

Prompts play a crucial role in eliciting accurate outputs from large language models (LLMs). This study examines the effectiveness of an automatic prompt engineering (APE) framework for automatic scoring in educational measurement. We collected constructed-response data from 930 students across 11 items and used human scores as the true labels. A…

Descriptors: Computer Assisted Testing, Prompting, Educational Assessment, Automation

Efficiency of PROMIS MCAT Assessments for Orthopaedic Care

Peer reviewed

Michael Bass; Scott Morris; Sheng Zhang – Measurement: Interdisciplinary Research and Perspectives, 2025

Administration of patient-reported outcome measures (PROs) using multidimensional computer adaptive tests (MCATs) has the potential to reduce patient burden, but the efficiency of MCAT depends on the degree to which an individual's responses fit the psychometric properties of the assessment. Assessing patients' symptom burden through the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Patients, Outcome Measures

GAI versus Teacher Scoring: Which Is Better for Assessing Student Performance?

Peer reviewed

Xuefan Li; Marco Zappatore; Tingsong Li; Weiwei Zhang; Sining Tao; Xiaoqing Wei; Xiaoxu Zhou; Naiqing Guan; Anny Chan – IEEE Transactions on Learning Technologies, 2025

The integration of generative artificial intelligence (GAI) into educational settings offers unprecedented opportunities to enhance the efficiency of teaching and the effectiveness of learning, particularly within online platforms. This study evaluates the development and application of a customized GAI-powered teaching assistant, trained…

Descriptors: Artificial Intelligence, Technology Uses in Education, Student Evaluation, Academic Achievement

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

Generative AI in Education: ChatGPT-4 in Evaluating Students' Written Responses

Peer reviewed

Jussi S. Jauhiainen; Agustín Garagorry Guerra – Innovations in Education and Teaching International, 2025

The study highlights ChatGPT-4's potential in educational settings for the evaluation of university students' open-ended written examination responses. ChatGPT-4 evaluated 54 written responses, ranging from 24 to 256 words in English. It assessed each response using five criteria and assigned a grade on a six-point scale from fail to excellent,…

Descriptors: Artificial Intelligence, Technology Uses in Education, Student Evaluation, Writing Evaluation

Integration of Game Evaluation Methods into the Design of Human-Computer Interaction Course Test System

Peer reviewed

Chunsong Jiang; Xuan Chen; Aiping Yu; Guiqin Liang – Education and Information Technologies, 2025

Assignments and tests are the main forms of evaluation in the educational process, but students usually lose interest in boring exercises during course learning. Inspired by elements of human-computer battle games, a course test system is designed to encourage students to take tests more frequently and actively to achieve a better learning effect,…

Descriptors: Computer Games, Educational Games, Game Based Learning, Competition

Utilizing Response Time for Item Selection in On-the-Fly Multistage Adaptive Testing for PISA Assessment

Peer reviewed

Xiuxiu Tang; Yi Zheng; Tong Wu; Kit-Tai Hau; Hua-Hua Chang – Journal of Educational Measurement, 2025

Multistage adaptive testing (MST) has been recently adopted for international large-scale assessments such as Programme for International Student Assessment (PISA). MST offers improved measurement efficiency over traditional nonadaptive tests and improved practical convenience over single-item-adaptive computerized adaptive testing (CAT). As a…

Descriptors: Reaction Time, Test Items, Achievement Tests, Foreign Countries

Comparative Analysis of Morphosyntactic Rule Learning among Monolingual, Bilingual, and Trilingual Speakers: A Study on Spanish Preterite Forms

Peer reviewed

Roha M. Kaipa; Sarah Wendelbo – International Journal of Multilingualism, 2025

The research on language acquisition and retention has primarily focused on monolinguals and bilinguals, with comparatively few studies including trilinguals. To address this gap, the current study compares the acquisition and retention of a novel morphosyntactic rule in Spanish in twelve monolinguals, twelve bilinguals, and twelve trilinguals.…

Descriptors: Multilingualism, Second Language Instruction, Second Language Learning, Spanish
