ERIC - Search Results

Showing all 5 results

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

Generalizability Theory Approach to Analyzing Automated-Item Generated Test Forms

Peer reviewed

Stella Y. Kim; Sungyeun Kim – Educational Measurement: Issues and Practice, 2025

This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A…

Descriptors: Generalizability Theory, Automation, Test Items, Students

Creating Short Forms of Early Childhood Development Measures: A Framework for Quantifying Statistical, Conceptual, and Practical Tradeoffs in Direct Assessment. EdWorkingPaper No. 25-1143

Jonathan Seiden – Annenberg Institute for School Reform at Brown University, 2025

Direct assessments of early childhood development (ECD) are a cornerstone of research in developmental psychology and are increasingly used to evaluate programs and policies in lower- and middle-income countries. Despite strong psychometric properties, these assessments are too expensive and time consuming for use in large-scale monitoring or…

Descriptors: Young Children, Child Development, Performance Based Assessment, Developmental Psychology

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Computer-Based Answer-Until-Correct and Elaborated Feedback: Effects on Affective-Motivational and Performance Outcomes

Peer reviewed

Ute Mertens; Marlit A. Lindner – Journal of Computer Assisted Learning, 2025

Background: Educational assessments increasingly shift towards computer-based formats. Many studies have explored how different types of automated feedback affect learning. However, few studies have investigated how digital performance feedback affects test takers' ratings of affective-motivational reactions during a testing session. Method: In…

Descriptors: Educational Assessment, Computer Assisted Testing, Automation, Feedback (Response)
