ERIC - Search Results



Showing all 7 results

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

Does the Format of an Assessment (Closed Book or Open Book) Affect Learning? A Systematic Review of the Literature

Peer reviewed

Vahe Permzadian; Kit W. Cho – Teaching in Higher Education, 2025

When administering an in-class exam, a common decision that confronts every instructor is whether the exam format should be closed book or open book. The present review synthesizes research examining the effect of administering closed-book or open-book assessments on long-term learning. Although the overall effect of assessment format on learning…

Descriptors: College Students, Tests, Test Format, Long Term Memory

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

Assessing Scientific Inquiry: A Systematic Literature Review of Tasks, Tools and Techniques

Peer reviewed

De Van Vo; Geraldine Mooney Simmie – International Journal of Science and Mathematics Education, 2025

While national curricula in science education highlight the importance of inquiry-based learning, assessing students' capabilities in scientific inquiry remains a subject of debate. Our study explored the construction, developmental trends and validation techniques in relation to assessing scientific inquiry using a systematic literature review…

Descriptors: Science Education, Inquiry, Science Process Skills, Student Evaluation

The Cronbach's Alpha of Domain-Specific Knowledge Tests before and after Learning: A Meta-Analysis of Published Studies

Peer reviewed

Peter A. Edelsbrunner; Bianca A. Simonsmeier; Michael Schneider – Educational Psychology Review, 2025

Knowledge is an important predictor and outcome of learning and development. Its measurement is challenged by the fact that knowledge can be integrated and homogeneous, or fragmented and heterogeneous, which can change through learning. These characteristics of knowledge are at odds with current standards for test development, demanding a high…

Descriptors: Meta Analysis, Predictor Variables, Learning Processes, Knowledge Level

Exploring Speededness in Pre-Reform GCSEs (2009 to 2016)

Emma Walland – Research Matters, 2024

GCSE examinations (taken by students aged 16 years in England) are not intended to be speeded (i.e. to be partly a test of how quickly students can answer questions). However, there has been little research exploring this. The aim of this research was to explore the speededness of past GCSE written examinations, using only the data from scored…

Descriptors: Educational Change, Test Items, Item Analysis, Scoring

Measuring Mathematical Skills in Early Childhood: A Systematic Review of the Psychometric Properties of Early Maths Assessments and Screeners

Peer reviewed

Laura A. Outhwaite; Pirjo Aunio; Jaimie Ka Yu Leung; Jo Van Herwegen – Educational Psychology Review, 2024

Successful early mathematical development is vital to children's later education, employment, and wellbeing outcomes. However, established measurement tools are infrequently used to (i) assess children's mathematical skills and (ii) identify children with or at-risk of mathematical learning difficulties. In response, this pre-registered systematic…

Descriptors: Mathematics Tests, Screening Tests, Mathematics Skills, At Risk Students

