Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025
This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…
Descriptors: Artificial Intelligence, Test Items, Automation, Test Format
Vahe Permzadian; Kit W. Cho – Teaching in Higher Education, 2025
When administering an in-class exam, a common decision that confronts every instructor is whether the exam format should be closed book or open book. The present review synthesizes research examining the effect of administering closed-book or open-book assessments on long-term learning. Although the overall effect of assessment format on learning…
Descriptors: College Students, Tests, Test Format, Long Term Memory
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
De Van Vo; Geraldine Mooney Simmie – International Journal of Science and Mathematics Education, 2025
While national curricula in science education highlight the importance of inquiry-based learning, assessing students' capabilities in scientific inquiry remains a subject of debate. Our study explored the construction, developmental trends and validation techniques in relation to assessing scientific inquiry using a systematic literature review…
Descriptors: Science Education, Inquiry, Science Process Skills, Student Evaluation
Peter A. Edelsbrunner; Bianca A. Simonsmeier; Michael Schneider – Educational Psychology Review, 2025
Knowledge is an important predictor and outcome of learning and development. Its measurement is challenged by the fact that knowledge can be integrated and homogeneous, or fragmented and heterogeneous, which can change through learning. These characteristics of knowledge are at odds with current standards for test development, demanding a high…
Descriptors: Meta Analysis, Predictor Variables, Learning Processes, Knowledge Level
Emma Walland – Research Matters, 2024
GCSE examinations (taken by students aged 16 years in England) are not intended to be speeded (i.e. to be partly a test of how quickly students can answer questions). However, there has been little research exploring this. The aim of this research was to explore the speededness of past GCSE written examinations, using only the data from scored…
Descriptors: Educational Change, Test Items, Item Analysis, Scoring
Laura A. Outhwaite; Pirjo Aunio; Jaimie Ka Yu Leung; Jo Van Herwegen – Educational Psychology Review, 2024
Successful early mathematical development is vital to children's later education, employment, and wellbeing outcomes. However, established measurement tools are infrequently used to (i) assess children's mathematical skills and (ii) identify children with or at-risk of mathematical learning difficulties. In response, this pre-registered systematic…
Descriptors: Mathematics Tests, Screening Tests, Mathematics Skills, At Risk Students