Showing 1 to 15 of 29 results
Peer reviewed
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments puts fairness front and center. This systematic review examined how fairness in L2 assessments has been evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
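The DIF analysis surveyed in the review above is most often operationalized with the Mantel-Haenszel statistic, which compares a reference and a focal group's odds of answering an item correctly within matched ability strata. The sketch below is a minimal, hypothetical illustration of that computation; the counts are invented and do not come from any of the studies listed here.

```python
# Minimal sketch of the Mantel-Haenszel common odds ratio used in
# differential item functioning (DIF) analysis. All counts are invented.

def mantel_haenszel_odds_ratio(strata):
    """strata: list of (a, b, c, d) 2x2 tables, one per ability stratum,
    where a = reference-group correct, b = reference-group incorrect,
          c = focal-group correct,     d = focal-group incorrect."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return num / den

# One 2x2 table per total-score stratum (hypothetical counts):
tables = [(40, 10, 35, 15), (30, 20, 22, 28), (15, 35, 8, 42)]
alpha = mantel_haenszel_odds_ratio(tables)
print(round(alpha, 2))
```

A common odds ratio near 1.0 indicates little DIF; values well above or below 1.0 flag an item as favoring one group after conditioning on ability.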
Peer reviewed
Peter A. Edelsbrunner; Bianca A. Simonsmeier; Michael Schneider – Educational Psychology Review, 2025
Knowledge is an important predictor and outcome of learning and development. Its measurement is challenged by the fact that knowledge can be integrated and homogeneous or fragmented and heterogeneous, and that this can change through learning. These characteristics of knowledge are at odds with current standards for test development, demanding a high…
Descriptors: Meta Analysis, Predictor Variables, Learning Processes, Knowledge Level
Emma Walland – Research Matters, 2024
GCSE examinations (taken by students aged 16 years in England) are not intended to be speeded (i.e. to be partly a test of how quickly students can answer questions). However, there has been little research exploring this. The aim of this research was to explore the speededness of past GCSE written examinations, using only the data from scored…
Descriptors: Educational Change, Test Items, Item Analysis, Scoring
Peer reviewed
Cutumisu, Maria; Adams, Cathy; Lu, Chang – Journal of Science Education and Technology, 2019
Computational thinking (CT) is regarded as an essential twenty-first century competency and it is already embedded in K-12 curricula across the globe. However, research on assessing CT has lagged, with few assessments being implemented and validated. Moreover, there is a lack of systematic grouping of CT assessments. This scoping review examines…
Descriptors: Computation, Thinking Skills, 21st Century Skills, Elementary Secondary Education
Peer reviewed
Lakin, Joni M. – Educational Assessment, 2014
The purpose of test directions is to familiarize examinees with a test so that they respond to items in the manner intended. However, changes in educational measurement as well as the U.S. student population present new challenges to test directions and increase the impact that differential familiarity could have on the validity of test score…
Descriptors: Test Content, Test Construction, Best Practices, Familiarity
Peer reviewed
Kolstad, Rosemarie K.; Kolstad, Robert A. – Clearing House, 1982
Argues that multiple choice tests can be effective only if the items are written in a format suitable for testing the mastery of specific instructional objectives. Proposes the use of nonrestrictive test items and cites examples of such items. (FL)
Descriptors: Elementary Secondary Education, Multiple Choice Tests, Test Construction, Test Format
Peer reviewed
Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
Results of 96 theoretical/empirical studies were reviewed to see if they support a taxonomy of 43 rules for writing multiple-choice test items. The taxonomy is the result of an analysis of 46 textbooks dealing with multiple-choice item writing. For nearly half of the rules, no research was found. (SLD)
Descriptors: Classification, Literature Reviews, Multiple Choice Tests, Test Construction
Peer reviewed
Albanese, Mark A. – Educational Measurement: Issues and Practice, 1993
A comprehensive review is given of evidence bearing on the recommendation to avoid complex multiple choice (CMC) items. Avoiding Type K items (four primary responses and five secondary choices) seems warranted, but the evidence against CMC in general is less clear. (SLD)
Descriptors: Cues, Difficulty Level, Multiple Choice Tests, Responses
Peer reviewed
Joughin, Gordon – Assessment & Evaluation in Higher Education, 1998
Analysis of literature on oral assessment in college instruction identified six dimensions: primary content type; interaction between examiner and learner; authenticity of assessment task; structure of assessment task; examiner; and orality (extent to which knowledge is tested orally). These help in understanding the nature of oral assessment and…
Descriptors: College Instruction, Higher Education, Student Evaluation, Test Format
Peer reviewed
Kolstad, Rosemarie K.; Kolstad, Robert A. – Clearing House, 1994
Argues that multiple-choice tests can be effective only if the items are written in a format suitable for testing the mastery of specific instructional objectives. Proposes the use of nonrestrictive test items and cites examples of such items. (FL)
Descriptors: Elementary Secondary Education, Multiple Choice Tests, Student Evaluation, Test Construction
Haladyna, Thomas M.; Roid, Gale H. – Educational Technology, 1983
Summarizes item review in the development of criterion-referenced tests, including logical item review, which examines the match between instructional intent and the items; empirical item review, which examines response patterns; traditional item review; and instructional sensitivity of test items. Twenty-eight references are listed. (MBR)
Descriptors: Criterion Referenced Tests, Educational Research, Literature Reviews, Teaching Methods
Ellington, Henry – 1987
The second of three sequels to the booklet "Student Assessment," this booklet begins by describing and giving examples of three different forms that short-answer questions can take: (1) completion items; (2) unique-answer questions; and (3) open short-answer questions. Guidelines are then provided for deciding which type of question to…
Descriptors: Foreign Countries, Higher Education, Instructional Material Evaluation, Questioning Techniques
Peer reviewed
Knowles, Susan L.; Welch, Cynthia A. – Educational and Psychological Measurement, 1992
A meta-analysis of the difficulty and discrimination of the "none-of-the-above" (NOTA) test option was conducted with 12 articles (20 effect sizes) for difficulty and 7 studies (11 effect sizes) for discrimination. Findings indicate that using the NOTA option does not result in items of lesser quality. (SLD)
Descriptors: Difficulty Level, Effect Size, Meta Analysis, Multiple Choice Tests
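The two item indices that the meta-analysis above compares can be illustrated with a short sketch: classical item difficulty (proportion of examinees answering correctly) and item discrimination (the point-biserial correlation between the item score and the total test score). The responses below are invented for the example and are not drawn from any of the studies.

```python
# Hypothetical sketch of classical item difficulty and discrimination.
# Item scores are 0/1; total scores are each examinee's test total.
import statistics

def difficulty(item_scores):
    """Proportion correct (the classical p-value of an item)."""
    return sum(item_scores) / len(item_scores)

def point_biserial(item_scores, total_scores):
    """Point-biserial correlation between item score and total score."""
    p = difficulty(item_scores)
    mean_total = statistics.fmean(total_scores)
    sd_total = statistics.pstdev(total_scores)
    mean_correct = statistics.fmean(
        t for s, t in zip(item_scores, total_scores) if s == 1)
    return (mean_correct - mean_total) / sd_total * (p / (1 - p)) ** 0.5

item = [1, 1, 0, 1, 0, 1, 1, 0]     # invented 0/1 responses to one item
total = [9, 8, 4, 7, 5, 8, 6, 3]    # invented total test scores
print(round(difficulty(item), 2), round(point_biserial(item, total), 2))
```

In a study of the "none-of-the-above" option, these indices would be computed for matched items with and without NOTA; comparable difficulty and discrimination values are what supports the finding that NOTA does not degrade item quality.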
Ellington, Henry – 1987
The first of three sequels to the booklet "Student Assessment," this booklet begins by describing and providing examples of four different forms that objective questions can take: (1) conventional multiple choice questions; (2) true/false questions; (3) assertion/reason items; and (4) matching items. Guidance is offered on how to decide which type…
Descriptors: Foreign Countries, Higher Education, Instructional Material Evaluation, Objective Tests
Ellington, Henry – 1987
The third of three sequels to the booklet "Student Assessment," this booklet begins by describing and giving examples of three forms that essay-type questions can take: (1) unstructured-essay questions; (2) structured-essay questions; and (3) short-notes questions. Guidelines are then provided for deciding which type of question to use in a given…
Descriptors: Essay Tests, Foreign Countries, Higher Education, Instructional Material Evaluation