Publication Date
In 2025: 1
Since 2024: 2
Since 2021 (last 5 years): 13
Since 2016 (last 10 years): 24
Since 2006 (last 20 years): 39
Descriptor
Models: 56
Multiple Choice Tests: 56
Test Items: 56
Item Response Theory: 20
Test Construction: 20
Foreign Countries: 12
Difficulty Level: 10
Test Format: 10
Responses: 9
Comparative Analysis: 8
Item Analysis: 8
Publication Type
Journal Articles: 40
Reports - Research: 34
Reports - Evaluative: 15
Speeches/Meeting Papers: 8
Reports - Descriptive: 5
Non-Print Media: 1
Reference Materials - General: 1
Tests/Questionnaires: 1
Location
Canada: 3
Iran: 3
Taiwan: 2
California: 1
Europe: 1
Germany: 1
Indonesia: 1
Sweden: 1
Turkey: 1
United States: 1
Wisconsin: 1
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to determine the accuracy of estimating multiple-choice test item parameters under item response theory measurement models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
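The "absolute difference between the estimated and actual values" mentioned in this abstract can be read as a standard parameter-recovery criterion; the following is an illustrative form, not necessarily the authors' exact indicator:

\[ \text{accuracy}(\hat{\beta}_j) = \lvert \hat{\beta}_j - \beta_j \rvert, \qquad \text{MAE} = \frac{1}{J} \sum_{j=1}^{J} \lvert \hat{\beta}_j - \beta_j \rvert \]

where \(\beta_j\) is the true (generating) value of an item parameter such as difficulty or discrimination for item \(j\) and \(\hat{\beta}_j\) is its estimate; averaging over the \(J\) items gives a mean absolute error for each parameter type.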
Olney, Andrew M. – Grantee Submission, 2022
Multi-angle question answering models have recently been proposed that promise to perform related tasks like question generation. However, performance on related tasks has not been thoroughly studied. We investigate a leading model called Macaw on the task of multiple choice question generation and evaluate its performance on three angles that…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Models
Andrew M. Olney – Grantee Submission, 2023
Multiple choice questions are traditionally expensive to produce. Recent advances in large language models (LLMs) have led to fine-tuned LLMs that generate questions competitive with human-authored questions. However, the relative capabilities of ChatGPT-family models have not yet been established for this task. We present a carefully controlled…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Algorithms
Davison, Mark L.; Davenport, Ernest C., Jr.; Jia, Hao; Seipel, Ben; Carlson, Sarah E. – Grantee Submission, 2022
A regression model of predictor trade-offs is described. Each regression parameter equals the expected change in Y obtained by trading 1 point from one predictor to a second predictor. The model applies to predictor variables that sum to a constant T for all observations; for example, proportions summing to T=1.0 or percentages summing to T=100…
Descriptors: Regression (Statistics), Prediction, Predictor Variables, Models
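Restating the constraint described in this abstract in generic notation (not the authors' own): when the predictors sum to a constant, one predictor must be omitted to avoid exact collinearity, and each remaining coefficient becomes a one-point trade-off effect:

\[ \sum_{j=1}^{p} x_{ij} = T \ \text{ for every observation } i, \qquad Y_i = \beta_0 + \sum_{j \ne k} \beta_j x_{ij} + \varepsilon_i . \]

Because a one-unit increase in \(x_{ij}\) forces a one-unit decrease in the omitted predictor \(x_{ik}\), each \(\beta_j\) can be read as the expected change in \(Y\) from trading one point of predictor \(k\) for one point of predictor \(j\).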
Mead, Alan D.; Zhou, Chenxuan – Journal of Applied Testing Technology, 2022
This study fit a Naïve Bayesian classifier to the words of exam items to predict the Bloom's taxonomy level of the items. We addressed five research questions, showing that reasonably good prediction of Bloom's level was possible, but that accuracy varied across levels. In our study, performance for Level 2 was poor (Level 2 items were misclassified…
Descriptors: Artificial Intelligence, Prediction, Taxonomy, Natural Language Processing
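As a rough, generic sketch of the kind of bag-of-words Naïve Bayes pipeline this abstract describes (using scikit-learn; the item stems, Bloom's levels, and settings below are invented placeholders, not the authors' data or exact model):

    # Illustrative only: predict a Bloom's taxonomy level from the words of an
    # exam item with a multinomial Naive Bayes classifier. The training items
    # and labels are hypothetical placeholders.
    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.naive_bayes import MultinomialNB
    from sklearn.pipeline import make_pipeline

    item_stems = [
        "Define the term standard deviation.",                     # recall-style stem
        "Explain why the sample mean is an unbiased estimator.",   # comprehension-style stem
        "Design a sampling plan for the survey described below.",  # application-style stem
    ]
    bloom_levels = [1, 2, 3]  # hypothetical labels for the stems above

    model = make_pipeline(CountVectorizer(lowercase=True), MultinomialNB())
    model.fit(item_stems, bloom_levels)

    print(model.predict(["Explain why this estimator is consistent."]))

With realistic training data, per-level accuracy can then be inspected with a confusion matrix, which is how uneven performance across levels (as reported for Level 2) would show up.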
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
Hansen, John; Stewart, John – Physical Review Physics Education Research, 2021
This work is the fourth of a series of papers applying multidimensional item response theory (MIRT) to widely used physics conceptual assessments. This study applies MIRT analysis using both exploratory and confirmatory methods to the Brief Electricity and Magnetism Assessment (BEMA) to explore the assessment's structure and to determine a…
Descriptors: Item Response Theory, Science Tests, Energy, Magnets
Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023
Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…
Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models
Rao, Dhawaleswar; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2020
Automatic multiple choice question (MCQ) generation from a text is a popular research area. MCQs are widely accepted for large-scale assessment in various domains and applications. However, manual generation of MCQs is expensive and time-consuming. Therefore, researchers have been attracted toward automatic MCQ generation since the late 1990s.…
Descriptors: Multiple Choice Tests, Test Construction, Automation, Computer Software
Laliyo, Lukman Abdul Rauf; Hamdi, Syukrul; Pikoli, Masrid; Abdullah, Romario; Panigoro, Citra – European Journal of Educational Research, 2021
One of the issues that hinder students' learning progress is the inability to construct an epistemological explanation of a scientific phenomenon. A four-tier multiple-choice (hereinafter, 4TMC) instrument and the Partial-Credit Model were employed to elaborate on the diagnosis process of the aforementioned problem. This study aimed to develop and…
Descriptors: Learning Processes, Multiple Choice Tests, Models, Test Items
Wu, Qian; De Laet, Tinne; Janssen, Rianne – Journal of Educational Measurement, 2019
Single-best answers to multiple-choice items are commonly dichotomized into correct and incorrect responses, and modeled using either a dichotomous item response theory (IRT) model or a polytomous one if differences among all response options are to be retained. The current study presents an alternative IRT-based modeling approach to…
Descriptors: Multiple Choice Tests, Item Response Theory, Test Items, Responses
Langbeheim, Elon; Ben-Eliyahu, Einat; Adadan, Emine; Akaygun, Sevil; Ramnarain, Umesh Dewnarain – Chemistry Education Research and Practice, 2022
Learning progressions (LPs) are novel models for the development of assessments in science education that often use a scale to categorize students' levels of reasoning. Pictorial representations are important in chemistry teaching and learning, and also in LPs, but the differences between pictorial and verbal items in chemistry LPs are unclear. In…
Descriptors: Science Instruction, Learning Trajectories, Chemistry, Thinking Skills
Liao, Xiangyi; Bolt, Daniel M. – Journal of Educational and Behavioral Statistics, 2021
Four-parameter models have received increasing psychometric attention in recent years, as a reduced upper asymptote for item characteristic curves can be appealing for measurement applications such as adaptive testing and person-fit assessment. However, applications can be challenging due to the large number of parameters in the model. In this…
Descriptors: Test Items, Models, Mathematics Tests, Item Response Theory
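For context on the "reduced upper asymptote" mentioned in this abstract, the textbook four-parameter logistic item characteristic curve (a standard form, not necessarily the exact parameterization used in the article) is

\[ P(X_{ij} = 1 \mid \theta_i) = c_j + (d_j - c_j)\,\frac{1}{1 + \exp[-a_j(\theta_i - b_j)]}, \]

where \(a_j\), \(b_j\), \(c_j\), and \(d_j\) are the discrimination, difficulty, lower-asymptote (guessing), and upper-asymptote (slipping) parameters of item \(j\); fixing \(d_j = 1\) recovers the three-parameter model.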
Panahi, Ali; Mohebbi, Hassan – Language Teaching Research Quarterly, 2022
High-stakes testing, such as IELTS, is designed to select individuals for decision-making purposes (Fulcher, 2013b). Hence, there is a slow-growing stream of research investigating the subskills of IELTS listening and, in feedback terms, its effects on individuals and educational programs. Here, cognitive diagnostic assessment (CDA) performs it…
Descriptors: Decision Making, Listening Comprehension Tests, Multiple Choice Tests, Diagnostic Tests
Andrich, David; Marais, Ida – Journal of Educational Measurement, 2018
Even though guessing biases difficulty estimates as a function of item difficulty in the dichotomous Rasch model, assessment programs with tests which include multiple-choice items often construct scales using this model. Research has shown that when all items are multiple-choice, this bias can largely be eliminated. However, many assessments have…
Descriptors: Multiple Choice Tests, Test Items, Guessing (Tests), Test Bias
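For reference, the dichotomous Rasch model discussed in this abstract has the standard form

\[ P(X_{ij} = 1 \mid \theta_i, b_j) = \frac{\exp(\theta_i - b_j)}{1 + \exp(\theta_i - b_j)}, \]

with person ability \(\theta_i\), item difficulty \(b_j\), and no lower asymptote; successful guessing on multiple-choice items is therefore absorbed into the difficulty estimates rather than modeled separately, which is the source of the bias the authors examine.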