ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	23
Since 2017 (last 10 years)	34
Since 2007 (last 20 years)	45

Descriptor

Computer Software	65
Item Analysis	65
Test Items	65
Computer Assisted Testing	26
Difficulty Level	21
Comparative Analysis	19
Item Response Theory	18
Foreign Countries	17
Test Construction	13
Multiple Choice Tests	12
Scoring	11
Accuracy	10
Achievement Tests	9
Classification	8
Computer Simulation	8
Correlation	8
Latent Trait Theory	8
Artificial Intelligence	7
English (Second Language)	7
Language Tests	7
Maximum Likelihood Statistics	7
Models	7
Programming	7
Second Language Learning	7
Simulation	7

Publication Type

Journal Articles	47
Reports - Research	41
Speeches/Meeting Papers	12
Reports - Descriptive	11
Reports - Evaluative	10
Information Analyses	3
Tests/Questionnaires	2
Books	1
Collected Works - General	1
Computer Programs	1
Numerical/Quantitative Data	1

Education Level

Higher Education	8
Postsecondary Education	7
Elementary Secondary Education	4
Secondary Education	4
Elementary Education	2
Adult Education	1
Early Childhood Education	1
Grade 4	1
Grade 5	1
Intermediate Grades	1
Kindergarten	1
Middle Schools	1
Primary Education	1

Audience

Researchers	6
Practitioners	2
Students	1

Location

China	2
Japan	2
Czech Republic	1
Indonesia	1
Italy	1
Nigeria	1
Oman	1
Saudi Arabia	1
Taiwan	1
Texas	1
United Kingdom	1
United Kingdom (England)	1

Assessments and Surveys

International English…	2
Trends in International…	2
Law School Admission Test	1
Michigan Test of English…	1
Peabody Picture Vocabulary…	1
Program for International…	1
State of Texas Assessments of…	1
Test of English as a Foreign…	1

Showing 1 to 15 of 65 results

Essentials of Visual Diagnosis of Test Items. Logical, Illogical, and Anomalous Patterns in Tests Items to Be Detected

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

This article discusses visual techniques for detecting test items that would be optimal to be selected to the final compilation on the one hand and, on the other hand, to out-select those items that would lower the quality of the compilation. Some classic visual tools are discussed, first, in a practical manner in diagnosing the logical,…

Descriptors: Test Items, Item Analysis, Item Response Theory, Cutting Scores

Cognitive Diagnosis Testlet Model for Multiple-Choice Items

Peer reviewed

Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024

The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…

Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software

Assessment of Large Language Models' Performances and Hallucinations for Chinese Postgraduate Medical Entrance Examination

Peer reviewed

Hongfei Ye; Jian Xu; Danqing Huang; Meng Xie; Jinming Guo; Junrui Yang; Haiwei Bao; Mingzhi Zhang; Ce Zheng – Discover Education, 2025

This study evaluates the performance of large language models (LLMs) on the Chinese Postgraduate Medical Entrance Examination (CPGMEE), as well as the hallucinations produced by LLMs, and investigates their implications for medical education. We curated 10 trials of mock CPGMEE to evaluate the performances of 4 LLMs (GPT-4.0, ChatGPT, QWen 2.1 and Ernie 4.0).…

Descriptors: College Entrance Examinations, Foreign Countries, Computational Linguistics, Graduate Medical Education

An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure

Peer reviewed

Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022

This study aims to analyze the differential bundle functioning in multidimensional tests with a specific purpose to detect this effect through differentiating the location of the item with DIF in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…

Descriptors: Correlation, Sample Size, Test Items, Item Analysis

NLP-Based Management of Large Multiple-Choice Test Item Repositories

Peer reviewed

Valentina Albano; Donatella Firmani; Luigi Laura; Jerin George Mathew; Anna Lucia Paoletti; Irene Torrente – Journal of Learning Analytics, 2023

Multiple-choice questions (MCQs) are widely used in educational assessments and professional certification exams. Managing large repositories of MCQs, however, poses several challenges due to the high volume of questions and the need to maintain their quality and relevance over time. One of these challenges is the presence of questions that…

Descriptors: Natural Language Processing, Multiple Choice Tests, Test Items, Item Analysis

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Answer Changing Behaviors and Performance in a First-Year Medical Gross and Developmental Anatomy Course

Peer reviewed

Marli Crabtree; Kenneth L. Thompson; Ellen M. Robertson – HAPS Educator, 2024

Research has suggested that changing one's answer on multiple-choice examinations is more likely to lead to positive academic outcomes. This study aimed to further understand the relationship between changing answer selections and item attributes, student performance, and time within a population of 158 first-year medical students enrolled in a…

Descriptors: Anatomy, Science Tests, Medical Students, Medical Education

Evaluating the Effectiveness of a Computerized Achievement Test Using Learn Smart for Psychometric Assessment under Item Response Theory

Peer reviewed

Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025

This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…

Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory

Figure-Based Approach in Creating ChatGPT-4o-Resistant Multiple-Choice Questions for Introductory Biology Courses: An Instructional Guide

Peer reviewed

Kyeng Gea Lee; Mark J. Lee; Soo Jung Lee – International Journal of Technology in Education and Science, 2024

Online assessment is an essential part of online education, and if conducted properly, has been found to effectively gauge student learning. Generally, text-based questions have been the cornerstone of online assessment. Recently, however, the emergence of generative artificial intelligence has added a significant challenge to the integrity of…

Descriptors: Artificial Intelligence, Computer Software, Biology, Science Instruction

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Comparison of R Packages for Automated Test Assembly with Mixed-Integer Linear Programming

Peer reviewed

Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023

Many organizations utilize some form of automation in the test assembly process; either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…

Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models

A Multilevel Mixture IRT Framework for Modeling Response Times as Predictors or Indicators of Response Engagement in IRT Models

Peer reviewed

Nagy, Gabriel; Ulitzsch, Esther – Educational and Psychological Measurement, 2022

Disengaged item responses pose a threat to the validity of the results provided by large-scale assessments. Several procedures for identifying disengaged responses on the basis of observed response times have been suggested, and item response theory (IRT) models for response engagement have been proposed. We outline that response time-based…

Descriptors: Item Response Theory, Hierarchical Linear Modeling, Predictor Variables, Classification

How to Use Academic and Digital Fingerprints to Catch and Eliminate Contract Cheating during Online Multiple-Choice Examinations: A Case Study

Peer reviewed

Emery-Wetherell, Meaghan; Wang, Ruoyao – Assessment & Evaluation in Higher Education, 2023

Over four semesters of a large introductory statistics course the authors found students were engaging in contract cheating on Chegg.com during multiple choice examinations. In this paper we describe our methodology for identifying, addressing and eventually eliminating cheating. We successfully identified 23 out of 25 students using a combination…

Descriptors: Computer Assisted Testing, Multiple Choice Tests, Cheating, Identification

Source

Educational and Psychological…	3
IEEE Transactions on Learning…	3
International Educational…	3
Applied Psychological…	2
ETS Research Report Series	2
International Journal of…	2
Journal of Educational and…	2
Language Assessment Quarterly	2
Language Testing	2
Measurement:…	2
Advances in Engineering…	1
Assessment & Evaluation in…	1
Collegiate Microcomputer	1
Discover Education	1
Educational Measurement:…	1
Educational Technology &…	1
English Language Teaching	1
Grantee Submission	1
HAPS Educator	1
Innovations in Education and…	1
Interactive Learning…	1
International Association for…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1

Author

Ishii, Takatoshi	2
Pelánek, Radek	2
Ueno, Maomi	2
Adeleke, A. A.	1
Ahmed Al - Badri	1
Aiken, Lewis R.	1
Alammary, Ali	1
Alderson, J. Charles	1
Alexander Kah	1
Alghazali, Tawfeeq	1
Alhadi, Moosa A. A.	1
Ames, Allison J.	1
Anna Lucia Paoletti	1
Atar, Hakan Yavuz	1
Aybek, Eren Can	1
Bakla, Arif	1
Baroroh, Kiromim	1
Beauchamp, David	1
Beerwinkle, Andrea	1
Bock, R. Darrell	1
Breyer, F. Jay	1
Ce Zheng	1
Chapamn, Kayla E.	1
Cobern, William W.	1
Connell, Michael L.	1