ERIC - Search Results

Publication Date

In 2025	4
Since 2024	11
Since 2021 (last 5 years)	40
Since 2016 (last 10 years)	59
Since 2006 (last 20 years)	80

Descriptor

Computer Software	121
Item Analysis	121
Test Items	64
Computer Assisted Testing	33
Foreign Countries	31
Comparative Analysis	27
Item Response Theory	26
Difficulty Level	22
Test Construction	19
Evaluation Methods	17
Latent Trait Theory	16
Scoring	16
Models	15
Multiple Choice Tests	14
Second Language Learning	14
Accuracy	13
Correlation	13
Factor Analysis	12
Scores	12
Statistical Analysis	12
Computational Linguistics	11
Language Tests	11
Mathematical Models	11
Psychometrics	11
Artificial Intelligence	10
More ▼

Publication Type

Journal Articles	89
Reports - Research	69
Reports - Descriptive	24
Reports - Evaluative	15
Speeches/Meeting Papers	15
Tests/Questionnaires	7
Information Analyses	5
Numerical/Quantitative Data	4
Computer Programs	3
Book/Product Reviews	2
Guides - Non-Classroom	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Opinion Papers	1
More ▼

Education Level

Higher Education	19
Postsecondary Education	17
Secondary Education	7
Elementary Education	6
Early Childhood Education	4
Elementary Secondary Education	4
Primary Education	4
Middle Schools	3
Adult Education	2
Grade 3	2
High Schools	2
Kindergarten	2
Grade 10	1
Grade 11	1
Grade 2	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
More ▼

Audience

Researchers	12
Practitioners	3
Students	2

Location

China	3
Germany	2
Japan	2
Turkey	2
United Kingdom	2
Australia	1
Canada	1
Colombia	1
Czech Republic	1
Denmark	1
Hong Kong	1
Indonesia	1
Iran	1
Iraq	1
Italy	1
Kenya	1
Maryland	1
Maryland (Baltimore)	1
Nigeria	1
Oman	1
Saudi Arabia	1
Spain	1
Taiwan	1
Texas	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

International English…	2
Program for International…	2
Test of English as a Foreign…	2
Trends in International…	2
Law School Admission Test	1
Michigan Test of English…	1
National Assessment of…	1
Peabody Picture Vocabulary…	1
Rosenberg Self Esteem Scale	1
SAT (College Admission Test)	1
State of Texas Assessments of…	1
Wechsler Adult Intelligence…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 121 results Save | Export

Using SAS PROC IRT for Multidimensional Item Response Theory Analysis

Peer reviewed

Direct link

Cole, Ki; Paek, Insu – Measurement: Interdisciplinary Research and Perspectives, 2022

Statistical Analysis Software (SAS) is a widely used tool for data management analysis across a variety of fields. The procedure for item response theory (PROC IRT) is one to perform unidimensional and multidimensional item response theory (IRT) analysis for dichotomous and polytomous data. This review provides a summary of the features of PROC…

Descriptors: Item Response Theory, Computer Software, Item Analysis, Statistical Analysis

Essentials of Visual Diagnosis of Test Items. Logical, Illogical, and Anomalous Patterns in Tests Items to Be Detected

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

This article discusses visual techniques for detecting test items that would be optimal to be selected to the final compilation on the one hand and, on the other hand, to out-select those items that would lower the quality of the compilation. Some classic visual tools are discussed, first, in a practical manner in diagnosing the logical,…

Descriptors: Test Items, Item Analysis, Item Response Theory, Cutting Scores

Cognitive Diagnosis Testlet Model for Multiple-Choice Items

Peer reviewed

Direct link

Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024

The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…

Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software

Claude, ChatGPT, Copilot, and Gemini Performance versus Students in Different Topics of Neuroscience

Peer reviewed

Direct link

Volodymyr Mavrych; Ahmed Yaqinuddin; Olena Bolgova – Advances in Physiology Education, 2025

Despite extensive studies on large language models and their capability to respond to questions from various licensed exams, there has been limited focus on employing chatbots for specific subjects within the medical curriculum, specifically medical neuroscience. This research compared the performances of Claude 3.5 Sonnet (Anthropic), GPT-3.5 and…

Descriptors: Artificial Intelligence, Computer Software, Neurosciences, Medical Education

Assessment of Large Language Models' Performances and Hallucinations for Chinese Postgraduate Medical Entrance Examination

Peer reviewed

Direct link

Hongfei Ye; Jian Xu; Danqing Huang; Meng Xie; Jinming Guo; Junrui Yang; Haiwei Bao; Mingzhi Zhang; Ce Zheng – Discover Education, 2025

This study evaluates Large language models (LLMs)' performance on Chinese Postgraduate Medical Entrance Examination (CPGMEE) as well as the hallucinations produced by LLMs and investigate their implications for medical education. We curated 10 trials of mock CPGMEE to evaluate the performances of 4 LLMs (GPT-4.0, ChatGPT, QWen 2.1 and Ernie 4.0).…

Descriptors: College Entrance Examinations, Foreign Countries, Computational Linguistics, Graduate Medical Education

Automatic Wordnet Construction and Its Application in Generating Distractors for Cloze Questions

Direct link

Yicheng Sun – ProQuest LLC, 2024

We study how to automatically generate cloze questions from given texts to assess reading comprehension, where a cloze question consists of a stem with a blank space holder for the answer key, and three distractors for generating confusions. We present a generative method called CQG (Cloze Question Generator) for constructing cloze questions from…

Descriptors: Cloze Procedure, Reading Processes, Questioning Techniques, Computational Linguistics

An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure

Peer reviewed
PDF on ERIC

Download full text

Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022

This study aims to analyze the differential bundle functioning in multidimensional tests with a specific purpose to detect this effect through differentiating the location of the item with DIF in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…

Descriptors: Correlation, Sample Size, Test Items, Item Analysis

NLP-Based Management of Large Multiple-Choice Test Item Repositories

Peer reviewed
PDF on ERIC

Download full text

Valentina Albano; Donatella Firmani; Luigi Laura; Jerin George Mathew; Anna Lucia Paoletti; Irene Torrente – Journal of Learning Analytics, 2023

Multiple-choice questions (MCQs) are widely used in educational assessments and professional certification exams. Managing large repositories of MCQs, however, poses several challenges due to the high volume of questions and the need to maintain their quality and relevance over time. One of these challenges is the presence of questions that…

Descriptors: Natural Language Processing, Multiple Choice Tests, Test Items, Item Analysis

Get REAL: Development and Validation of the Rubric for the Evaluation of Apps for Learning

Peer reviewed

Direct link

Melanie Ann Weber; Mia Anzilotti; Reece Gormley; Christina Huber; Alyssa McGarvey; Grace McKee; Claire Ogden; Hannah Seinfeld; Julia Wank; Arnold Olszewski – Perspectives of the ASHA Special Interest Groups, 2024

Purpose: Technology, including educational applications (apps), is commonly used in schools by teachers and speech-language pathologists. Nonetheless, very little research has examined the efficacy of these apps for student learning or how to choose appropriate apps for instruction. Several previous rubrics to evaluate the instructional quality of…

Descriptors: Computer Software, Handheld Devices, Educational Technology, Technology Uses in Education

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Direct link

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Evaluation of Polytomous Item Locations in Multicomponent Measuring Instruments: A Note on a Latent Variable Modeling Procedure

Peer reviewed

Direct link

Raykov, Tenko; Pusic, Martin – Educational and Psychological Measurement, 2023

This note is concerned with evaluation of location parameters for polytomous items in multiple-component measuring instruments. A point and interval estimation procedure for these parameters is outlined that is developed within the framework of latent variable modeling. The method permits educational, behavioral, biomedical, and marketing…

Descriptors: Item Analysis, Measurement Techniques, Computer Software, Intervals

Answer Changing Behaviors and Performance in a First-Year Medical Gross and Developmental Anatomy Course

Peer reviewed
PDF on ERIC

Download full text

Marli Crabtree; Kenneth L. Thompson; Ellen M. Robertson – HAPS Educator, 2024

Research has suggested that changing one's answer on multiple-choice examinations is more likely to lead to positive academic outcomes. This study aimed to further understand the relationship between changing answer selections and item attributes, student performance, and time within a population of 158 first-year medical students enrolled in a…

Descriptors: Anatomy, Science Tests, Medical Students, Medical Education

Evaluating the Effectiveness of a Computerized Achievement Test Using Learn Smart for Psychometric Assessment under Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025

This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…

Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory

Figure-Based Approach in Creating ChatGPT-4o-Resistant Multiple-Choice Questions for Introductory Biology Courses: An Instructional Guide

Peer reviewed
PDF on ERIC

Download full text

Kyeng Gea Lee; Mark J. Lee; Soo Jung Lee – International Journal of Technology in Education and Science, 2024

Online assessment is an essential part of online education, and if conducted properly, has been found to effectively gauge student learning. Generally, textbased questions have been the cornerstone of online assessment. Recently, however, the emergence of generative artificial intelligence has added a significant challenge to the integrity of…

Descriptors: Artificial Intelligence, Computer Software, Biology, Science Instruction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Applied Psychological…	8
Educational and Psychological…	8
Measurement:…	5
IEEE Transactions on Learning…	3
International Educational…	3
International Journal of…	3
Language Testing	3
ETS Research Report Series	2
Educational Measurement:…	2
International Journal of…	2
International Journal of…	2
Journal of Education and…	2
Journal of Educational Data…	2
Journal of Educational and…	2
Language Assessment Quarterly	2
Online Submission	2
Partnership for Assessment of…	2
Physical Review Physics…	2
Practical Assessment,…	2
Psychometrika	2
Advances in Engineering…	1
Advances in Physiology…	1
Annual Review of Applied…	1
Assessment & Evaluation in…	1
Australasian Journal of…	1
More ▼

Raykov, Tenko	3
Aiken, Lewis R.	2
Bercovitz, Elizabeth	2
Brandt, Rusty	2
Hambleton, Ronald K.	2
Ishii, Takatoshi	2
Marcoulides, George A.	2
Papageorgiou, Spiros	2
Pelánek, Radek	2
Ueno, Maomi	2
Zimmerman, Linda	2
Adeleke, A. A.	1
Ahmed Al - Badri	1
Ahmed Yaqinuddin	1
Alammary, Ali	1
Alderson, J. Charles	1
Alghazali, Tawfeeq	1
Alhadi, Moosa A. A.	1
Altay, Firat	1
Alyssa McGarvey	1
Ames, Allison J.	1
Anna Lucia Paoletti	1
Applegate, Brooks	1
Arnold Olszewski	1
More ▼