ERIC - Search Results

Showing 1 to 15 of 37 results

Cognitive Diagnosis Testlet Model for Multiple-Choice Items

Peer reviewed

Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024

The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…

Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software

A Comparative Study of AI-Human-Made and Human-Made Test Forms for a University TESOL Theory Course

Peer reviewed

Kyung-Mi O. – Language Testing in Asia, 2024

This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…

Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Comparison of R Packages for Automated Test Assembly with Mixed-Integer Linear Programming

Peer reviewed

Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023

Many organizations utilize some form of automation in the test assembly process; either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…

Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models

ChatGPT-4o, ChatGPT-4 and Google Gemini are Compared with Students: A Study in Higher Education

Peer reviewed

Harun Bayer; Fazilet Gül Ince Araci; Gülsah Gürkan – International Journal of Technology in Education and Science, 2024

The rapid advancement of artificial intelligence technologies, their pervasive use in every field, and the growing understanding of the benefits they bring have led actors in the education sector to pursue research in this field. In particular, the use of artificial intelligence tools has become more prevalent in the education sector due to the…

Descriptors: Artificial Intelligence, Computer Software, Computational Linguistics, Technology Uses in Education

Deliberate Practice of Spreadsheet Skills When Using Copiable, Randomized, and Auto-Graded Questions within an Interactive Textbook

Peer reviewed

Gorbett, Luke J.; Chapman, Kayla E.; Liberatore, Matthew W. – Advances in Engineering Education, 2022

Spreadsheets are a core computational tool for practicing engineers and engineering students. While Microsoft Excel, Google Sheets, and other spreadsheet tools have some differences, numerous formulas, functions, and other tasks are common across versions and platforms. Building upon learning science frameworks showing that interactive activities…

Descriptors: Spreadsheets, Computer Software, Engineering Education, Textbooks

Sparse Factor Autoencoders for Item Response Theory

Peer reviewed

Paaßen, Benjamin; Dywel, Malwina; Fleckenstein, Melanie; Pinkwart, Niels – International Educational Data Mining Society, 2022

Item response theory (IRT) is a popular method to infer student abilities and item difficulties from observed test responses. However, IRT struggles with two challenges: How to map items to skills if multiple skills are present? And how to infer the ability of new students that have not been part of the training data? Inspired by recent advances…

Descriptors: Item Response Theory, Test Items, Item Analysis, Inferences

Evaluation of Auto-Generated Distractors in Multiple Choice Questions from a Semantic Network

Peer reviewed

Zhang, Lishan; VanLehn, Kurt – Interactive Learning Environments, 2021

Despite their drawback, multiple-choice questions are an enduring feature in instruction because they can be answered more rapidly than open response questions and they are easily scored. However, it can be difficult to generate good incorrect choices (called "distractors"). We designed an algorithm to generate distractors from a…

Descriptors: Semantics, Networks, Multiple Choice Tests, Teaching Methods

Calibrated Parsing Items Evaluation: A Step towards Objectifying the Translation Assessment

Peer reviewed

Akbari, Alireza; Shahnazari, Mohammadtaghi – Language Testing in Asia, 2019

The present research paper introduces a translation evaluation method called Calibrated Parsing Items Evaluation (CPIE hereafter). This evaluation method maximizes translators' performance through identifying the parsing items with an optimal p-docimology and d-index (item discrimination). This method checks all the possible parses (annotations)…

Descriptors: Test Items, Translation, Computer Software, Evaluators

Benthik Android Physics Comic Effectiveness for Vector Representation and Critical Thinking Students' Improvement

Peer reviewed

Maghfiroh, Anissa; Kuswanto, Heru – International Journal of Instruction, 2022

This research aims to reveal the effectiveness of the use of Kofie GeBoL media in improving (1) vector representation ability and (2) critical thinking ability in physics instruction. It is a descriptive quantitative study with the quasi-experiment design. It was conducted in two stages: empirical try out and implementation of Kofie GeboL to see…

Descriptors: Physics, Instructional Effectiveness, Critical Thinking, Thinking Skills

Evaluation of Automated Vocabulary Quiz Generation with VocQGen

Peer reviewed

Qiao Wang; Ralph L. Rose; Ayaka Sugawara; Naho Orita – Vocabulary Learning and Instruction, 2025

VocQGen is an automated tool designed to generate multiple-choice cloze (MCC) questions for vocabulary assessment in second language learning contexts. It leverages several natural language processing (NLP) tools and OpenAI's GPT-4 model to produce MCC items quickly from user-specified word lists. To evaluate its effectiveness, we used the first…

Descriptors: Vocabulary Skills, Artificial Intelligence, Computer Software, Multiple Choice Tests

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

A Comparability Study of Text Difficulty and Task Characteristics of Parallel Academic IELTS Reading Tests

Peer reviewed

Liao, Linyu – English Language Teaching, 2020

As a high-stakes standardized test, IELTS is expected to have comparable forms of test papers so that test takers from different test administration on different dates receive comparable test scores. Therefore, this study examined the text difficulty and task characteristics of four parallel academic IELTS reading tests to reveal to what extent…

Descriptors: Second Language Learning, English (Second Language), Language Tests, High Stakes Tests
