ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	16
Since 2006 (last 20 years)	21

Descriptor

Comparative Analysis	27
Computer Software	27
Item Analysis	27
Test Items	18
Computer Assisted Testing	10
Foreign Countries	7
Accuracy	6
Item Response Theory	6
Models	6
Correlation	5
Computational Linguistics	4
Difficulty Level	4
Evaluators	4
Latent Trait Theory	4
Mathematics Tests	4
Scores	4
Classification	3
Computer Simulation	3
Educational Assessment	3
Elementary School Students	3
English (Second Language)	3
Equated Scores	3
Error Patterns	3
Estimation (Mathematics)	3
Factor Analysis	3
More ▼

Publication Type

Reports - Research	19
Journal Articles	18
Speeches/Meeting Papers	5
Reports - Descriptive	3
Reports - Evaluative	3
Tests/Questionnaires	3
Books	1
Guides - Non-Classroom	1

Education Level

Higher Education	6
Postsecondary Education	5
Elementary Education	3
Early Childhood Education	2
Elementary Secondary Education	2
Primary Education	2
Grade 2	1
Grade 3	1
High Schools	1
Kindergarten	1
Secondary Education	1
More ▼

Audience

Researchers	4
Practitioners	2
Students	2

Location

Japan	2
Czech Republic	1
Germany	1
Maryland	1
Maryland (Baltimore)	1
Saudi Arabia	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
SAT (College Admission Test)	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Cognitive Diagnosis Testlet Model for Multiple-Choice Items

Peer reviewed

Direct link

Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024

The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…

Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Comparison of R Packages for Automated Test Assembly with Mixed-Integer Linear Programming

Peer reviewed

Direct link

Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023

Many organizations utilize some form of automation in the test assembly process; either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…

Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models

Refining Semantic Similarity of Paraphasias Using a Contextual Language Model

Peer reviewed

Direct link

Salem, Alexandra C.; Gale, Robert; Casilio, Marianne; Fleegle, Mikala; Fergadiotis, Gerasimos; Bedrick, Steven – Journal of Speech, Language, and Hearing Research, 2023

Purpose: ParAlg (Paraphasia Algorithms) is a software that automatically categorizes a person with aphasia's naming error (paraphasia) in relation to its intended target on a picture-naming test. These classifications (based on lexicality as well as semantic, phonological, and morphological similarity to the target) are important for…

Descriptors: Semantics, Computer Software, Aphasia, Classification

How Do Physics Students Evaluate Artificial Intelligence Responses on Comprehension Questions? A Study on the Perceived Scientific Accuracy and Linguistic Quality of ChatGPT

Peer reviewed

Direct link

Dahlkemper, Merten Nikolay; Lahme, Simon Zacharias; Klein, Pascal – Physical Review Physics Education Research, 2023

This study aimed at evaluating how students perceive the linguistic quality and scientific accuracy of ChatGPT responses to physics comprehension questions. A total of 102 first- and second-year physics students were confronted with three questions of progressing difficulty from introductory mechanics (rolling motion, waves, and fluid dynamics).…

Descriptors: Physics, Science Instruction, Artificial Intelligence, Computer Software

Deliberate Practice of Spreadsheet Skills When Using Copiable, Randomized, and Auto-Graded Questions within an Interactive Textbook

Peer reviewed
PDF on ERIC

Download full text

Gorbett, Luke J.; Chapamn, Kayla E.; Liberatore, Matthew W. – Advances in Engineering Education, 2022

Spreadsheets are a core computational tool for practicing engineers and engineering students. While Microsoft Excel, Google Sheets, and other spreadsheet tools have some differences, numerous formulas, functions, and other tasks are common across versions and platforms. Building upon learning science frameworks showing that interactive activities…

Descriptors: Spreadsheets, Computer Software, Engineering Education, Textbooks

Sparse Factor Autoencoders for Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

PaaBen, Benjamin; Dywel, Malwina; Fleckenstein, Melanie; Pinkwart, Niels – International Educational Data Mining Society, 2022

Item response theory (IRT) is a popular method to infer student abilities and item difficulties from observed test responses. However, IRT struggles with two challenges: How to map items to skills if multiple skills are present? And how to infer the ability of new students that have not been part of the training data? Inspired by recent advances…

Descriptors: Item Response Theory, Test Items, Item Analysis, Inferences

Evaluation of Auto-Generated Distractors in Multiple Choice Questions from a Semantic Network

Peer reviewed

Direct link

Zhang, Lishan; VanLehn, Kurt – Interactive Learning Environments, 2021

Despite their drawback, multiple-choice questions are an enduring feature in instruction because they can be answered more rapidly than open response questions and they are easily scored. However, it can be difficult to generate good incorrect choices (called "distractors"). We designed an algorithm to generate distractors from a…

Descriptors: Semantics, Networks, Multiple Choice Tests, Teaching Methods

An Introduction to the Analysis of Ranked Response Data

Peer reviewed
PDF on ERIC

Download full text

Finch, Holmes – Practical Assessment, Research & Evaluation, 2022

Researchers in many disciplines work with ranking data. This data type is unique in that it is often deterministic in nature (the ranks of items "k"-1 determine the rank of item "k"), and the difference in a pair of rank scores separated by "k" units is equivalent regardless of the actual values of the two ranks in…

Descriptors: Data Analysis, Statistical Inference, Models, College Faculty

The Influence of Interactive Features in Storybook Apps on Children's Reading Comprehension and Story Enjoyment

Peer reviewed

Direct link

Son, Seung-Hee Claire; Butcher, Kirsten R.; Liang, Lauren Aimonette – Elementary School Journal, 2020

This study investigates how interactive features embedded in the illustrations of storybook apps influenced young readers' story enjoyment and comprehension. Kindergartners and second graders (N = 91) were randomly assigned to read storybook apps in an interactive or noninteractive condition. Findings showed that children's self-reported enjoyment…

Descriptors: Computer Software, Reading Comprehension, Preferences, Recall (Psychology)

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

A Comparability Study of Text Difficulty and Task Characteristics of Parallel Academic IELTS Reading Tests

Peer reviewed
PDF on ERIC

Download full text

Liao, Linyu – English Language Teaching, 2020

As a high-stakes standardized test, IELTS is expected to have comparable forms of test papers so that test takers from different test administration on different dates receive comparable test scores. Therefore, this study examined the text difficulty and task characteristics of four parallel academic IELTS reading tests to reveal to what extent…

Descriptors: Second Language Learning, English (Second Language), Language Tests, High Stakes Tests

Assessing Rasch Measurement Estimation Methods across R Packages with Yes/No Vocabulary Test Data

Peer reviewed

Direct link

Nicklin, Christopher; Vitta, Joseph P. – Language Testing, 2022

Instrument measurement conducted with Rasch analysis is a common process in language assessment research. A recent systematic review of 215 studies involving Rasch analysis in language testing and applied linguistics research reported that 23 different software packages had been utilized. However, none of the analyses were conducted with one of…

Descriptors: Programming Languages, Vocabulary Development, Language Tests, Computer Software

Maximum Clique Algorithm and Its Approximation for Uniform Test Form Assembly

Peer reviewed

Direct link

Ishii, Takatoshi; Songmuang, Pokpong; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2014

Educational assessments occasionally require uniform test forms for which each test form comprises a different set of items, but the forms meet equivalent test specifications (i.e., qualities indicated by test information functions based on item response theory). We propose two maximum clique algorithms (MCA) for uniform test form assembly. The…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Previous Page | Next Page »

Pages: 1 | 2

International Educational…	3
Applied Psychological…	2
IEEE Transactions on Learning…	2
Advances in Engineering…	1
ETS Research Report Series	1
Educational and Psychological…	1
Elementary School Journal	1
English Language Teaching	1
Interactive Learning…	1
International Journal of…	1
Journal of Educational and…	1
Journal of Speech, Language,…	1
Language Testing	1
Measurement:…	1
Partnership for Assessment of…	1
Physical Review Physics…	1
Practical Assessment,…	1
Psychometrika	1
Routledge, Taylor & Francis…	1
More ▼

Ishii, Takatoshi	2
Ueno, Maomi	2
Bedrick, Steven	1
Bercovitz, Elizabeth	1
Brandt, Rusty	1
Breyer, F. Jay	1
Butcher, Kirsten R.	1
Casilio, Marianne	1
Chapamn, Kayla E.	1
Dahlkemper, Merten Nikolay	1
Dalbudak, Ibrahim	1
Dorak, Feridun	1
Dywel, Malwina	1
Fergadiotis, Gerasimos	1
Finch, Holmes	1
Fleckenstein, Melanie	1
Fleegle, Mikala	1
Fuchimoto, Kazuma	1
Gale, Robert	1
Gialluca, Kathleen A.	1
Gorbett, Luke J.	1
Gürkan, Alper C.	1
Hart, Robert S.	1
Hazar, Gürkan	1
More ▼