ERIC - Search Results

Publication Date

In 2025	4
Since 2024	6

Descriptor

Classification	6
Computer Assisted Testing	6
Accuracy	5
Adaptive Testing	2
Diagnostic Tests	2
Information Security	2
Scoring	2
Test Format	2
Achievement Tests	1
Active Learning	1
Algorithms	1
Artificial Intelligence	1
Automation	1
Bayesian Statistics	1
Brain Hemisphere Functions	1
College Students	1
Computational Linguistics	1
Computer Games	1
Computer Science Education	1
Computer Software	1
Contrastive Linguistics	1
Design	1
Educational Assessment	1
Educational Research	1
Educational Strategies	1
More ▼

Source

ACM Transactions on Computing…	1
Assessment for Effective…	1
Journal of Educational…	1
Journal of Educational and…	1
ProQuest LLC	1
Second Language Research	1

Author

Alex J. Mechaber	1
Benjamin G. Solomon	1
Brian E. Clauser	1
Chia-Hsuan Liao	1
Diana Franklin	1
Ellen Lau	1
Erica Goodwin	1
Grace Williams	1
Hongxuan Chen	1
Jing Ma	1
Jonathan Liu	1
Kai North	1
Kayla V. Campaña	1
Le An Ha	1
Peter Baldwin	1
Seth Poulsen	1
Susu Zhang	1
Victoria Yaneva	1
Yael Gertner	1
Yang Du	1
Yiyun Zhou	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	4
Dissertations/Theses -…	1
Information Analyses	1

Education Level

Higher Education	2
Postsecondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Intermediate Grades	1
Middle Schools	1
Primary Education	1

Audience

Location

New York	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 6 results Save | Export

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Direct link

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Teaching Algorithm Design: A Literature Review

Peer reviewed

Direct link

Jonathan Liu; Seth Poulsen; Erica Goodwin; Hongxuan Chen; Grace Williams; Yael Gertner; Diana Franklin – ACM Transactions on Computing Education, 2025

Algorithm design is a vital skill developed in most undergraduate Computer Science (CS) programs, but few research studies focus on pedagogy related to algorithms coursework. To understand the work that has been done in the area, we present a systematic survey and literature review of CS Education studies. We search for research that is both…

Descriptors: Teaching Methods, Algorithms, Design, Computer Science Education

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Direct link

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

Classification Accuracy of i-Ready and Prior Year State Exams on Year-End Outcomes

Peer reviewed

Direct link

Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025

The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…

Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing

ERP Sensitivity to Subcategorization Violations in L2 Learners

Peer reviewed

Direct link

Chia-Hsuan Liao; Ellen Lau – Second Language Research, 2024

Event concepts of common verbs (e.g. "eat," "sleep") can be broadly shared across languages, but a given language's rules for subcategorization are largely arbitrary and vary substantially across languages. When subcategorization information does not match between first language (L1) and second language (L2), how does this…

Descriptors: Verbs, Brain Hemisphere Functions, Diagnostic Tests, English