ERIC - Search Results

Publication Type

Journal Articles	3
Reports - Research	3

Showing all 3 results

Hierarchical Agglomerative Clustering to Detect Test Collusion on Computer-Based Tests

Peer reviewed

Ingrisone, Soo Jeong; Ingrisone, James N. – Educational Measurement: Issues and Practice, 2023

There has been a growing interest in approaches based on machine learning (ML) for detecting test collusion as an alternative to the traditional methods. Clustering analysis under an unsupervised learning technique appears especially promising to detect group collusion. In this study, the effectiveness of hierarchical agglomerative clustering…

Descriptors: Identification, Cooperation, Computer Assisted Testing, Artificial Intelligence

Applications and Modeling of Keystroke Logs in Writing Assessments

Peer reviewed

Mo Zhang; Paul Deane; Andrew Hoang; Hongwen Guo; Chen Li – Educational Measurement: Issues and Practice, 2025

In this paper, we describe two empirical studies that demonstrate the application and modeling of keystroke logs in writing assessments. We illustrate two different approaches of modeling differences in writing processes: analysis of mean differences in handcrafted theory-driven features and use of large language models to identify stable personal…

Descriptors: Writing Tests, Computer Assisted Testing, Keyboarding (Data Entry), Writing Processes

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation