Ingrisone, Soo Jeong; Ingrisone, James N. – Educational Measurement: Issues and Practice, 2023
There has been a growing interest in approaches based on machine learning (ML) for detecting test collusion as an alternative to the traditional methods. Clustering analysis under an unsupervised learning technique appears especially promising to detect group collusion. In this study, the effectiveness of hierarchical agglomerative clustering…
Descriptors: Identification, Cooperation, Computer Assisted Testing, Artificial Intelligence
Mo Zhang; Paul Deane; Andrew Hoang; Hongwen Guo; Chen Li – Educational Measurement: Issues and Practice, 2025
In this paper, we describe two empirical studies that demonstrate the application and modeling of keystroke logs in writing assessments. We illustrate two different approaches of modeling differences in writing processes: analysis of mean differences in handcrafted theory-driven features and use of large language models to identify stable personal…
Descriptors: Writing Tests, Computer Assisted Testing, Keyboarding (Data Entry), Writing Processes
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet the evaluation of item quality remains a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation