Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025
Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…
Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, and numbers and locations of polytomous items. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Chia-Hsuan Liao; Ellen Lau – Second Language Research, 2024
Event concepts of common verbs (e.g. "eat," "sleep") can be broadly shared across languages, but a given language's rules for subcategorization are largely arbitrary and vary substantially across languages. When subcategorization information does not match between first language (L1) and second language (L2), how does this…
Descriptors: Verbs, Brain Hemisphere Functions, Diagnostic Tests, English