ERIC - Search Results

Showing all 4 results

A Generalized Objective Function for Computer Adaptive Item Selection

Peer reviewed

Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025

Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Anchoring Validity Evidence for Automated Essay Scoring

Peer reviewed

Mark D. Shermis – Journal of Educational Measurement, 2022

One of the challenges of discussing validity arguments for machine scoring of essays centers on the absence of a commonly held definition and theory of good writing. At best, the algorithms attempt to measure select attributes of writing and calibrate them against human ratings with the goal of accurate prediction of scores for new essays.…

Descriptors: Scoring, Essays, Validity, Writing Evaluation
