ERIC - Search Results

Publication Date

In 2025	5
Since 2024	6
Since 2021 (last 5 years)	19
Since 2016 (last 10 years)	32
Since 2006 (last 20 years)	51

Descriptor

Computer Assisted Testing	112
Adaptive Testing	55
Test Items	54
Test Construction	33
Item Response Theory	31
Simulation	24
Comparative Analysis	20
Item Banks	17
Scores	15
Scoring	14
Higher Education	13
Automation	12
Psychometrics	12
College Students	11
Item Analysis	11
Models	11
Test Format	11
Difficulty Level	9
Comparative Testing	8
Evaluation Methods	8
Statistical Analysis	8
College Entrance Examinations	7
Computer Simulation	7
Error Patterns	7
Mathematical Models	7
More ▼

Source

Journal of Educational…

112

Publication Type

Journal Articles	112
Reports - Research	65
Reports - Evaluative	34
Reports - Descriptive	8
Speeches/Meeting Papers	7
Book/Product Reviews	3
Information Analyses	2
Opinion Papers	1

Education Level

Higher Education	2
Elementary Education	1
Postsecondary Education	1

Audience

Researchers

Location

United Kingdom

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	4
Indiana Statewide Testing for…	2
Advanced Placement…	1
Law School Admission Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 112 results Save | Export

Detecting Group Collaboration Using Multiple Correspondence Analysis

Peer reviewed

Direct link

Grochowalski, Joseph H.; Hendrickson, Amy – Journal of Educational Measurement, 2023

Test takers wishing to gain an unfair advantage often share answers with other test takers, either sharing all answers (a full key) or some (a partial key). Detecting key sharing during a tight testing window requires an efficient, easily interpretable, and rich form of analysis that is descriptive and inferential. We introduce a detection method…

Descriptors: Identification, Cooperative Learning, Cheating, Statistical Analysis

Using Response Time in Multidimensional Computerized Adaptive Testing

Peer reviewed

Direct link

He, Yinhong; Qi, Yuanyuan – Journal of Educational Measurement, 2023

In multidimensional computerized adaptive testing (MCAT), item selection strategies are generally constructed based on responses, and they do not consider the response times required by items. This study constructed two new criteria (referred to as DT-inc and DT) for MCAT item selection by utilizing information from response times. The new designs…

Descriptors: Reaction Time, Adaptive Testing, Computer Assisted Testing, Test Items

A Generalized Objective Function for Computer Adaptive Item Selection

Peer reviewed

Direct link

Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025

Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Direct link

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Direct link

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Online Monitoring of Test-Taking Behavior Based on Item Responses and Response Times

Peer reviewed

Direct link

Han, Suhwa; Kang, Hyeon-Ah – Journal of Educational Measurement, 2023

The study presents multivariate sequential monitoring procedures for examining test-taking behaviors online. The procedures monitor examinee's responses and response times and signal aberrancy as soon as significant change is identifieddetected in the test-taking behavior. The study in particular proposes three schemes to track different…

Descriptors: Test Wiseness, Student Behavior, Item Response Theory, Computer Assisted Testing

Pretest Item Calibration in Computerized Multistage Adaptive Testing

Peer reviewed

Direct link

Ersen, Rabia Karatoprak; Lee, Won-Chan – Journal of Educational Measurement, 2023

The purpose of this study was to compare calibration and linking methods for placing pretest item parameter estimates on the item pool scale in a 1-3 computerized multistage adaptive testing design in terms of item parameter recovery. Two models were used: embedded-section, in which pretest items were administered within a separate module, and…

Descriptors: Pretesting, Test Items, Computer Assisted Testing, Adaptive Testing

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Direct link

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Direct link

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Detecting Differential Item Functioning in CAT Using IRT Residual DIF Approach

Peer reviewed

Direct link

Lim, Hwanggyu; Choe, Edison M. – Journal of Educational Measurement, 2023

The residual differential item functioning (RDIF) detection framework was developed recently under a linear testing context. To explore the potential application of this framework to computerized adaptive testing (CAT), the present study investigated the utility of the RDIF[subscript R] statistic both as an index for detecting uniform DIF of…

Descriptors: Test Items, Computer Assisted Testing, Item Response Theory, Adaptive Testing

Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation

Peer reviewed

Direct link

Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023

Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…

Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems

Score Comparability between Online Proctored and In-Person Credentialing Exams

Peer reviewed

Direct link

Jones, Paul; Tong, Ye; Liu, Jinghua; Borglum, Joshua; Primoli, Vince – Journal of Educational Measurement, 2022

This article studied two methods to detect mode effects in two credentialing exams. In Study 1, we used a "modal scale comparison approach," where the same pool of items was calibrated separately, without transformation, within two TC cohorts (TC1 and TC2) and one OP cohort (OP1) matched on their pool-based scale score distributions. The…

Descriptors: Scores, Credentials, Licensing Examinations (Professions), Computer Assisted Testing

The Automated Test Assembly and Routing Rule for Multistage Adaptive Testing with Multidimensional Item Response Theory

Peer reviewed

Direct link

Xu, Lingling; Wang, Shiyu; Cai, Yan; Tu, Dongbo – Journal of Educational Measurement, 2021

Designing a multidimensional adaptive test (M-MST) based on a multidimensional item response theory (MIRT) model is critical to make full use of the advantages of both MST and MIRT in implementing multidimensional assessments. This study proposed two types of automated test assembly (ATA) algorithms and one set of routing rules that can facilitate…

Descriptors: Item Response Theory, Adaptive Testing, Automation, Test Construction

Using Keystroke Behavior Patterns to Detect Nonauthentic Texts in Writing Assessments: Evaluating the Fairness of Predictive Models

Peer reviewed

Direct link

Yang Jiang; Mo Zhang; Jiangang Hao; Paul Deane; Chen Li – Journal of Educational Measurement, 2024

The emergence of sophisticated AI tools such as ChatGPT, coupled with the transition to remote delivery of educational assessments in the COVID-19 era, has led to increasing concerns about academic integrity and test security. Using AI tools, test takers can produce high-quality texts effortlessly and use them to game assessments. It is thus…

Descriptors: Integrity, Artificial Intelligence, Technology Uses in Education, Ethics

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Vispoel, Walter P.	7
Bennett, Randy Elliot	5
Chang, Hua-Hua	5
Bridgeman, Brent	4
Rock, Donald A.	4
Wainer, Howard	4
van der Linden, Wim J.	4
Bleiler, Timothy	3
Stocking, Martha L.	3
Tatsuoka, Kikumi K.	3
Veldkamp, Bernard P.	3
Wang, Tianyou	3
Bejar, Isaac I.	2
Cai, Yan	2
Chen, Shu-Ying	2
Choi, Ikkyu	2
Choi, Seung W.	2
Finkelman, Matthew	2
Kang, Hyeon-Ah	2
Kim, Dong-In	2
Lewis, Charles	2
Li, Jie	2
Luecht, Richard M.	2
Morley, Mary	2
More ▼