Ye Ma; Deborah J. Harris – Educational Measurement: Issues and Practice, 2025
Item position effect (IPE) refers to situations where an item performs differently when it is administered in different positions on a test. Most previous research has investigated IPE under linear testing, and little IPE research exists under adaptive testing. In addition, the existence of IPE might violate Item…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items
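The invariance concern this abstract alludes to can be made concrete with a small illustration. The sketch below is not from the article; it assumes a standard two-parameter logistic (2PL) IRT model and a purely hypothetical difficulty drift with item position, to show how the same examinee ability yields different response probabilities when a position effect is present.

```python
import numpy as np

def p_correct_2pl(theta, a, b):
    """Probability of a correct response under a 2PL IRT model."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Hypothetical illustration: if an item's effective difficulty drifts with its
# position (e.g., fatigue late in the test), the same examinee ability gives
# different response probabilities depending on where the item appears,
# contradicting the parameter-invariance assumption of IRT.
theta = 0.5                 # examinee ability (assumed)
a, b = 1.2, 0.0             # discrimination and baseline difficulty (assumed)
drift_per_position = 0.02   # hypothetical difficulty drift per position

for position in (1, 20, 40):
    b_pos = b + drift_per_position * (position - 1)
    print(position, round(p_correct_2pl(theta, a, b_pos), 3))
```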
Hwanggyu Lim; Kyung T. Han – Educational Measurement: Issues and Practice, 2024
Computerized adaptive testing (CAT) has gained deserved popularity in the administration of educational and professional assessments, but continues to face test security challenges. To ensure sustained quality assurance and testing integrity, it is imperative to establish and maintain multiple stable item pools that are consistent in terms of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
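As background for the CAT item-pool discussion above, the following sketch illustrates the basic adaptive item-selection mechanism the abstract refers to: choosing, from a pool of 2PL items, the unadministered item with maximum Fisher information at the current ability estimate. The pool parameters, pool size, and fixed ability estimate are assumptions for illustration, not details from the article.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical item pool: discrimination (a) and difficulty (b) parameters.
pool_a = rng.uniform(0.8, 2.0, size=200)
pool_b = rng.normal(0.0, 1.0, size=200)
administered = set()

def fisher_info_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def next_item(theta):
    """Pick the unused item with maximum information at the current ability."""
    info = fisher_info_2pl(theta, pool_a, pool_b)
    info[list(administered)] = -np.inf
    return int(np.argmax(info))

theta_hat = 0.0  # a real CAT would update this after each response; held fixed here
for _ in range(5):
    item = next_item(theta_hat)
    administered.add(item)
    print("administer item", item, "b =", round(pool_b[item], 2))
```

Repeated maximum-information selection is also why CAT programs need large, secure, and mutually consistent item pools: the most informative items are exposed most often.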
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet evaluating item quality remains a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of large language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
William Belzak; J. R. Lockwood; Yigal Attali – Educational Measurement: Issues and Practice, 2024
Remote proctoring, or monitoring test takers through internet-based, video-recording software, has become critical for maintaining test security on high-stakes assessments. The main role of remote proctors is to make judgments about test takers' behaviors and decide whether these behaviors constitute rule violations. Variability in proctor…
Descriptors: Computer Security, High Stakes Tests, English (Second Language), Second Language Learning
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found useful for reducing response biases in personality assessments. However, conventional scoring methods for MFC items yield ipsative data, hindering wider application of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment
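To show why conventional scoring of multidimensional forced-choice blocks yields ipsative data, as the abstract above notes, here is a small hypothetical example: each respondent ranks three statements per block, ranks are converted to points, and every respondent ends up with the same total score, so only within-person comparisons remain meaningful. The block structure and scoring rule are assumed for illustration and are not taken from the article.

```python
import numpy as np

# Hypothetical conventional scoring of forced-choice triplets:
# within each block, the statement ranked highest gets 2 points,
# the middle one 1, and the lowest 0.
rng = np.random.default_rng(1)
n_respondents, n_blocks = 4, 10
scores = np.zeros((n_respondents, 3))        # three traits, one per statement

for _ in range(n_blocks):
    ranks = np.argsort(rng.random((n_respondents, 3)), axis=1)  # random rankings
    points = np.empty_like(ranks)
    np.put_along_axis(points, ranks, np.array([0, 1, 2]), axis=1)
    scores += points

# Identical row totals: the hallmark of ipsative data.
print(scores.sum(axis=1))
```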
Stephen G. Sireci; Javier Suárez-Álvarez; April L. Zenisky; Maria Elena Oliveri – Educational Measurement: Issues and Practice, 2024
The goal in personalized assessment is to best fit the needs of each individual test taker, given the assessment purposes. Design-in-Real-Time (DIRTy) assessment reflects the progressive evolution in testing from a single test, to an adaptive test, to an adaptive assessment "system." In this article, we lay the foundation for DIRTy…
Descriptors: Educational Assessment, Student Needs, Test Format, Test Construction