ERIC - Search Results

Showing 1 to 15 of 19 results

Investigating Approaches to Controlling Item Position Effects in Computerized Adaptive Tests

Peer reviewed

Ma, Ye; Harris, Deborah J. – Educational Measurement: Issues and Practice, 2025

Item position effect (IPE) refers to situations where an item performs differently when it is administered in different positions on a test. The majority of previous research studies have focused on investigating IPE under linear testing. There is a lack of IPE research under adaptive testing. In addition, the existence of IPE might violate Item…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

Item Selection Algorithm Based on Collaborative Filtering for Item Exposure Control

Peer reviewed

Pan, Yiqin; Livne, Oren; Wollack, James A.; Sinharay, Sandip – Educational Measurement: Issues and Practice, 2023

In computerized adaptive testing, overexposure of items in the bank is a serious problem and might result in item compromise. We develop an item selection algorithm that utilizes the entire bank well and reduces the overexposure of items. The algorithm is based on collaborative filtering and selects an item in two stages. In the first stage, a set…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

Exploration of Latent Structure in Test Revision and Review Log Data

Peer reviewed

Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023

In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…

Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items

An Automated Item Pool Assembly Framework for Maximizing Item Utilization for CAT

Peer reviewed

Lim, Hwanggyu; Han, Kyung T. – Educational Measurement: Issues and Practice, 2024

Computerized adaptive testing (CAT) has gained deserved popularity in the administration of educational and professional assessments, but continues to face test security challenges. To ensure sustained quality assurance and testing integrity, it is imperative to establish and maintain multiple stable item pools that are consistent in terms of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Gorgun, Guher; Bulut, Okan – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet the evaluation of item quality remains a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Mode Effects in College Admissions Testing and Differential Speededness as a Possible Explanation

Peer reviewed

Steedle, Jeffrey T.; Cho, Young Woo; Wang, Shichao; Arthur, Ann M.; Li, Dongmei – Educational Measurement: Issues and Practice, 2022

As testing programs transition from paper to online testing, they must study mode comparability to support the exchangeability of scores from different testing modes. To that end, a series of three mode comparability studies was conducted during the 2019-2020 academic year with examinees randomly assigned to take the ACT college admissions exam on…

Descriptors: College Entrance Examinations, Computer Assisted Testing, Scores, Test Format

The Effect of Drag-and-Drop Item Features on Test-Taker Performance and Response Strategies

Peer reviewed

Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020

Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…

Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making

Development and Validation of an Automatic Item Generation System for English Idioms

Peer reviewed

Rafatbakhsh, Elaheh; Ahmadi, Alireza; Moloodi, Amirsaeid; Mehrpour, Saeed – Educational Measurement: Issues and Practice, 2021

Test development is a crucial, yet difficult and time-consuming part of any educational system, and the task often falls all on teachers. Automatic item generation systems have recently drawn attention as they can reduce this burden and make test development more convenient. Such systems have been developed to generate items for vocabulary,…

Descriptors: Test Construction, Test Items, Computer Assisted Testing, Multiple Choice Tests

Construct Equivalence of PISA Reading Comprehension Measured with Paper-Based and Computer-Based Assessments

Peer reviewed

Kroehne, Ulf; Buerger, Sarah; Hahnel, Carolin; Goldhammer, Frank – Educational Measurement: Issues and Practice, 2019

For many years, reading comprehension in the Programme for International Student Assessment (PISA) was measured via paper-based assessment (PBA). In the 2015 cycle, computer-based assessment (CBA) was introduced, raising the question of whether central equivalence criteria required for a valid interpretation of the results are fulfilled. As an…

Descriptors: Reading Comprehension, Computer Assisted Testing, Achievement Tests, Foreign Countries

Can Item Response Times Provide Insight into Students' Motivation and Self-Efficacy in Math? An Initial Application of Test Metadata to Understand Students' Social-Emotional Needs

Peer reviewed

Soland, James – Educational Measurement: Issues and Practice, 2019

As computer-based tests become more common, there is a growing wealth of metadata related to examinees' response processes, which include solution strategies, concentration, and operating speed. One common type of metadata is item response time. While response times have been used extensively to improve estimates of achievement, little work…

Descriptors: Test Items, Item Response Theory, Metadata, Self Efficacy

Rapid-Guessing Behavior: Its Identification, Interpretation, and Implications

Peer reviewed

Wise, Steven L. – Educational Measurement: Issues and Practice, 2017

The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time

Using Response Time to Detect Item Preknowledge in Computer-Based Licensure Examinations

Peer reviewed

Qian, Hong; Staniewska, Dorota; Reckase, Mark; Woo, Ada – Educational Measurement: Issues and Practice, 2016

This article addresses the issue of how to detect item preknowledge using item response time data in two computer-based large-scale licensure examinations. Item preknowledge is indicated by an unexpected short response time and a correct response. Two samples were used for detecting item preknowledge for each examination. The first sample was from…

Descriptors: Reaction Time, Licensing Examinations (Professions), Computer Assisted Testing, Prior Learning

Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

Peer reviewed

Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2013

Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…

Descriptors: Educational Assessment, Test Items, Automation, Computer Assisted Testing

A Process for Reviewing and Evaluating Generated Test Items

Peer reviewed

Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016

Testing organizations need large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…

Descriptors: Test Items, Test Construction, Psychometrics, Models

Testing Software Review: MicroCAT Version 3.0.

Peer reviewed

Stone, Clement A. – Educational Measurement: Issues and Practice, 1989

MicroCAT version 3.0--an integrated test development, administration, and analysis system--is reviewed in this first article of a series on testing software. A framework for comparing testing software is presented. The strength of this package lies in the development, banking, and administration of items composed of text and graphics. (SLD)

Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Data Analysis
