ERIC - Search Results

Publication Date

In 2025	3
Since 2024	7
Since 2021 (last 5 years)	23
Since 2016 (last 10 years)	40
Since 2006 (last 20 years)	46

Descriptor

Computer Assisted Testing	81
Test Construction	34
Elementary Secondary Education	19
Test Items	19
Adaptive Testing	16
Microcomputers	15
Test Validity	14
Computer Software	12
Educational Testing	11
Item Banks	11
Scoring	10
Educational Assessment	9
Evaluation Methods	9
Item Response Theory	9
Testing	8
Item Analysis	7
Models	7
Scores	7
Student Evaluation	7
Test Use	7
Achievement Tests	6
Mathematics Tests	6
Psychometrics	6
Reaction Time	6
Testing Problems	6
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	81
Reports - Research	32
Reports - Evaluative	23
Reports - Descriptive	15
Opinion Papers	10
Information Analyses	6
Speeches/Meeting Papers	3
Book/Product Reviews	2
Collected Works - Serials	1
Guides - Non-Classroom	1
Tests/Questionnaires	1
More ▼

Audience

Researchers	8
Practitioners	1

Location

Canada	1
Germany	1
Hong Kong	1
Texas	1
West Virginia	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
ACT Assessment	1
Learning Potential Assessment…	1
United States Medical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 81 results Save | Export

Investigating Approaches to Controlling Item Position Effects in Computerized Adaptive Tests

Peer reviewed

Direct link

Ye Ma; Deborah J. Harris – Educational Measurement: Issues and Practice, 2025

Item position effect (IPE) refers to situations where an item performs differently when it is administered in different positions on a test. The majority of previous research studies have focused on investigating IPE under linear testing. There is a lack of IPE research under adaptive testing. In addition, the existence of IPE might violate Item…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

Item Response Theory Models for Polytomous Multidimensional Forced-Choice Items to Measure Construct Differentiation

Peer reviewed

Direct link

Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024

Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…

Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment

Item Selection Algorithm Based on Collaborative Filtering for Item Exposure Control

Peer reviewed

Direct link

Pan, Yiqin; Livne, Oren; Wollack, James A.; Sinharay, Sandip – Educational Measurement: Issues and Practice, 2023

In computerized adaptive testing, overexposure of items in the bank is a serious problem and might result in item compromise. We develop an item selection algorithm that utilizes the entire bank well and reduces the overexposure of items. The algorithm is based on collaborative filtering and selects an item in two stages. In the first stage, a set…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

An Automated Item Pool Assembly Framework for Maximizing Item Utilization for CAT

Peer reviewed

Direct link

Hwanggyu Lim; Kyung T. Han – Educational Measurement: Issues and Practice, 2024

Computerized adaptive testing (CAT) has gained deserved popularity in the administration of educational and professional assessments, but continues to face test security challenges. To ensure sustained quality assurance and testing integrity, it is imperative to establish and maintain multiple stable item pools that are consistent in terms of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Measuring Variability in Proctor Decision Making on High-Stakes Assessments: Improving Test Security in the Digital Age

Peer reviewed

Direct link

William Belzak; J. R. Lockwood; Yigal Attali – Educational Measurement: Issues and Practice, 2024

Remote proctoring, or monitoring test takers through internet-based, video-recording software, has become critical for maintaining test security on high-stakes assessments. The main role of remote proctors is to make judgments about test takers' behaviors and decide whether these behaviors constitute rule violations. Variability in proctor…

Descriptors: Computer Security, High Stakes Tests, English (Second Language), Second Language Learning

Score Reporting for Examinees with Incomplete Data on Large-Scale Educational Assessments

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2021

Technical difficulties occasionally lead to missing item scores and hence to incomplete data on computerized tests. It is not straightforward to report scores to the examinees whose data are incomplete due to technical difficulties. Such reporting essentially involves imputation of missing scores. In this paper, a simulation study based on data…

Descriptors: Data Analysis, Scores, Educational Assessment, Educational Testing

Applying Evidence-Centered Design in the Development of a Multidimensional Adaptive Reading Motivation Measure

Peer reviewed

Direct link

Wang, Wenhao; Kingston, Neal M.; Davis, Marcia H.; Tiemann, Gail C.; Tonks, Stephen; Hock, Michael – Educational Measurement: Issues and Practice, 2021

Adaptive tests are more efficient than fixed-length tests through the use of item response theory; adaptive tests also present students questions that are tailored to their proficiency level. Although the adaptive algorithm is straightforward, developing a multidimensional computer adaptive test (MCAT) measure is complex. Evidence-centered design…

Descriptors: Evidence Based Practice, Reading Motivation, Adaptive Testing, Computer Assisted Testing

Hierarchical Agglomerative Clustering to Detect Test Collusion on Computer-Based Tests

Peer reviewed

Direct link

Ingrisone, Soo Jeong; Ingrisone, James N. – Educational Measurement: Issues and Practice, 2023

There has been a growing interest in approaches based on machine learning (ML) for detecting test collusion as an alternative to the traditional methods. Clustering analysis under an unsupervised learning technique appears especially promising to detect group collusion. In this study, the effectiveness of hierarchical agglomerative clustering…

Descriptors: Identification, Cooperation, Computer Assisted Testing, Artificial Intelligence

Evolving Educational Testing to Meet Students' Needs: Design-in-Real-Time Assessment

Peer reviewed

Direct link

Stephen G. Sireci; Javier Suárez-Álvarez; April L. Zenisky; Maria Elena Oliveri – Educational Measurement: Issues and Practice, 2024

The goal in personalized assessment is to best fit the needs of each individual test taker, given the assessment purposes. Design-in-Real-Time (DIRTy) assessment reflects the progressive evolution in testing from a single test, to an adaptive test, to an adaptive assessment "system." In this article, we lay the foundation for DIRTy…

Descriptors: Educational Assessment, Student Needs, Test Format, Test Construction

Applications and Modeling of Keystroke Logs in Writing Assessments

Peer reviewed

Direct link

Mo Zhang; Paul Deane; Andrew Hoang; Hongwen Guo; Chen Li – Educational Measurement: Issues and Practice, 2025

In this paper, we describe two empirical studies that demonstrate the application and modeling of keystroke logs in writing assessments. We illustrate two different approaches of modeling differences in writing processes: analysis of mean differences in handcrafted theory-driven features and use of large language models to identify stable personal…

Descriptors: Writing Tests, Computer Assisted Testing, Keyboarding (Data Entry), Writing Processes

Measurement Efficiency for Technology-Enhanced and Multiple-Choice Items in a K-12 Mathematics Accountability Assessment

Peer reviewed

Direct link

Ersan, Ozge; Berry, Yufeng – Educational Measurement: Issues and Practice, 2023

The increasing use of computerization in the testing industry and the need for items potentially measuring higher-order skills have led educational measurement communities to develop technology-enhanced (TE) items and conduct validity studies on the use of TE items. Parallel to this goal, the purpose of this study was to collect validity evidence…

Descriptors: Computer Assisted Testing, Multiple Choice Tests, Elementary Secondary Education, Accountability

Exploration of Latent Structure in Test Revision and Review Log Data

Peer reviewed

Direct link

Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023

In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…

Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items

Comparing Large-Scale Assessments in Two Proctoring Modalities with Interactive Log Data Analysis

Peer reviewed

Direct link

Shin, Jinnie; Guo, Qi; Morin, Maxim – Educational Measurement: Issues and Practice, 2023

With the increased restrictions on physical distancing due to the COVID-19 pandemic, remote proctoring has emerged as an alternative to traditional onsite proctoring to ensure the continuity of essential assessments, such as computer-based medical licensing exams. Recent literature has highlighted the significant impact of different proctoring…

Descriptors: Foreign Countries, High Stakes Tests, Computer Assisted Testing, Licensing Examinations (Professions)

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Direct link

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Bilevel Topic Model-Based Multitask Learning for Constructed-Responses Multidimensional Automated Scoring and Interpretation

Peer reviewed

Direct link

Xiong, Jiawei; Li, Feiming – Educational Measurement: Issues and Practice, 2023

Multidimensional scoring evaluates each constructed-response answer from more than one rating dimension and/or trait such as lexicon, organization, and supporting ideas instead of only one holistic score, to help students distinguish between various dimensions of writing quality. In this work, we present a bilevel learning model for combining two…

Descriptors: Scoring, Models, Task Analysis, Learning Processes

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Wise, Steven L.	4
Kingsbury, G. Gage	3
Sinharay, Sandip	3
Arslan, Burcu	2
Bennett, Randy E.	2
Deane, Paul	2
Gierl, Mark J.	2
Hiscox, Michael D.	2
Hsu, Tse-chi	2
Keehner, Madeleine	2
Lai, Hollis	2
Plake, Barbara S.	2
Stone, Clement A.	2
Zhang, Mo	2
Abedi, Jamal	1
Agrimson, Jared	1
Ahmadi, Alireza	1
Allalouf, Avi	1
Andrew Hoang	1
April L. Zenisky	1
Arthur, Ann M.	1
Averitt, Jason	1
Baker, Frank B.	1
Balizet, Sha	1
More ▼

Secondary Education	7
High Schools	4
Higher Education	3
Junior High Schools	3
Middle Schools	3
Postsecondary Education	3
Adult Education	2
Elementary Education	2
Elementary Secondary Education	2
Grade 4	2
Intermediate Grades	2
Early Childhood Education	1
Grade 10	1
Grade 3	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High School Equivalency…	1
Primary Education	1
More ▼