Publication Date
| In 2026 | 1 |
| Since 2025 | 35 |
Descriptor
| Computer Assisted Testing | 35 |
| Test Validity | 29 |
| Foreign Countries | 15 |
| Test Construction | 11 |
| Test Reliability | 11 |
| Evaluation Methods | 8 |
| Test Items | 8 |
| Elementary School Students | 7 |
| Scores | 7 |
| Student Evaluation | 6 |
| College Students | 5 |
| More ▼ | |
Source
Author
| Alba Richaudeau | 1 |
| Alexandra Ilie | 1 |
| Amanda Leigh Duncan | 1 |
| Ana Oliveira-Buckley | 1 |
| Andreea Dutulescu | 1 |
| Andrew J. O. Whitehouse | 1 |
| Angela Chamberlain | 1 |
| Anke Lindmeier | 1 |
| Ann Arthur | 1 |
| Anthony Setari | 1 |
| Arzu Uçar | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 32 |
| Reports - Research | 32 |
| Reports - Evaluative | 2 |
| Collected Works - General | 1 |
| Reports - Descriptive | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
Audience
Location
| Indonesia | 3 |
| Argentina | 1 |
| Australia | 1 |
| California | 1 |
| Chile | 1 |
| China | 1 |
| Finland | 1 |
| Greece | 1 |
| Iran | 1 |
| Ireland | 1 |
| New York | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Zahra Banitalebi; Masoomeh Estaji; Gavin T. L. Brown – Educational Technology & Society, 2025
The significance of teacher's assessment literacy (AL) was originally captured by the 1990 standards for teacher's competence in educational assessment. Competence in assessment has changed with the widespread use of recent technology advancements in educational assessment. Consequently, new measures are needed to measure Teacher Assessment…
Descriptors: Assessment Literacy, Computer Assisted Testing, Measurement Techniques, Questionnaires
Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025
Assessing preservice' higher order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. Among various types of HOTS, one is Bloomian HOTS. To facilitate the preservice teacher to create problems in those subjects, an Android…
Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills
Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025
The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…
Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing
K. Talman; J. Vierula; T. Karihtala; E. Laakkonen; J. Engblom; E. Haavisto – Higher Education Quarterly, 2025
Higher education institutions need to develop valid, fair, and objective selection methods. Current literature reporting the development and validation of new national large-scale selection tests is scarce. This two-phased study aimed to (1) develop and (2) evaluate the validity of the Finnish digital Universities of Applied Sciences Entrance…
Descriptors: Admission Criteria, Test Construction, Test Validity, Computer Assisted Testing
Daniel G. Lannin; Taylor Flinn; Alexandra Ilie; Dan Ispas – Teaching of Psychology, 2026
Background: The validity of unmonitored online exams has raised concerns about academic integrity and grade inflation, especially given the rise of artificial intelligence-powered tools. Objective: This study evaluates the validity of unmonitored online exams by comparing student performance between two sections of an undergraduate personality…
Descriptors: Computer Assisted Testing, Test Validity, Undergraduate Students, Psychology
Ebru Balta; Arzu Uçar – International Journal of Assessment Tools in Education, 2025
Unproctored Computerized Adaptive Testing (CAT) is gaining traction due to its convenience, flexibility, and scalability, particularly in high-stakes assessments. However, the lack of proctor can give rise to aberrant testing behavior. These behaviors can impair the validity of test scores. This paper explores the use of a verification test to…
Descriptors: Adaptive Testing, Computer Assisted Testing, Paper and Pencil Tests, Test Validity
Angela Chamberlain; Emily D'Arcy; Andrew J. O. Whitehouse; Kerry Wallace; Maya Hayden-Evans; Sonya Girdler; Benjamin Milbourn; Sven Bölte; Kiah Evans – Journal of Autism and Developmental Disorders, 2025
Purpose: The PEDI-CAT (ASD) is used to assess functioning of children and youth on the autism spectrum; however, current psychometric evidence is limited. This study aimed to explore the reliability, validity and acceptability of the PEDI-CAT (ASD) using a large Australian sample. Methods: Caregivers of 134 children and youth on the spectrum…
Descriptors: Autism Spectrum Disorders, Children, Youth, Test Reliability
Mingfeng Xue; Yunting Liu; Xingyao Xiao; Mark Wilson – Journal of Educational Measurement, 2025
Prompts play a crucial role in eliciting accurate outputs from large language models (LLMs). This study examines the effectiveness of an automatic prompt engineering (APE) framework for automatic scoring in educational measurement. We collected constructed-response data from 930 students across 11 items and used human scores as the true labels. A…
Descriptors: Computer Assisted Testing, Prompting, Educational Assessment, Automation
Sukru Murat Cebeci; Selcuk Acar – Journal of Creative Behavior, 2025
This study presents the Cebeci Test of Creativity (CTC), a novel computerized assessment tool designed to address the limitations of traditional open-ended paper-and-pencil creativity tests. The CTC is designed to overcome the challenges associated with the administration and manual scoring of traditional paper and pencil creativity tests. In this…
Descriptors: Creativity, Creativity Tests, Test Construction, Test Validity
Andreea Dutulescu; Stefan Ruseti; Denis Iorga; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2025
Automated multiple-choice question (MCQ) generation is valuable for scalable assessment and enhanced learning experiences. How-ever, existing MCQ generation methods face challenges in ensuring plausible distractors and maintaining answer consistency. This paper intro-duces a method for MCQ generation that integrates reasoning-based explanations…
Descriptors: Automation, Computer Assisted Testing, Multiple Choice Tests, Natural Language Processing
Jun-ichiro Yasuda; Michael M. Hull; Naohiro Mae; Kentaro Kojima – Physical Review Physics Education Research, 2025
Although conceptual assessment tests are commonly administered at the beginning and end of a semester, this pre-post approach has inherent limitations. Specifically, education researchers and instructors have limited ability to observe the progression of students' conceptual understanding throughout the course. Furthermore, instructors are limited…
Descriptors: Computer Assisted Testing, Adaptive Testing, Science Tests, Scientific Concepts
Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025
A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…
Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Mostafa M. Samy; Mohamed A. Metwally; Mahmoud Ashry; Wael M. Elmayyah – Measurement: Interdisciplinary Research and Perspectives, 2025
Gas Turbine Engines (GTE) have the highest power-to-weight ratio among Internal Combustion Engines (ICE). Its modularity and ability to utilize various types of fuel make it highly recommended in power plants, naval transportation, and, of course, the most equipped in aviation. The lack of GTEs' real data is increasing a recognized need for…
Descriptors: Engines, Power Technology, Data Collection, Data Interpretation
Beifang Ma; Maximilian Krötz; Viola Deutscher; Esther Winther – International Journal of Training and Development, 2025
The rapid digital transformation of vocational education and training (VET) has underscored the need to adapt traditional assessment methods to digital formats. However, when transitioning to digital modes, it is crucial to consider factors beyond mere technical implementation, particularly the potential impact of altered presentation formats on…
Descriptors: Job Skills, Competence, Test Format, Computer Assisted Testing

Peer reviewed
Direct link
