NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20261
Since 202535
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 35 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Zahra Banitalebi; Masoomeh Estaji; Gavin T. L. Brown – Educational Technology & Society, 2025
The significance of teacher's assessment literacy (AL) was originally captured by the 1990 standards for teacher's competence in educational assessment. Competence in assessment has changed with the widespread use of recent technology advancements in educational assessment. Consequently, new measures are needed to measure Teacher Assessment…
Descriptors: Assessment Literacy, Computer Assisted Testing, Measurement Techniques, Questionnaires
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025
Assessing preservice' higher order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. Among various types of HOTS, one is Bloomian HOTS. To facilitate the preservice teacher to create problems in those subjects, an Android…
Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025
The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…
Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing
Peer reviewed Peer reviewed
Direct linkDirect link
K. Talman; J. Vierula; T. Karihtala; E. Laakkonen; J. Engblom; E. Haavisto – Higher Education Quarterly, 2025
Higher education institutions need to develop valid, fair, and objective selection methods. Current literature reporting the development and validation of new national large-scale selection tests is scarce. This two-phased study aimed to (1) develop and (2) evaluate the validity of the Finnish digital Universities of Applied Sciences Entrance…
Descriptors: Admission Criteria, Test Construction, Test Validity, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Daniel G. Lannin; Taylor Flinn; Alexandra Ilie; Dan Ispas – Teaching of Psychology, 2026
Background: The validity of unmonitored online exams has raised concerns about academic integrity and grade inflation, especially given the rise of artificial intelligence-powered tools. Objective: This study evaluates the validity of unmonitored online exams by comparing student performance between two sections of an undergraduate personality…
Descriptors: Computer Assisted Testing, Test Validity, Undergraduate Students, Psychology
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ebru Balta; Arzu Uçar – International Journal of Assessment Tools in Education, 2025
Unproctored Computerized Adaptive Testing (CAT) is gaining traction due to its convenience, flexibility, and scalability, particularly in high-stakes assessments. However, the lack of proctor can give rise to aberrant testing behavior. These behaviors can impair the validity of test scores. This paper explores the use of a verification test to…
Descriptors: Adaptive Testing, Computer Assisted Testing, Paper and Pencil Tests, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Angela Chamberlain; Emily D'Arcy; Andrew J. O. Whitehouse; Kerry Wallace; Maya Hayden-Evans; Sonya Girdler; Benjamin Milbourn; Sven Bölte; Kiah Evans – Journal of Autism and Developmental Disorders, 2025
Purpose: The PEDI-CAT (ASD) is used to assess functioning of children and youth on the autism spectrum; however, current psychometric evidence is limited. This study aimed to explore the reliability, validity and acceptability of the PEDI-CAT (ASD) using a large Australian sample. Methods: Caregivers of 134 children and youth on the spectrum…
Descriptors: Autism Spectrum Disorders, Children, Youth, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Mingfeng Xue; Yunting Liu; Xingyao Xiao; Mark Wilson – Journal of Educational Measurement, 2025
Prompts play a crucial role in eliciting accurate outputs from large language models (LLMs). This study examines the effectiveness of an automatic prompt engineering (APE) framework for automatic scoring in educational measurement. We collected constructed-response data from 930 students across 11 items and used human scores as the true labels. A…
Descriptors: Computer Assisted Testing, Prompting, Educational Assessment, Automation
Peer reviewed Peer reviewed
Direct linkDirect link
Sukru Murat Cebeci; Selcuk Acar – Journal of Creative Behavior, 2025
This study presents the Cebeci Test of Creativity (CTC), a novel computerized assessment tool designed to address the limitations of traditional open-ended paper-and-pencil creativity tests. The CTC is designed to overcome the challenges associated with the administration and manual scoring of traditional paper and pencil creativity tests. In this…
Descriptors: Creativity, Creativity Tests, Test Construction, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Andreea Dutulescu; Stefan Ruseti; Denis Iorga; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2025
Automated multiple-choice question (MCQ) generation is valuable for scalable assessment and enhanced learning experiences. How-ever, existing MCQ generation methods face challenges in ensuring plausible distractors and maintaining answer consistency. This paper intro-duces a method for MCQ generation that integrates reasoning-based explanations…
Descriptors: Automation, Computer Assisted Testing, Multiple Choice Tests, Natural Language Processing
Peer reviewed Peer reviewed
Direct linkDirect link
Jun-ichiro Yasuda; Michael M. Hull; Naohiro Mae; Kentaro Kojima – Physical Review Physics Education Research, 2025
Although conceptual assessment tests are commonly administered at the beginning and end of a semester, this pre-post approach has inherent limitations. Specifically, education researchers and instructors have limited ability to observe the progression of students' conceptual understanding throughout the course. Furthermore, instructors are limited…
Descriptors: Computer Assisted Testing, Adaptive Testing, Science Tests, Scientific Concepts
Peer reviewed Peer reviewed
Direct linkDirect link
Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025
A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…
Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols
Peer reviewed Peer reviewed
Direct linkDirect link
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Peer reviewed Peer reviewed
Direct linkDirect link
Mostafa M. Samy; Mohamed A. Metwally; Mahmoud Ashry; Wael M. Elmayyah – Measurement: Interdisciplinary Research and Perspectives, 2025
Gas Turbine Engines (GTE) have the highest power-to-weight ratio among Internal Combustion Engines (ICE). Its modularity and ability to utilize various types of fuel make it highly recommended in power plants, naval transportation, and, of course, the most equipped in aviation. The lack of GTEs' real data is increasing a recognized need for…
Descriptors: Engines, Power Technology, Data Collection, Data Interpretation
Peer reviewed Peer reviewed
Direct linkDirect link
Beifang Ma; Maximilian Krötz; Viola Deutscher; Esther Winther – International Journal of Training and Development, 2025
The rapid digital transformation of vocational education and training (VET) has underscored the need to adapt traditional assessment methods to digital formats. However, when transitioning to digital modes, it is crucial to consider factors beyond mere technical implementation, particularly the potential impact of altered presentation formats on…
Descriptors: Job Skills, Competence, Test Format, Computer Assisted Testing
Previous Page | Next Page »
Pages: 1  |  2  |  3