Publication Date
In 2025 | 16 |
Since 2024 | 43 |
Since 2021 (last 5 years) | 145 |
Since 2016 (last 10 years) | 277 |
Since 2006 (last 20 years) | 408 |
Descriptor
Computer Assisted Testing | 603 |
Test Validity | 603 |
Test Reliability | 260 |
Test Construction | 194 |
Foreign Countries | 182 |
Language Tests | 109 |
Test Items | 98 |
English (Second Language) | 92 |
Scores | 91 |
Evaluation Methods | 85 |
Second Language Learning | 83 |
Author
McKown, Clark | 5 |
Petscher, Yaacov | 5 |
Bulut, Okan | 4 |
Garcia Laborda, Jesus | 4 |
Wainer, Howard | 4 |
Wise, Steven L. | 4 |
Alonzo, Julie | 3 |
Bejar, Isaac I. | 3 |
Bennett, Randy Elliot | 3 |
Cory, Charles H. | 3 |
Ecalle, Jean | 3 |
Location
China | 17 |
Canada | 14 |
Indonesia | 13 |
Australia | 11 |
Germany | 11 |
Turkey | 11 |
California | 10 |
New York | 7 |
United Kingdom | 7 |
United Kingdom (England) | 7 |
Taiwan | 6 |
Laws, Policies, & Programs
Individuals with Disabilities… | 2 |
Family Educational Rights and… | 1 |
Health Insurance Portability… | 1 |
No Child Left Behind Act 2001 | 1 |
Pell Grant Program | 1 |
Race to the Top | 1 |
Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025
Assessing preservice teachers' higher-order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. One type of HOTS is Bloomian HOTS. To help preservice teachers create problems in those subjects, an Android…
Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills
Ahmad, Nor Shafrin; Zaharudin, Rozniza; Khairani, Ahmad Zamri – International Journal of Educational Methodology, 2022
Anger is a topic that requires intervention from teachers, counsellors, psychologists, parents, and all communities. The expressions of anger are subjective and sometimes hard to identify. Thus, anger should be measured more objectively, while the expressions need to be examined closely. The purpose of this study is to provide valid confirmation…
Descriptors: Psychological Patterns, Test Validity, Psychometrics, Adolescents
K. Talman; J. Vierula; T. Karihtala; E. Laakkonen; J. Engblom; E. Haavisto – Higher Education Quarterly, 2025
Higher education institutions need to develop valid, fair, and objective selection methods. Current literature reporting the development and validation of new national large-scale selection tests is scarce. This two-phased study aimed to (1) develop and (2) evaluate the validity of the Finnish digital Universities of Applied Sciences Entrance…
Descriptors: Admission Criteria, Test Construction, Test Validity, Computer Assisted Testing
Xiong, Yao; Schunn, Christian D.; Wu, Yong – Journal of Computer Assisted Learning, 2023
Background: For peer assessment, reliability (i.e., consistency in ratings across peers) and validity (i.e., consistency of peer ratings with instructors or experts) are frequently examined in the research literature to address a central concern of instructors and students. Although the average levels are generally promising, both reliability and…
Descriptors: Peer Evaluation, Computer Assisted Testing, Test Reliability, Test Validity
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Ágnes Hódi; Edit Tóth – International Journal of Early Childhood, 2024
Phonological awareness plays a key role in learning to read; therefore, its assessment has received a lot of attention. Research in the domain of phonological awareness has been characterized by attempts to develop reliable and valid assessment tools for diverse populations. Over the past few decades, phonological awareness assessment has gone…
Descriptors: Phonological Awareness, Computer Assisted Testing, Hungarian, Native Language
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Ng, Emily – International Journal of Adult Education and Technology, 2020
The resources and time constraints of assessing large classes are always weighed up against the validity, reliability, and learning outcomes of the assessment tasks. With the digital revolution in the 21st Century, educators can benefit from computer technology to carry out a large-scale assessment in higher education more efficiently. In this…
Descriptors: Nursing Students, Computer Assisted Testing, Student Evaluation, Multiple Choice Tests
Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025
A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…
Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols
Backes, Ben; Cowan, James – National Center for Analysis of Longitudinal Data in Education Research (CALDER), 2020
Prior work has documented a substantial penalty associated with taking the Partnership for Assessment of Readiness for College and Careers (PARCC) online relative to on paper (Backes & Cowan, 2019). However, this penalty does not necessarily make online tests less useful. For example, it could be the case that computer literacy skills are…
Descriptors: Predictive Validity, Test Validity, Computer Assisted Testing, Comparative Analysis
Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024
The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…
Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Huawei, Shi; Aryadoust, Vahid – Education and Information Technologies, 2023
Automated writing evaluation (AWE) systems are developed based on interdisciplinary research and technological advances such as natural language processing, computer sciences, and latent semantic analysis. Despite a steady increase in research publications in this area, the results of AWE investigations are often mixed, and their validity may be…
Descriptors: Writing Evaluation, Writing Tests, Computer Assisted Testing, Automation
Panayiotis Panayides; Elena C. Papanastasiou; Katerina Georgiou; Maria Karekla – European Journal of Education, 2024
This study is an investigation of the validity of the Online Test Anxiety Inventory (O-TAI) for adult students. The scale contained the 20 items of the Test Anxiety Inventory (Spielberger, 1980), together with five computer anxiety items, all rephrased so as to pertain to online test anxiety. The scale was administered to a large sample of Greek…
Descriptors: Test Validity, Computer Assisted Testing, Test Anxiety, Measures (Individuals)
Mostafa M. Samy; Mohamed A. Metwally; Mahmoud Ashry; Wael M. Elmayyah – Measurement: Interdisciplinary Research and Perspectives, 2025
Gas Turbine Engines (GTEs) have the highest power-to-weight ratio among Internal Combustion Engines (ICEs). Their modularity and ability to utilize various types of fuel make them highly recommended in power plants, naval transportation, and, of course, aviation, where they are most widely used. The lack of real GTE data is increasing a recognized need for…
Descriptors: Engines, Power Technology, Data Collection, Data Interpretation