Publication Date
In 2025 | 7 |
Since 2024 | 21 |
Since 2021 (last 5 years) | 67 |
Since 2016 (last 10 years) | 134 |
Since 2006 (last 20 years) | 188 |
Descriptor
Computer Assisted Testing | 260 |
Test Reliability | 260 |
Test Validity | 260 |
Test Construction | 96 |
Foreign Countries | 82 |
Evaluation Methods | 43 |
Scores | 40 |
Language Tests | 36 |
Student Evaluation | 36 |
Elementary School Students | 35 |
Test Items | 34 |
More ▼ |
Source
Author
Petscher, Yaacov | 5 |
Federico, Pat-Anthony | 3 |
McKown, Clark | 3 |
Russo-Ponsaran, Nicole M. | 3 |
Tock, Jamie | 3 |
Ackerman, Debra J. | 2 |
Amit Sevak | 2 |
Anderson, Paul S. | 2 |
Anna-Maria Fall | 2 |
Beula M. Magimairaj | 2 |
Boyle, Michael H. | 2 |
More ▼ |
Publication Type
Education Level
Location
Turkey | 9 |
California | 6 |
China | 6 |
New York | 6 |
Germany | 5 |
Indonesia | 5 |
Australia | 4 |
Canada | 4 |
Malaysia | 4 |
United Kingdom | 4 |
Florida | 3 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Pell Grant Program | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Xiong, Yao; Schunn, Christian D.; Wu, Yong – Journal of Computer Assisted Learning, 2023
Background: For peer assessment, reliability (i.e., consistency in ratings across peers) and validity (i.e., consistency of peer ratings with instructors or experts) are frequently examined in the research literature to address a central concern of instructors and students. Although the average levels are generally promising, both reliability and…
Descriptors: Peer Evaluation, Computer Assisted Testing, Test Reliability, Test Validity
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Ágnes Hódi; Edit Tóth – International Journal of Early Childhood, 2024
Phonological awareness plays a key role in learning to read; therefore, its assessment has received a lot of attention. Research in the domain of phonological awareness has been characterized by attempts to develop reliable and valid assessment tools for diverse populations. Over the past few decades, phonological awareness assessment has gone…
Descriptors: Phonological Awareness, Computer Assisted Testing, Hungarian, Native Language
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025
A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…
Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols
Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024
The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…
Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Sher Muhammad Awan; Rashid Hussain; Khalid Saleem – Pakistan Journal of Distance and Online Learning, 2024
The objective of this study was to find effect of Jazz smart classroom's online quiz on academic achievement of students at secondary school level. Experimental design was used in which two groups were selected one was control and other was experimental group. Experimental group was given a treatment of six weeks by using Jazz Smart Classroom's…
Descriptors: Educational Technology, High School Students, Computer Assisted Testing, Academic Achievement
Luis Felipe Dias Lopes; Fabiane Volpato Chiapinoto; Martiele Gonçalves Moreira; Nuvea Kuhn; Fillipe Grando Lopes; Luciana Davi Traverso; Deoclécio Junior Cardoso Silva; Gilnei Luiz de Moura – Journal of Education and Learning, 2024
This study aimed to validate a scale for subjectively measuring teaching competencies for innovation in higher education. The scale was developed by creating a set of items that underwent content validity through the Delphi technique and face validity. A survey was then conducted with 523 higher education professors. The resulting scale, called…
Descriptors: Foreign Countries, College Faculty, Teacher Competencies, Teacher Competency Testing
Sonique Sailsman; Emma El-Shami – Quarterly Review of Distance Education, 2024
Nurse educators at the undergraduate level spend significant time developing and revising exam questions. Following the exam administration, course faculty have the opportunity to complete an item analysis and question revision to improve reliability and validity. A challenge faculty face is tracking these exam changes when teaching as part of a…
Descriptors: Nursing Education, Nursing Students, College Faculty, Test Construction
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025
Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…
Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction
Che Lah, Noor Hidayah; Tasir, Zaidatun; Jumaat, Nurul Farhana – Educational Studies, 2023
The aim of the study was to evaluate the extended version of the Problem-Solving Inventory (PSI) via an online learning setting known as the Online Problem-Solving Inventory (OPSI) through the lens of Rasch Model analysis. To date, there is no extended version of the PSI for online settings even though many researchers have used it; thus, this…
Descriptors: Problem Solving, Measures (Individuals), Electronic Learning, Item Response Theory
Uzun, N. Bilge; Aktas, Mehtap; Akay, Cenk – Journal of Educational Technology, 2023
The challenges experienced in measurement and evaluation during the distance education process among student and instructor groups are discussed in the study. A qualitative meta-synthesis method is used in this research. Twenty studies were included in the meta-synthesis. The challenges experienced by the instructors are program utilization,…
Descriptors: Measurement Techniques, Evaluation Methods, Distance Education, Literature Reviews
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests