Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023
The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…
Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability
Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024
The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…
Descriptors: Accuracy, Reliability, Computational Linguistics, Standards
Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024
The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…
Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety
Schoch, Kerstin; Ostermann, Thomas – Creativity Research Journal, 2022
Although art has been subject to psychological research for some time, the artwork itself received little attention in quantitative research. The rating instrument for two-dimensional pictorial works ("RizbA") fills this gap by providing a tool for formal picture analysis. This study validates the questionnaire on 294 images created by…
Descriptors: Psychometrics, Art, Measures (Individuals), Visual Arts
Shard; Devesh Kumar; Sapna Koul – International Journal of Information and Learning Technology, 2024
Purpose: This study aims to gain insights into how students perceive online examination practices and evaluation, as well as identify the key factors that impact their intentions toward online exams. Design/methodology/approach: This empirical study conducted in India utilized an online survey method between May 24 and June 14, 2022. The data were…
Descriptors: Foreign Countries, Undergraduate Students, Graduate Students, Student Attitudes
Luis Felipe Dias Lopes; Fabiane Volpato Chiapinoto; Martiele Gonçalves Moreira; Nuvea Kuhn; Fillipe Grando Lopes; Luciana Davi Traverso; Deoclécio Junior Cardoso Silva; Gilnei Luiz de Moura – Journal of Education and Learning, 2024
This study aimed to validate a scale for subjectively measuring teaching competencies for innovation in higher education. The scale was developed by creating a set of items that underwent content validity through the Delphi technique and face validity. A survey was then conducted with 523 higher education professors. The resulting scale, called…
Descriptors: Foreign Countries, College Faculty, Teacher Competencies, Teacher Competency Testing
Sonique Sailsman; Emma El-Shami – Quarterly Review of Distance Education, 2024
Nurse educators at the undergraduate level spend significant time developing and revising exam questions. Following the exam administration, course faculty have the opportunity to complete an item analysis and question revision to improve reliability and validity. A challenge faculty face is tracking these exam changes when teaching as part of a…
Descriptors: Nursing Education, Nursing Students, College Faculty, Test Construction
Andrea Fernández-Sánchez; Juan José Lorenzo-Castiñeiras; Ana Sánchez-Bello – European Journal of Education, 2025
The advent of artificial intelligence (AI) technologies heralds a transformative era in education. This study investigates the integration of AI tools in developing educational assessment rubrics within the 'Curriculum Design Development and Evaluation' course at the University of A Coruña during the 2023-2024 academic year. Employing an…
Descriptors: Foreign Countries, Higher Education, Artificial Intelligence, Technology Integration
Che Lah, Noor Hidayah; Tasir, Zaidatun; Jumaat, Nurul Farhana – Educational Studies, 2023
The aim of the study was to evaluate the extended version of the Problem-Solving Inventory (PSI) via an online learning setting known as the Online Problem-Solving Inventory (OPSI) through the lens of Rasch Model analysis. To date, there is no extended version of the PSI for online settings even though many researchers have used it; thus, this…
Descriptors: Problem Solving, Measures (Individuals), Electronic Learning, Item Response Theory
Junlan Pan; Emma Marsden – Language Testing, 2024
"Tests of Aptitude for Language Learning" (TALL) is an openly accessible internet-based battery to measure the multifaceted construct of foreign language aptitude, using language domain-specific instruments and L1-sensitive instructions and stimuli. This brief report introduces the components of this theory-informed battery and…
Descriptors: Language Tests, Aptitude Tests, Second Language Learning, Test Construction
On-Soon Lee – Journal of Pan-Pacific Association of Applied Linguistics, 2024
Despite the increasing interest in using AI tools as assistant agents in instructional settings, the effectiveness of ChatGPT, the generative pretrained AI, for evaluating the accuracy of second language (L2) writing has been largely unexplored in formative assessment. Therefore, the current study aims to examine how ChatGPT, as an evaluator,…
Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning
Mücahit Öztürk – Open Praxis, 2024
This study examined the problems that pre-service teachers face in the online assessment process and their suggestions for solutions to these problems. The participants were 136 pre-service teachers who have been experiencing online assessment for a long time and who took the Foundations of Open and Distance Learning course. This research is a…
Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Distance Education
Erdemir, Mustafa; Akyuz, Halil Ibrahim – Journal on School Educational Technology, 2020
The purpose of this study is to reduce ethics violations such as tricking and cheating that may occur in the online assessment of undergraduate level Physics-II (Electricity) course subjects. The study is significant for the reliable and ethical evaluation of the Internet and computer-based educational process. Thirty-eight pre-service teachers…
Descriptors: Test Reliability, Ethics, Cheating, Undergraduate Students

