Publication Date
In 2025 | 46 |
Since 2024 | 187 |
Since 2021 (last 5 years) | 694 |
Since 2016 (last 10 years) | 1883 |
Since 2006 (last 20 years) | 3471 |
Audience
Teachers | 34 |
Researchers | 29 |
Practitioners | 28 |
Policymakers | 9 |
Administrators | 8 |
Counselors | 1 |
Media Staff | 1 |
Students | 1 |
Location
Turkey | 178 |
Australia | 77 |
Canada | 72 |
China | 72 |
Germany | 62 |
United Kingdom | 62 |
United Kingdom (England) | 51 |
Indonesia | 46 |
Netherlands | 44 |
California | 43 |
Taiwan | 40 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 4 |
Meets WWC Standards with or without Reservations | 6 |
Does not meet standards | 2 |
Susan Ramlo; Carrie Salmon; Yuan Xue – Journal of College Science Teaching, 2025
Research shows that there are multiple benefits to giving college students oral rather than written exams. However, studies that examine, describe, and differentiate how students view their oral exams were never found in a literature search. The purpose of this study was to use Q methodology [Q] to describe the divergent student views about taking…
Descriptors: Undergraduate Students, Science Instruction, Chemistry, Organic Chemistry
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Joanna Williamson – Research Matters, 2025
Teachers, examiners and assessment experts know from experience that some candidates annotate exam questions. "Annotation" includes anything the candidate writes or draws outside of the designated response space, such as underlining, jotting, circling, sketching and calculating. Annotations are of interest because they may evidence…
Descriptors: Mathematics, Tests, Documentation, Secondary Education
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Jingwen Wang; Ying Zheng; Yi Zou – Language Testing in Asia, 2024
Pearson Test of English Academic (PTE Academic), a high-stakes English language proficiency test, underwent substantial revisions in 2021. The test duration was reduced from 3 h to 2 h by reducing specific task numbers and sections. This study investigates the impact of these changes on teachers' perceptions and teaching practices, areas…
Descriptors: Foreign Countries, High Stakes Tests, Language Proficiency, Language Tests
Yücel Makaraci; Kazim Nas; Kerem Gündüz; Abdullah Uysal; Samuel T. Orange; Juan D. Ruiz-Cárdenas – Measurement in Physical Education and Exercise Science, 2024
The aim was to determine the validity and test-retest reliability of the Sit to Stand App variables (rising time, vertical velocity, and power) for measuring single-leg sit-to-stand (STS) test compared to those derived from ground reaction force data. Twenty-seven female athletes performed the single-leg STS test over three consecutive sessions…
Descriptors: Computer Simulation, Measurement Techniques, Athletics, Physical Fitness
Pastor, Dena A.; Patterson, Chris R.; Finney, Sara J. – Journal of Psychoeducational Assessment, 2023
In low-stakes testing contexts, there are minimal personal consequences associated with examinee performance. Examples include assessments administered for research, program evaluation, test development, and international comparisons (e.g., Programme for International Student Assessment [PISA]). Because test-taking motivation can suffer in…
Descriptors: Test Construction, Test Validity, Student Attitudes, Attitude Measures
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Tugra Karademir Coskun; Ayfer Alper – Digital Education Review, 2024
This study aims to examine the potential differences between teacher evaluations and artificial intelligence (AI) tool-based assessment systems in university examinations. The research has evaluated a wide spectrum of exams including numerical and verbal course exams, exams with different assessment styles (project, test exam, traditional exam),…
Descriptors: Artificial Intelligence, Visual Aids, Video Technology, Tests
Sheri Bayley – Teaching and Learning in Communication Sciences & Disorders, 2024
The purpose of this study was to explore student performance, self-ratings of learning and preference, and student comments on a variety of reading quiz formats in a first semester speech-language pathology graduate course. Students from two cohorts (n = 34) completed four types of quizzes: closed-book, open-book, open-note, and collaborative…
Descriptors: Reading Instruction, Tests, Graduate Students, Courses
Tri Sedya Febrianti; Siti Fatimah; Yuni Fitriyah; Hanifah Nurhayati – International Journal of Education in Mathematics, Science and Technology, 2024
Assessing students' understanding of circle-related material through subjective tests is effective, though grading these tests can be challenging and often requires technological support. ChatGPT has shown promise in providing reliable and objective evaluations. Many teachers in Indonesia, however, continue to face difficulties integrating…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Scoring, Tests
Li Zhao; Junjie Peng; Shiqi Ke; Kang Lee – Educational Psychology Review, 2024
Unproctored and teacher-proctored exams have been widely used to prevent cheating at many universities worldwide. However, no empirical studies have directly compared their effectiveness in promoting academic integrity in actual exams. To address this significant gap, in four preregistered field studies, we examined the effectiveness of…
Descriptors: Supervision, Tests, Testing, Integrity
Alex Buckley – Studies in Higher Education, 2024
Despite a large amount of critical research literature, traditional examinations continue to be widely used in higher education. This article reviews recent literature in order to assess the role played by the approaches adopted by researchers in the gap between research on exams, and the way exams are used. Viviane Robinson's 'problem-based…
Descriptors: Literature Reviews, Testing, Higher Education, Testing Problems
Hanssens, Jolan; Langie, Greet; Van Soom, Carolien – International Journal of Education in Mathematics, Science and Technology, 2023
In Flanders, open admission into higher education has led to heterogeneity in academic preparedness of incoming STEM students. Higher education institutions offer low stakes positioning tests to these students in order to help them assess their level of starting competences. Due to the unique nature of these tests, little can be inferred about…
Descriptors: Student Attitudes, Tests, Higher Education, STEM Education
Abdullah Al Fraidan; Meznah Saud Abdulaziz Alsubaie – Educational Process: International Journal, 2025
Background: This study examines the effect of test anxiety on the academic performance of postgraduate female students, focusing on their perceptions and experiences in open-book exams (OBE) and closed-book exams (CBE). Method: A qualitative case study design was employed using the Thinking Aloud Protocol (TAP) to collect data from five Saudi…
Descriptors: Test Anxiety, Vocabulary, Females, Books