Publication Date
| In 2026 | 0 |
| Since 2025 | 63 |
| Since 2022 (last 5 years) | 329 |
| Since 2017 (last 10 years) | 827 |
| Since 2007 (last 20 years) | 1777 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 86 |
| Practitioners | 63 |
| Administrators | 34 |
| Teachers | 25 |
| Policymakers | 23 |
| Community | 5 |
| Media Staff | 5 |
| Support Staff | 5 |
| Counselors | 2 |
| Parents | 2 |
| Students | 2 |
| More ▼ | |
Location
| Australia | 64 |
| United Kingdom | 59 |
| Canada | 54 |
| China | 40 |
| United States | 39 |
| California | 37 |
| United Kingdom (England) | 36 |
| Texas | 32 |
| Turkey | 28 |
| Japan | 26 |
| Israel | 23 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Kevin C. Haudek; Xiaoming Zhai – International Journal of Artificial Intelligence in Education, 2024
Argumentation, a key scientific practice presented in the "Framework for K-12 Science Education," requires students to construct and critique arguments, but timely evaluation of arguments in large-scale classrooms is challenging. Recent work has shown the potential of automated scoring systems for open response assessments, leveraging…
Descriptors: Accuracy, Persuasive Discourse, Artificial Intelligence, Learning Management Systems
Education Endowment Foundation, 2024
The socioeconomic attainment gap grows as learners progress through the education system, meaning that it is at its widest when they reach the post-16 stage. In 2023, by the end of secondary school, socioeconomically disadvantaged pupils were 18.8 months behind their peers, whilst persistently disadvantaged pupils (those eligible for free school…
Descriptors: Foreign Countries, Educational Finance, Socioeconomic Status, Secondary School Students
Harnar, Michael A.; Hillman, Jeffrey A.; Endres, Cheryl L.; Snow, Juna Z. – American Journal of Evaluation, 2020
The term "meta-evaluation"--referring to the "evaluation of evaluations"--has been in the evaluation lexicon for a half-century. Despite this longevity, research on meta-evaluation is sparse and even more so for internal formative types of meta-evaluation. This exploratory study builds on our understanding of meta-evaluative…
Descriptors: Evaluation Methods, Formative Evaluation, Evaluation Research, Quality Assurance
Garcia, Gabriela L.; Stevahn, Laurie – American Journal of Evaluation, 2020
This article reports research that examined the meaning of two broad evaluator competency domains. The first is "situational awareness" (SA) that focuses on understanding the unique contexts of evaluations and their users/stakeholders. The second is "interpersonal competence" (IC) that focuses on social skills needed for…
Descriptors: Evaluators, Interpersonal Competence, Job Skills, Context Effect
Minott, Mark – Journal of Workplace Learning, 2020
Purpose: The purpose of the self-study is two-fold: first, to aid in redressing the lack of attention given to the professional development i.e., the building of practical or work-related knowledge of examination invigilators and second, to forward the idea that engaging the examination invigilation process reflectively is an effective form of…
Descriptors: Testing, Observation, Evaluators, Professional Development
Coetzee, Philna; du Plessis, Annelize – Industry and Higher Education, 2021
Practising internal auditors, including entry-level internal auditors, need face-to-face soft skills to effectively manage the increased complexity of their profession. Although many studies have highlighted the need for soft skills, none has identified the various categories of face-to-face soft skills required by entry-level internal auditors…
Descriptors: Accounting, Audits (Verification), Evaluators, Entry Workers
Styck, Kara M.; Anthony, Christopher J.; Sandilos, Lia E.; DiPerna, James C. – Child Development, 2021
The Classroom Assessment Scoring System (CLASS; Pianta et al., 2008) is a popular measure of teacher-child interactions. Despite its prominence, CLASS scores have fairly weak relations with various child outcomes (e.g., Zaslow et al., 2010). One potential reason for these findings could be systematic differences in observer severity. As such, the…
Descriptors: Classroom Environment, Teacher Student Relationship, Scores, Correlation
Wang, Jue; Engelhard, George, Jr. – Educational and Psychological Measurement, 2019
The purpose of this study is to explore the use of unfolding models for evaluating the quality of ratings obtained in rater-mediated assessments. Two different judgmental processes can be used to conceptualize ratings: impersonal judgments and personal preferences. Impersonal judgments are typically expected in rater-mediated assessments, and…
Descriptors: Evaluative Thinking, Preferences, Evaluators, Models
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
Fatih Yavuz; Özgür Çelik; Gamze Yavas Çelik – British Journal of Educational Technology, 2025
This study investigates the validity and reliability of generative large language models (LLMs), specifically ChatGPT and Google's Bard, in grading student essays in higher education based on an analytical grading rubric. A total of 15 experienced English as a foreign language (EFL) instructors and two LLMs were asked to evaluate three student…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Computational Linguistics
Andrew Potter; Mitchell Shortt; Maria Goldshtein; Rod D. Roscoe – Grantee Submission, 2025
Broadly defined, academic language (AL) is a set of lexical-grammatical norms and registers commonly used in educational and academic discourse. Mastery of academic language in writing is an important aspect of writing instruction and assessment. The purpose of this study was to use Natural Language Processing (NLP) tools to examine the extent to…
Descriptors: Academic Language, Natural Language Processing, Grammar, Vocabulary Skills
Dally, Kerry; Holbrook, Allyson; Lovat, Terence; Fairbairn, Hedy – Higher Education Research and Development, 2022
There has been substantial research on doctoral supervision and examination, yet rarely a focus on what happens at the end-stage of the process when examiner feedback is received and addressed. This article reports survey findings (n = 262) from a study investigating supervisor perceptions about Australian end-stage doctoral examination processes.…
Descriptors: Doctoral Students, Doctoral Dissertations, Writing Evaluation, Supervision
Marquina, Monica; Gimenez, Graciela; Rodríguez, Wenceslao; Mazzeo, Ignacio – Quality Assurance in Education: An International Perspective, 2022
Purpose: The purpose of this paper is to study how quality assurance (QA) has impacted Argentina's higher education system, how QA tasks are reflected on the organizational structure of institutions, which kind of professional profiles the new QA staff assume and to what extent university life is reconfigured from these changes.…
Descriptors: Foreign Countries, Quality Assurance, Universities, Educational Quality
Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022
In many performance assessments, one or two raters from the complete rater pool scores each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…
Descriptors: Evaluators, Bias, Identification, Performance Based Assessment
Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024
In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…
Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)

Peer reviewed
Direct link
