ERIC - Search Results

Showing 1 to 15 of 36 results

"It Has to Be the Common Thread": Weaving Attention to Racial Equity and Justice across Lines of Inquiry and Evaluative Criteria

Peer reviewed

Rebecca M. Teasdale; Cherie M. Avent; Ceily L. Moore; María B. Serrano Abreu; Xinru Yan – American Journal of Evaluation, 2025

Evaluators must attend to the destructive forces of racialization and racism to contribute to social transformation. Thus, evaluators are called to center culture, context, equity, and social justice during each step of the evaluation process. Here, we focus on the step(s) in which evaluators define program quality and specify evaluative lines of…

Descriptors: Racism, Evaluation Criteria, Social Justice, Evaluators

Task Design and Rater Effects in Task-Based Language Assessment

Peer reviewed

Stefan O'Grady – TESOL Journal, 2025

Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…

Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales

Employing a Hierarchical Rater Models for Automated Scoring: Scope Review on the Application in Educational Assessment

Peer reviewed

Akif Avcu – Malaysian Online Journal of Educational Technology, 2025

This scope-review presents the milestones of how Hierarchical Rater Models (HRMs) became operable for use in automated essay scoring (AES) to improve instructional evaluation. Although essay evaluations--a useful instrument for evaluating higher-order cognitive abilities--have always depended on human raters, concerns regarding rater bias,…

Descriptors: Automation, Scoring, Models, Educational Assessment

Assessing Penmanship of Chinese Handwriting: A Deep Learning-Based Approach

Peer reviewed

Zebo Xu; Prerit S. Mittal; Mohd. Mohsin Ahmed; Chandranath Adak; Zhenguang G. Cai – Reading and Writing: An Interdisciplinary Journal, 2025

The rise of the digital era has led to a decline in handwriting as the primary mode of communication, resulting in negative effects on handwriting literacy, particularly in complex writing systems such as Chinese. The marginalization of handwriting has contributed to the deterioration of penmanship, defined as the ability to write aesthetically…

Descriptors: Handwriting, Writing Skills, Chinese, Ideography

Toward Racial Equity: Navigating Diverse Conceptualizations and Perspectives in Social Justice-Oriented Evaluation Practice

Peer reviewed

Cherie M. Avent; Rebecca M. Teasdale; Xinru Yan; María B. Serrano-Abreu; Ceily L. Moore – American Journal of Evaluation, 2025

The effects of race can manifest in various ways in evaluation contexts, making it critical for evaluators to unpack how race and racism are "complex and destructive forces" for racially minoritized and Indigenous communities. The clarion calls by evaluators on the need for greater attention to issues of race and racism in evaluation…

Descriptors: Race, Racism, Evaluators, Equal Education

Peer reviewed

Emily F. Gates; Ruoying Li – American Journal of Evaluation, 2025

Amid calls for evaluations to advance equity, there are ongoing debates, varied guidance, and limited empirical research on how evaluators practically attend to equity in their work. This article identifies ethical questions--about the right thing to do when there are multiple options--that arise when evaluators attend to equity and factors that…

Descriptors: Evaluators, Ethics, Attitudes, Expertise

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Group-Based Journal Review: Opportunities for Researcher Development and Enjoyment

Peer reviewed

Eva Heinrich; Geof Hill; Jo-Anne Kelder; Michelle Picard – International Journal for Academic Development, 2025

The availability of expert reviewers, essential for academic publishing, is increasingly under threat, due to workload pressures and lack of development pathways. This inquiry, undertaken by the editors of an emergent higher education journal, draws on reviewers' experiences as articulated in 'reviewer stories' and examines key questions around…

Descriptors: Periodicals, Evaluators, Professional Development, Professional Identity

Sceptics and Champions: Participant Insights on the Use of Partial Randomization to Allocate Research Culture Funding

Peer reviewed

Catherine Davies; Holly Ingram – Research Evaluation, 2025

As part of the shift towards a more equitable research culture, funders are reconsidering traditional approaches to peer review. In doing so, they seek to minimize bias towards certain research ideas and researcher profiles, to ensure greater inclusion of disadvantaged groups, to improve review quality, to reduce burden, and to enable more…

Descriptors: Resource Allocation, Research, Culture, Probability

Planning Missing Data Designs for Human Ratings in Creativity Research: A Practical Guide

Peer reviewed

Boris Forthmann; Benjamin Goecke; Roger E. Beaty – Creativity Research Journal, 2025

Human ratings are ubiquitous in creativity research. Yet, the process of rating responses to creativity tasks -- typically several hundred or thousands of responses, per rater -- is often time-consuming and expensive. Planned missing data designs, where raters only rate a subset of the total number of responses, have been recently proposed as one…

Descriptors: Creativity, Research, Researchers, Research Methodology

ChatGPT as an Automated Essay Scoring Tool in the Writing Classrooms: How It Compares with Human Scoring

Peer reviewed

Ngoc My Bui; Jessie S. Barrot – Education and Information Technologies, 2025

With the generative artificial intelligence (AI) tool's remarkable capabilities in understanding and generating meaningful content, intriguing questions have been raised about its potential as an automated essay scoring (AES) system. One such tool is ChatGPT, which is capable of scoring any written work based on predefined criteria. However,…

Descriptors: Artificial Intelligence, Natural Language Processing, Technology Uses in Education, Automation

One Score, Two Components: Disentangling Appropriateness and Originality in PISA Creative Thinking Judgments Using Generalized Item Response Tree Models

Peer reviewed

Nils Myszkowski; Martin Storme – Journal of Creative Behavior, 2025

In the PISA 2022 creative thinking test, students provide a response to a prompt, which is then coded by human raters as no credit, partial credit, or full credit. Like many large-scale educational testing frameworks, PISA uses the generalized partial credit model (GPCM) as a response model for these ordinal ratings. In this paper, we show that…

Descriptors: Creative Thinking, Creativity Tests, Scores, Prompting

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Grading Explanations of Problem-Solving Process and Generating Feedback Using Large Language Models at Human-Level Accuracy

Peer reviewed

Zhongzhou Chen; Tong Wan – Physical Review Physics Education Research, 2025

This study examines the feasibility and potential advantages of using large language models, in particular GPT-4o, to perform partial credit grading of large numbers of student written responses to introductory level physics problems. Students were instructed to write down verbal explanations of their reasoning process when solving one conceptual…

Descriptors: Grading, Technology Uses in Education, Student Evaluation, Science Education

Developing and Assessing New Curriculum for Missouri's Future Soil Evaluators

Peer reviewed

Joseph Meinert; Kerry Clark; Stephen Anderson – Natural Sciences Education, 2025

One-quarter of households in the state of Missouri are connected to an onsite sewage treatment system. These systems are necessary due to the risks human effluent poses to public health and our states' aquifers and waterways. For these systems to function correctly, a thorough and accurate evaluation of the soils and landforms onsite must be…

Descriptors: Curriculum Development, Soil Science, Evaluators, Sanitation
