ERIC - Search Results

Publication Date

In 2025	10
Since 2024	18
Since 2021 (last 5 years)	49
Since 2016 (last 10 years)	98
Since 2006 (last 20 years)	130

Descriptor

Evaluators	151
Writing Evaluation	151
Second Language Learning	78
English (Second Language)	75
Essays	65
Foreign Countries	57
Scoring	48
Second Language Instruction	44
Comparative Analysis	37
Correlation	36
Interrater Reliability	36
Scores	34
Scoring Rubrics	32
Language Tests	30
Writing Skills	30
Rating Scales	29
Computer Software	27
Evaluation Criteria	27
Computational Linguistics	25
Writing Instruction	25
Undergraduate Students	24
Writing Tests	23
Language Proficiency	21
Writing (Composition)	21
Accuracy	20
More ▼

Publication Type

Reports - Research	130
Journal Articles	125
Tests/Questionnaires	24
Speeches/Meeting Papers	11
Reports - Evaluative	6
Dissertations/Theses -…	5
Reports - Descriptive	4
Guides - Non-Classroom	3
Information Analyses	3
Opinion Papers	2
Reference Materials -…	1
More ▼

Education Level

Higher Education	61
Postsecondary Education	53
Secondary Education	15
Elementary Education	6
High Schools	6
Grade 7	3
Junior High Schools	2
Middle Schools	2
Adult Education	1
Early Childhood Education	1
Elementary Secondary Education	1
Grade 1	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 2	1
Grade 6	1
Primary Education	1
More ▼

Audience

Researchers	2
Practitioners	1
Teachers	1

Location

Turkey	12
China	7
Japan	5
Iran	4
Australia	3
Europe	3
Thailand	3
Indonesia	2
Norway	2
South Korea	2
Belgium	1
California	1
Florida	1
Germany	1
Hawaii	1
Hong Kong	1
Illinois (Urbana)	1
Indiana	1
Kuwait	1
Netherlands	1
New Jersey	1
Nigeria	1
Ohio	1
Pakistan	1
Spain	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

International English…	7
Test of English as a Foreign…	6
Flesch Kincaid Grade Level…	1
Gates MacGinitie Reading Tests	1
General Educational…	1
National Assessment of…	1
New Jersey High School…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 151 results Save | Export

Standard Setting in Academic Writing Assessment through Objective Standard Setting Method

Peer reviewed
PDF on ERIC

Download full text

Fisne, Fatima Nur; Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022

Performance standards have important consequences for all the stakeholders in the assessment of L2 academic writing. These standards not only describe the level of writing performance but also provide a basis for making evaluative decisions on the academic writing. Such a high-stakes role of the performance standards requires the enhancement of…

Descriptors: Standard Setting, Writing Evaluation, Academic Language, English (Second Language)

Grading the Graders: Comparing Generative AI and Human Assessment in Essay Evaluation

Peer reviewed

Direct link

Elizabeth L. Wetzler; Kenneth S. Cassidy; Margaret J. Jones; Chelsea R. Frazier; Nickalous A. Korbut; Chelsea M. Sims; Shari S. Bowen; Michael Wood – Teaching of Psychology, 2025

Background: Generative artificial intelligence (AI) represents a potentially powerful, time-saving tool for grading student essays. However, little is known about how AI-generated essay scores compare to human instructor scores. Objective: The purpose of this study was to compare the essay grading scores produced by AI with those of human…

Descriptors: Essays, Writing Evaluation, Scores, Evaluators

Exploring Difficult-to-Score Essays with a Hyperbolic Cosine Accuracy Model and Coh-Metrix Indices

Peer reviewed

Direct link

Wang, Jue; Engelhard, George; Combs, Trenton – Journal of Experimental Education, 2023

Unfolding models are frequently used to develop scales for measuring attitudes. Recently, unfolding models have been applied to examine rater severity and accuracy within the context of rater-mediated assessments. One of the problems in applying unfolding models to rater-mediated assessments is that the substantive interpretations of the latent…

Descriptors: Writing Evaluation, Scoring, Accuracy, Computational Linguistics

The Effect of Rater Training on Rating Behaviors in Peer Assessment among Secondary School Students

Peer reviewed
PDF on ERIC

Download full text

Nazira Tursynbayeva; Umur Öç; Ismail Karakaya – International Journal of Assessment Tools in Education, 2024

This study aimed to measure the effect of rater training given to improve the peer assessment skills of secondary school students on rater behaviors using the many-facet Rasch Measurement model. The research employed a single-group pretest-posttest design. Since all raters scored all students, the analyses were carried out in a fully crossed (s x…

Descriptors: Evaluators, Training, Behavior, Peer Evaluation

Triangulating Natural Language Processing (NLP)-Based Analysis of Rater Comments and Many-Facet Rasch Measurement (MFRM): An Innovative Approach to Investigating Raters' Application of Rating Scales in Writing Assessment

Peer reviewed

Direct link

Huiying Cai; Xun Yan – Language Testing, 2024

Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…

Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation

Evaluating Quadratic Weighted Kappa as the Standard Performance Metric for Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023

Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…

Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy

Graders of the Future: Comparing the Consistency and Accuracy of GPT4 and Pre-Service Teachers in Physics Essay Question Assessments

Peer reviewed
PDF on ERIC

Download full text

Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025

As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…

Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy

The Whole Is More than the Sum of Its Parts -- Assessing Writing Using the Consensual Assessment Technique

Peer reviewed

Direct link

Zahn, Daniela; Canton, Ursula; Boyd, Victoria; Hamilton, Laura; Mamo, Josianne; McKay, Jane; Proudfoot, Linda; Telfer, Dickson; Williams, Kim; Wilson, Colin – Studies in Higher Education, 2021

Evaluating the impact of Academic Literacies teaching (Lea and Street [1998. "Student Writing in Higher Education: An Academic Literacies Approach." "Studies in Higher Education" 23 (2): 157-72. doi:10.1080/03075079812331380364]) is difficult, as it involves gauging whether writers: (1) gain better understanding of what…

Descriptors: Writing Evaluation, Evaluation Methods, Undergraduate Students, Foreign Countries

Scoring Difficulty in Summary Writing Assessment: Toward the Reconstruction of Analytic Rubric

Peer reviewed
PDF on ERIC

Download full text

Makiko Kato – Journal of Education and Learning, 2025

This study aims to examine whether differences exist in the factors influencing the difficulty of scoring English summaries and determining scores based on the raters' attributes, and to collect candid opinions, considerations, and tentative suggestions for future improvements to the analytic rubric of summary writing for English learners. In this…

Descriptors: Writing Evaluation, Scoring, Writing Skills, English (Second Language)

Assessing the Content Quality of Essays in Content and Language Integrated Learning: Exploring the Construct from Subject Specialists' Perspectives

Peer reviewed

Direct link

Takanori Sato – Language Testing, 2024

Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…

Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests

Supervisor Perspectives on the 'End-Stage' of the Doctoral Examination Process

Peer reviewed

Direct link

Dally, Kerry; Holbrook, Allyson; Lovat, Terence; Fairbairn, Hedy – Higher Education Research and Development, 2022

There has been substantial research on doctoral supervision and examination, yet rarely a focus on what happens at the end-stage of the process when examiner feedback is received and addressed. This article reports survey findings (n = 262) from a study investigating supervisor perceptions about Australian end-stage doctoral examination processes.…

Descriptors: Doctoral Students, Doctoral Dissertations, Writing Evaluation, Supervision

Utilizing Large Language Models for EFL Essay Grading: An Examination of Reliability and Validity in Rubric-Based Assessments

Peer reviewed

Direct link

Fatih Yavuz; Özgür Çelik; Gamze Yavas Çelik – British Journal of Educational Technology, 2025

This study investigates the validity and reliability of generative large language models (LLMs), specifically ChatGPT and Google's Bard, in grading student essays in higher education based on an analytical grading rubric. A total of 15 experienced English as a foreign language (EFL) instructors and two LLMs were asked to evaluate three student…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Computational Linguistics

Assessing Academic Language in Tenth Grade Essays Using Natural Language Processing

Peer reviewed

Direct link

Andrew Potter; Mitchell Shortt; Maria Goldshtein; Rod D. Roscoe – Grantee Submission, 2025

Broadly defined, academic language (AL) is a set of lexical-grammatical norms and registers commonly used in educational and academic discourse. Mastery of academic language in writing is an important aspect of writing instruction and assessment. The purpose of this study was to use Natural Language Processing (NLP) tools to examine the extent to…

Descriptors: Academic Language, Natural Language Processing, Grammar, Vocabulary Skills

Making Each Point Count: Revising a Local Adaptation of the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE Rubric

Peer reviewed

Direct link

Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024

In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…

Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)

Automated Essay Scoring and Revising Based on Open-Source Large Language Models

Peer reviewed

Direct link

Yishen Song; Qianta Zhu; Huaibo Wang; Qinhua Zheng – IEEE Transactions on Learning Technologies, 2024

Manually scoring and revising student essays has long been a time-consuming task for educators. With the rise of natural language processing techniques, automated essay scoring (AES) and automated essay revising (AER) have emerged to alleviate this burden. However, current AES and AER models require large amounts of training data and lack…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

Language Testing	22
Language Testing in Asia	10
Language Assessment Quarterly	6
Grantee Submission	5
ProQuest LLC	5
English Language Teaching	4
International Journal of…	4
ETS Research Report Series	3
Assessing Writing	2
Educational and Psychological…	2
Higher Education Research and…	2
International Educational…	2
Language Awareness	2
PASAA: Journal of Language…	2
Reading and Writing: An…	2
TESOL Quarterly: A Journal…	2
AERA Online Paper Repository	1
Advances in Language and…	1
Applied Linguistics	1
Applied Measurement in…	1
Asia-Pacific Education…	1
Asian Journal of University…	1
Asian-Pacific Journal of…	1
Assessment & Evaluation in…	1
Assessment in Education:…	1
More ▼

McNamara, Danielle S.	4
Crossley, Scott A.	3
Linn, Robert L.	3
Sata, Mehmet	3
Wind, Stefanie A.	3
Wolfe, Edward W.	3
Allen, Laura	2
Allen, Laura K.	2
Attali, Yigal	2
Barati, Hossein	2
Barkaoui, Khaled	2
Crossley, Scott	2
Engelhard, George, Jr.	2
Ghanbari, Nasim	2
Han, Turgay	2
Jølle, Lennart	2
Karakaya, Ismail	2
Kuiken, Folkert	2
Kunnan, Antony John	2
Li, Jiuliang	2
Lim, Gad S.	2
McNamara, Danielle	2
Ruegg, Rachael	2
Vedder, Ineke	2
More ▼