ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	24
Since 2006 (last 20 years)	34

Descriptor

Correlation	36
Evaluators	36
Writing Evaluation	36
Second Language Learning	22
English (Second Language)	18
Essays	15
Scoring	14
Foreign Countries	13
Statistical Analysis	13
Comparative Analysis	12
Interrater Reliability	12
Scores	10
Computational Linguistics	9
Computer Assisted Testing	9
Computer Software	9
Language Tests	9
Scoring Rubrics	9
Second Language Instruction	9
Undergraduate Students	9
Accuracy	7
Evaluation Criteria	7
Writing Skills	7
Language Proficiency	6
Writing (Composition)	6
Writing Tests	6
More ▼

Publication Type

Journal Articles	32
Reports - Research	32
Tests/Questionnaires	6
Reports - Evaluative	3
Information Analyses	2
Speeches/Meeting Papers	2
Dissertations/Theses -…	1

Education Level

Higher Education	15
Postsecondary Education	13
Secondary Education	2
Grade 11	1
Grade 6	1
Grade 7	1
High Schools	1

Audience

Location

China	2
Turkey	2
Australia	1
Belgium	1
California	1
Europe	1
Hong Kong	1
Indonesia	1
Japan	1
Nigeria	1
Ohio	1
United Kingdom	1
Vietnam	1
Yemen	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

International English…	3
Test of English as a Foreign…	2
Flesch Kincaid Grade Level…	1
Gates MacGinitie Reading Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 36 results Save | Export

Graders of the Future: Comparing the Consistency and Accuracy of GPT4 and Pre-Service Teachers in Physics Essay Question Assessments

Peer reviewed
PDF on ERIC

Download full text

Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025

As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…

Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy

The Whole Is More than the Sum of Its Parts -- Assessing Writing Using the Consensual Assessment Technique

Peer reviewed

Direct link

Zahn, Daniela; Canton, Ursula; Boyd, Victoria; Hamilton, Laura; Mamo, Josianne; McKay, Jane; Proudfoot, Linda; Telfer, Dickson; Williams, Kim; Wilson, Colin – Studies in Higher Education, 2021

Evaluating the impact of Academic Literacies teaching (Lea and Street [1998. "Student Writing in Higher Education: An Academic Literacies Approach." "Studies in Higher Education" 23 (2): 157-72. doi:10.1080/03075079812331380364]) is difficult, as it involves gauging whether writers: (1) gain better understanding of what…

Descriptors: Writing Evaluation, Evaluation Methods, Undergraduate Students, Foreign Countries

Development and Validation of a Rating Scale for Summarization as an Integrated Task

Peer reviewed

Direct link

Li, Jiuliang; Wang, Qian – Asian-Pacific Journal of Second and Foreign Language Education, 2021

Summary writing is essential for academic success, and has attracted renewed interest in academic research and large-scale language test. However, less attention has been paid to the development and evaluation of the scoring scales of summary writing. This study reports on the validation of a summary rubric that represented an approach to scale…

Descriptors: Validity, Rating Scales, Writing Skills, Writing Evaluation

The Intersection of AI and Language Assessment: A Study on the Reliability of ChatGPT in Grading IELTS Writing Task 2

Peer reviewed
PDF on ERIC

Download full text

Osama Koraishi – Language Teaching Research Quarterly, 2024

This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

The Focus, Function and Framing of Feedback Information: Linguistic and Content Analysis of In-Text Feedback Comments

Peer reviewed

Direct link

Derham, Cathrine; Balloo, Kieran; Winstone, Naomi – Assessment & Evaluation in Higher Education, 2022

In-text comments, in the form of annotations on students' work, are a form of feedback information that should guide students to take action. Both the focus of the in-text comments, and the ways in which they are linguistically communicated, have potential to impact upon the way in which they are perceived by students. This study reports on an…

Descriptors: Feedback (Response), Content Analysis, Essays, Summative Evaluation

Assessing Second-Language Academic Writing: AI vs. Human Raters

Peer reviewed
PDF on ERIC

Download full text

Vasfiye Geçkin; Ebru Kiziltas; Çagatay Çinar – Journal of Educational Technology and Online Learning, 2023

The quality of writing in a second language (L2) is one of the indicators of the level of proficiency for many college students to be eligible for departmental studies. Although certain software programs, such as Intelligent Essay Assessor or IntelliMetric, have been introduced to evaluate second-language writing quality, an overall assessment of…

Descriptors: Writing Evaluation, Second Language Learning, Second Language Instruction, Language Proficiency

Using Latent Semantic Analysis to Score Short Answer Constructed Responses: Automated Scoring of the Consequences Test

Peer reviewed

Direct link

LaVoie, Noelle; Parker, James; Legree, Peter J.; Ardison, Sharon; Kilcullen, Robert N. – Educational and Psychological Measurement, 2020

Automated scoring based on Latent Semantic Analysis (LSA) has been successfully used to score essays and constrained short answer responses. Scoring tests that capture open-ended, short answer responses poses some challenges for machine learning approaches. We used LSA techniques to score short answer responses to the Consequences Test, a measure…

Descriptors: Semantics, Evaluators, Essays, Scoring

The Influence of Rater Effects in Training Sets on the Psychometric Quality of Automated Scoring for Writing Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018

Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…

Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring

'Remark or Retake'? A Study of Candidate Performance in IELTS and Perceptions towards Test Failure

Peer reviewed

Direct link

Pearson, William S. – Language Testing in Asia, 2019

It is becoming increasingly important for individuals for whom English is a second language to demonstrate their linguistic credentials for academic, work and employment purposes. One option is to undertake International English Language Testing System (IELTS), which involves attempting to meet the linguistic entrance criteria set by a gatekeeping…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Cutting Scores

The Use of Semantic Similarity Tools in Automated Content Scoring of Fact-Based Essays Written by EFL Learners

Peer reviewed

Direct link

Wang, Qiao – Education and Information Technologies, 2022

This study searched for open-source semantic similarity tools and evaluated their effectiveness in automated content scoring of fact-based essays written by English-as-a-Foreign-Language (EFL) learners. Fifty writing samples under a fact-based writing task from an academic English course in a Japanese university were collected and a gold standard…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring

Exploring the Use of ESL Composition Profile for College Writing in the Indonesian Context

Peer reviewed
PDF on ERIC

Download full text

Setyowati, Lestari; Sukmawan, Sony; El-Sulukiyyah, Ana Ahsana – International Journal of Language Education, 2020

Assessing writing is a demanding task. If a lecturer of writing is not prepared with a reliable scoring rubric, the students' real performance might not be known. One of the well-known English as a second language (ESL) writing rubric is the Jacobs ESL Composition Profile which was developed by Jacobs, Zingraf, Wormuth, Hartfiel, & Hughey in…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Writing Evaluation

Grading Emails and Generating Feedback

Peer reviewed
PDF on ERIC

Download full text

Unnam, Abhishek; Takhar, Rohit; Aggarwal, Varun – International Educational Data Mining Society, 2019

Email has become the most preferred form of business communication. Writing "good" email has become an essential skill required in the industry. "Good" email writing not only facilitates clear communication, but also makes a positive impression on the recipient, whether it be one's colleague or a customer. The aim of this paper…

Descriptors: Grading, Electronic Mail, Feedback (Response), Written Language

The Impact of Rater Variability on Relationships among Different Effect-Size Indices for Inter-Rater Agreement between Human and Automated Essay Scoring

Direct link

Yun, Jiyeo – ProQuest LLC, 2017

Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…

Descriptors: Interrater Reliability, Essays, Scoring, Evaluators

Effects of Analytical and Holistic Scoring Patterns on Scorer Reliability in Biology Essay Tests

Peer reviewed
PDF on ERIC

Download full text

Ebuoh, Casmir N. – World Journal of Education, 2018

Literature revealed that the patterns/methods of scoring essay tests had been criticized for not being reliable and this unreliability is more likely to be more in internal examinations than in the external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…

Descriptors: Holistic Approach, Scoring, Essay Tests, Biology

Previous Page | Next Page »

Pages: 1 | 2 | 3

Language Testing	4
Language Assessment Quarterly	3
ETS Research Report Series	2
Advances in Language and…	1
Applied Measurement in…	1
Asian-Pacific Journal of…	1
Assessment & Evaluation in…	1
CALICO Journal	1
Education and Information…	1
Educational Psychology	1
Educational Research and…	1
Educational Sciences: Theory…	1
Educational and Psychological…	1
English Language Teaching	1
English Teaching	1
Grantee Submission	1
Higher Education Research and…	1
International Educational…	1
International Journal of…	1
International Journal of…	1
Journal of Baltic Science…	1
Journal of Educational…	1
Journal of Effective Teaching	1
Language Learning & Technology	1
Language Teaching Research…	1
More ▼

Crossley, Scott A.	2
Kuiken, Folkert	2
Kunnan, Antony John	2
Linn, Robert L.	2
McNamara, Danielle S.	2
Vedder, Ineke	2
Aggarwal, Varun	1
Al-Hattami, Abdulghani A.	1
Ardison, Sharon	1
Ari, Gokhan	1
Aryadoust, Vahid	1
Attali, Yigal	1
Balloo, Kieran	1
Berger, Cynthia M.	1
Beyreli, Latif	1
Boyd, Victoria	1
Bridgeman, Brent	1
Canton, Ursula	1
Coniam, David	1
Davey, Tim	1
Derham, Cathrine	1
Ebru Kiziltas	1
Ebuoh, Casmir N.	1
Eckes, Thomas	1
More ▼