ERIC - Search Results

Publication Date

In 2025	4
Since 2024	7
Since 2021 (last 5 years)	25
Since 2016 (last 10 years)	58
Since 2006 (last 20 years)	77

Descriptor

Comparative Analysis	79
Evaluators	79
Foreign Countries	79
Second Language Learning	44
English (Second Language)	42
Second Language Instruction	27
Language Tests	20
Scores	20
Correlation	18
Teaching Methods	18
Undergraduate Students	17
Writing Evaluation	16
Accuracy	15
Evaluation Methods	15
Essays	14
Statistical Analysis	14
Student Evaluation	14
Computer Software	13
College Students	12
Pronunciation	12
Scoring	12
Interrater Reliability	11
Language Proficiency	11
Rating Scales	11
Computer Assisted Testing	10

Publication Type

Journal Articles	75
Reports - Research	71
Tests/Questionnaires	13
Reports - Evaluative	8
Information Analyses	2
Speeches/Meeting Papers	2

Education Level

Higher Education	46
Postsecondary Education	42
Secondary Education	8
Elementary Secondary Education	4
High Schools	2
Early Childhood Education	1
Grade 11	1
Grade 12	1
Kindergarten	1
Preschool Education	1
Primary Education	1

Location

China	11
Canada	5
Turkey	5
United Kingdom	5
Europe	4
India	4
Iran	4
Japan	4
Australia	3
Hong Kong	3
Netherlands	3
United Kingdom (England)	3
United States	3
Denmark	2
Germany	2
Singapore	2
Spain	2
Thailand	2
Afghanistan	1
Argentina	1
Austria	1
Belgium	1
Bosnia and Herzegovina…	1
Brazil	1
Canada (Montreal)	1

Assessments and Surveys

International English…	2
Test of English as a Foreign…	2
Rosenberg Self Esteem Scale	1
Test of English for…	1
Trends in International…	1

Showing 1 to 15 of 79 results

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Graders of the Future: Comparing the Consistency and Accuracy of GPT4 and Pre-Service Teachers in Physics Essay Question Assessments

Peer reviewed

Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025

As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…

Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Judges' Views on Pairwise Comparative Judgement and Rank Ordering as Alternatives to Analytical Essay Marking

Walland, Emma – Research Matters, 2022

In this article, I report on examiners' views and experiences of using Pairwise Comparative Judgement (PCJ) and Rank Ordering (RO) as alternatives to traditional analytical marking for GCSE English Language essays. Fifteen GCSE English Language examiners took part in the study. After each had judged 100 pairs of essays using PCJ and eight packs of…

Descriptors: Essays, Grading, Writing Evaluation, Evaluators

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

Comparison of Traditional Essay Questions versus Case Based Modified Essay Questions in Biochemistry

Peer reviewed

Bansal, Aastha; Dubey, Abhishek; Singh, Vijay Kumar; Goswami, Binita; Kaushik, Smita – Biochemistry and Molecular Biology Education, 2023

Adult learning involves the analysis and synthesis of knowledge to become competent, which cannot be assessed only by traditional assessment tool and didactic learning methods. Stimulation of higher domains of cognitive learning needs to be inculcated to reach a better understanding of the subject rather than traditional assessment tools that…

Descriptors: Biochemistry, Science Instruction, Alternative Assessment, Microbiology

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Interlanguage Phonology and Accentedness: An Experimental Study of Thai Final Nasal Consonants in Chinese Students Learning Thai

Peer reviewed

Hou, Peng; Kraisame, Sarawut – rEFLections, 2023

This paper provides an experimental study of interlanguage phonological characteristics of Chinese students learning Thai as a foreign language and the accentedness perceived by native Thai speakers. Both production and perception experiments were designed to see how Chinese students acoustically produced Thai final nasal consonants and how Thai…

Descriptors: Phonology, Pronunciation, Thai, Second Language Learning

Assessing Second-Language Academic Writing: AI vs. Human Raters

Peer reviewed

Vasfiye Geçkin; Ebru Kiziltas; Çagatay Çinar – Journal of Educational Technology and Online Learning, 2023

The quality of writing in a second language (L2) is one of the indicators of the level of proficiency for many college students to be eligible for departmental studies. Although certain software programs, such as Intelligent Essay Assessor or IntelliMetric, have been introduced to evaluate second-language writing quality, an overall assessment of…

Descriptors: Writing Evaluation, Second Language Learning, Second Language Instruction, Language Proficiency

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

Effect of Racial Bias on Composite Construction

Peer reviewed

Bhardwaj, Kavya; Hole, Graham – Applied Cognitive Psychology, 2020

We investigated how prior bias about a face's racial characteristics can affect its encoding and resultant facial composite construction. In total, 61 participants (24 Europeans, 18 Indians living in India and 19 Indians living in Europe) saw a racially ambiguous unfamiliar face and were led to believe it was either European or Indian. They…

Descriptors: Racial Bias, Indians, Human Body, Race

Moderation of Non-Exam Assessments: Is Comparative Judgement a Practical Alternative?

Vidal Rodeiro, Carmen; Chambers, Lucy – Research Matters, 2022

Many high-stakes qualifications include non-exam assessments that are marked by teachers. Awarding bodies then apply a moderation process to bring the marking of these assessments to an agreed standard. Comparative Judgement (CJ) is a technique where two (or more) pieces of work are compared at a time, allowing an overall rank order of work to be…

Descriptors: Evaluation Methods, Portfolios (Background Materials), Decision Making, Task Analysis

Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill

Peer reviewed

Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022

In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…

Descriptors: Evaluators, Training, Comparative Analysis, Academic Language

Assumptions of Speaker Ethnicity and the Effect on Ratings of Accentedness, Comprehensibility, and Intelligibility

Peer reviewed

Lee, Bradford J.; Bailey, Justin L. – Language Awareness, 2023

While listeners tend to downgrade speakers' accent and comprehensibility when they perceive them to be from a different language community--a process known as reverse linguistic stereotyping (RLS)--research has generally relied solely on quantitative data such as Likert scale ratings. The current study sought to extend the analysis further by…

Descriptors: Likert Scales, Stereotypes, Ethnicity, Intelligibility

Source

Language Assessment Quarterly	4
Language Testing	4
Language Testing in Asia	3
Research Matters	3
TESOL Quarterly: A Journal…	3
Advances in Physiology…	2
American Journal of Evaluation	2
English Language Teaching	2
International Journal of…	2
Interpreter and Translator…	2
Language and Education	2
rEFLections	2
Advances in Language and…	1
Applied Cognitive Psychology	1
Asia-Pacific Education…	1
Assessment in Education:…	1
Behaviour & Information…	1
Biochemistry and Molecular…	1
Bulgarian Comparative…	1
CALICO Journal	1
Cambridge Assessment	1
Clinical Linguistics &…	1
Computer Assisted Language…	1
Creativity Research Journal	1
ETS Research Report Series	1

Author

Coniam, David	2
Trofimovich, Pavel	2
Abdul Gafoor, K.	1
Ahmadi, Alireza	1
Ahmet Can Uyar	1
Amanda Huee-Ping Wong	1
Baidak, Nathalie	1
Bailey, Justin L.	1
Bansal, Aastha	1
Bhardwaj, Kavya	1
Bosch, Emma	1
Bradic, Lejla	1
Brannen, Kathleen	1
Breidahl, Karen N.	1
Breyer, F. Jay	1
Buckingham, Louisa	1
Burke, Rachel	1
Burset, Silvia	1
Cardoso, Walcir	1
Chambers, Lucy	1
Chun, Dorothy	1
Coleman, Tori	1
Cots, Josep M.	1
Crisp, Victoria	1
Dai, David Wei	1