ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	13
Since 2016 (last 10 years)	31
Since 2006 (last 20 years)	58

Descriptor

Evaluators	59
Scoring	59
Second Language Learning	59
English (Second Language)	46
Language Tests	41
Foreign Countries	25
Computer Assisted Testing	20
Second Language Instruction	19
Comparative Analysis	18
Correlation	18
Writing Evaluation	18
Essays	17
Scores	17
Interrater Reliability	15
Oral Language	14
Language Proficiency	13
Native Language	13
Computer Software	11
Rating Scales	11
Speech Communication	11
Decision Making	10
Evaluation Criteria	10
College Students	9
Accuracy	8
Evaluation Methods	8
More ▼

Publication Type

Journal Articles	53
Reports - Research	49
Tests/Questionnaires	11
Dissertations/Theses -…	4
Information Analyses	4
Reports - Descriptive	2
Reports - Evaluative	2
Speeches/Meeting Papers	1

Education Level

Higher Education	20
Postsecondary Education	16
High Schools	2
Secondary Education	2
Adult Education	1

Audience

Location

China	7
Japan	4
India	3
Iran	3
Europe	2
South Korea	2
Turkey	2
Colombia	1
Germany	1
Japan (Tokyo)	1
Michigan	1
Switzerland	1
United States	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	20
International English…	5
Test of English for…	2
ACTFL Oral Proficiency…	1
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing 1 to 15 of 59 results Save | Export

Non-Expert Raters' Scoring Behavior and Cognition in Assessing Pragmatic Production in L2 Chinese

Peer reviewed

Direct link

Shuai Li; Xian Li; Yali Feng; Ting Wen – Educational Linguistics, 2023

This chapter reports on a study investigating non-expert raters' scoring behavior and cognitive processes involved in evaluating speech acts and pragmatic routines in L2 Chinese. Pragmatic production data were collected from 51 American learners of Chinese, who completed a 12-item oral Discourse Completion Test (DCT). The learners were divided…

Descriptors: Scoring, Cognitive Processes, Speech Acts, Pragmatics

Scoring Difficulty in Summary Writing Assessment: Toward the Reconstruction of Analytic Rubric

Peer reviewed
PDF on ERIC

Download full text

Makiko Kato – Journal of Education and Learning, 2025

This study aims to examine whether differences exist in the factors influencing the difficulty of scoring English summaries and determining scores based on the raters' attributes, and to collect candid opinions, considerations, and tentative suggestions for future improvements to the analytic rubric of summary writing for English learners. In this…

Descriptors: Writing Evaluation, Scoring, Writing Skills, English (Second Language)

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Direct link

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

Raters' Perceptions of Rating Scales Criteria and Its Effect on the Process and Outcome of Their Rating

Peer reviewed

Direct link

Heidari, Nasim; Ghanbari, Nasim; Abbasi, Abbas – Language Testing in Asia, 2022

It is widely believed that human rating performance is influenced by an array of different factors. Among these, rater-related variables such as experience, language background, perceptions, and attitudes have been mentioned. One of the important rater-related factors is the way the raters interact with the rating scales. In particular, how raters…

Descriptors: Evaluators, Rating Scales, Language Tests, English (Second Language)

Temporal Fluency and Floor/Ceiling Scoring of Intermediate and Advanced Speech on the ACTFL Spanish Oral Proficiency Interview--Computer

Peer reviewed

Direct link

Cox, Troy L.; Brown, Alan V.; Thompson, Gregory L. – Language Testing, 2023

The rating of proficiency tests that use the Inter-agency Roundtable (ILR) and American Council on the Teaching of Foreign Languages (ACTFL) guidelines claims that each major level is based on hierarchal linguistic functions that require mastery of multidimensional traits in such a way that each level subsumes the levels beneath it. These…

Descriptors: Oral Language, Language Fluency, Scoring, Cues

Applying Cognitive Theory to the Human Essay Rating Process

Peer reviewed

Direct link

Finn, Bridgid; Arslan, Burcu; Walsh, Matthew – Applied Measurement in Education, 2020

To score an essay response, raters draw on previously trained skills and knowledge about the underlying rubric and score criterion. Cognitive processes such as remembering, forgetting, and skill decay likely influence rater performance. To investigate how forgetting influences scoring, we evaluated raters' scoring accuracy on TOEFL and GRE essays.…

Descriptors: Epistemology, Essay Tests, Evaluators, Cognitive Processes

Comparing Holistic and Analytic Marking Methods in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Direct link

Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023

This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…

Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese

An Investigation of the Impact of Jagged Profile on L2 Speaking Test Ratings: Evidence from Rating and Eye-Tracking Data

Peer reviewed

Direct link

Ma, Wenyue; Winke, Paula – Language Assessment Quarterly, 2022

The factors that influence rater scoring have been a subject of great interest to researchers in second language assessment. However, the research on the impact of test-takers' speech profiles (e.g., a jagged or a flat profile reflecting analytic subscores) on raters' scoring behaviors remains to be seen. To investigate the role of speech profiles…

Descriptors: Language Tests, Second Language Learning, Speech Communication, Profiles

Can Automated Machine Translation Evaluation Metrics Be Used to Assess Students' Interpretation in the Language Learning Classroom?

Peer reviewed

Direct link

Han, Chao; Lu, Xiaolei – Computer Assisted Language Learning, 2023

The use of translation and interpreting (T&I) in the language learning classroom is commonplace, serving various pedagogical and assessment purposes. Previous utilization of T&I exercises is driven largely by their potential to enhance language learning, whereas the latest trend has begun to underscore T&I as a crucial skill to be…

Descriptors: Translation, Computational Linguistics, Correlation, Language Processing

Assessing L2 English Speaking Using Automated Scoring Technology: Examining Automarker Reliability

Peer reviewed

Direct link

Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021

Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software

Effects of Second Language Pronunciation Teaching Revisited: A Proposed Measurement Framework and Meta-Analysis

Peer reviewed

Direct link

Saito, Kazuya; Plonsky, Luke – Language Learning, 2019

We propose a new framework for conceptualizing measures of instructed second language (L2) pronunciation performance according to three sets of parameters: (a) the constructs (focused on global vs. specific aspects of pronunciation), (b) the scoring method (human raters vs. acoustic analyses), and (c) the type of knowledge elicited (controlled vs.…

Descriptors: Second Language Learning, Second Language Instruction, Scoring, Pronunciation Instruction

The Processes of Rating L2 Speaking Performance Using an Analytic Rating Scale -- A Qualitative Exploration

Peer reviewed
PDF on ERIC

Download full text

Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022

In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…

Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Direct link

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

The Use of Semantic Similarity Tools in Automated Content Scoring of Fact-Based Essays Written by EFL Learners

Peer reviewed

Direct link

Wang, Qiao – Education and Information Technologies, 2022

This study searched for open-source semantic similarity tools and evaluated their effectiveness in automated content scoring of fact-based essays written by English-as-a-Foreign-Language (EFL) learners. Fifty writing samples under a fact-based writing task from an academic English course in a Japanese university were collected and a gold standard…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Language Testing	15
ETS Research Report Series	9
ProQuest LLC	4
Language Assessment Quarterly	3
Language Testing in Asia	3
Language Learning	2
Applied Measurement in…	1
Assessment in Education:…	1
CALICO Journal	1
Computer Assisted Language…	1
Education and Information…	1
Educational Linguistics	1
English Language Teaching	1
English Teaching	1
Grantee Submission	1
Innovation in Language…	1
JALT CALL Journal	1
Journal of Education and…	1
Journal of Pan-Pacific…	1
Language Education &…	1
Language Teaching Research	1
PASAA: Journal of Language…	1
Research-publishing.net	1
SAGE Open	1
TESL-EJ	1
More ▼

Xi, Xiaoming	4
Barkaoui, Khaled	2
Bridgeman, Brent	2
Han, Chao	2
Mollaun, Pam	2
Mollaun, Pamela	2
Winke, Paula	2
Abbasi, Abbas	1
Ahmadi Shirazi, Masoumeh	1
Ahmadi, Alireza	1
Allen, Laura K.	1
Arslan, Burcu	1
Attali, Yigal	1
Bejar, Isaac I.	1
Blanchard, Daniel	1
Breyer, F. Jay	1
Brooks, Rachel Lunde	1
Brown, Alan V.	1
Brown, Anne	1
Cahill, Aoife	1
Carey, Michael D.	1
Casabianca, Jodi M.	1
Chodorow, Martin	1
Cox, Troy L.	1
Crossley, Scott A.	1
More ▼