ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	17

Descriptor

English (Second Language)	25
Scoring	25
Language Tests	24
Second Language Learning	19
Interrater Reliability	16
Computer Assisted Testing	11
Evaluators	9
Foreign Countries	9
Scores	9
Correlation	8
Test Reliability	8
Writing Tests	8
Test Validity	7
Language Proficiency	6
Comparative Analysis	5
Essay Tests	5
Essays	5
Writing Evaluation	5
Automation	4
College Entrance Examinations	4
Computer Software	4
Oral Language	4
Prompting	4
Reliability	4
Accuracy	3
More ▼

Source

ETS Research Report Series	9
Educational Testing Service	2
Applied Linguistics	1
Assessment in Education:…	1
College Entrance Examination…	1
Education and Information…	1
JALT CALL Journal	1
Journal of Pan-Pacific…	1
ProQuest LLC	1
SAGE Open	1
Taiwan Journal of TESOL	1
More ▼

Publication Type

Reports - Research	19
Journal Articles	16
Tests/Questionnaires	6
Speeches/Meeting Papers	3
Reports - Descriptive	2
Reports - Evaluative	2
Dissertations/Theses -…	1
Numerical/Quantitative Data	1

Education Level

Higher Education	5
Postsecondary Education	5
High Schools	2
Secondary Education	2
Elementary Education	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Researchers

Location

Iran	3
Germany	2
India	2
Mexico	2
Australia	1
Canada	1
China	1
Colombia	1
Hong Kong	1
Japan	1
Japan (Tokyo)	1
Jordan	1
North America	1
South Korea	1
Switzerland	1
Taiwan	1
Turkey	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	25
Graduate Record Examinations	3
Graduate Management Admission…	2
International English…	2
Computer Attitude Scale	1
Law School Admission Test	1
Medical College Admission Test	1
SAT (College Admission Test)	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Complementary Strengths? Evaluation of a Hybrid Human-Machine Scoring Approach for a Test of Oral Academic English

Peer reviewed

Direct link

Davis, Larry; Papageorgiou, Spiros – Assessment in Education: Principles, Policy & Practice, 2021

Human raters and machine scoring systems potentially have complementary strengths in evaluating language ability; specifically, it has been suggested that automated systems might be used to make consistent measurements of specific linguistic phenomena, whilst humans evaluate more global aspects of performance. We report on an empirical study that…

Descriptors: Scoring, English for Academic Purposes, Oral English, Speech Tests

Rater Dominance in Discussion as a Resolution Method

Peer reviewed
PDF on ERIC

Download full text

Ahmadi, Alireza – Taiwan Journal of TESOL, 2020

Rater subjectivity has long been an intriguing topic. The use of discussion as a resolution method is a practical way to reduce this subjectivity. However, the efficacy of discussion depends on whether different raters get equally engaged in it or one rater tends to dominate others. This study investigated whether and how rater dominance occurs in…

Descriptors: Evaluators, Interrater Reliability, Discussion, Discourse Analysis

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed
PDF on ERIC

Download full text

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Direct link

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Download full text

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

For a Greater Good: Bias Analysis in Writing Assessment

Peer reviewed

Direct link

Ahmadi Shirazi, Masoumeh – SAGE Open, 2019

Threats to construct validity should be reduced to a minimum. If true, sources of bias, namely raters, items, tests as well as gender, age, race, language background, culture, and socio-economic status need to be spotted and removed. This study investigates raters' experience, language background, and the choice of essay prompt as potential…

Descriptors: Foreign Countries, Language Tests, Test Bias, Essay Tests

Use of Automated Scoring in Spoken Language Assessments for Test Takers with Speech Impairments. Research Report. ETS RR-17-42

Peer reviewed
PDF on ERIC

Download full text

Loukina, Anastassia; Buzick, Heather – ETS Research Report Series, 2017

This study is an evaluation of the performance of automated speech scoring for speakers with documented or suspected speech impairments. Given that the use of automated scoring of open-ended spoken responses is relatively nascent and there is little research to date that includes test takers with disabilities, this small exploratory study focuses…

Descriptors: Automation, Scoring, Language Tests, Speech Tests

Assessment Behavior and Perceptions of Raters in Paired and Group Oral Interaction

Peer reviewed
PDF on ERIC

Download full text

Negishi, Junko – Journal of Pan-Pacific Association of Applied Linguistics, 2015

The study considers the assessment of L2 English learners by trained raters in paired and group oral assessments in comparison to an individual, monologue assessment, to determine 1) the degree to which raters assign pairs/groups shared (the same) scores and the degree to which raters give individual members of pairs/groups higher or lower as…

Descriptors: Evaluators, English (Second Language), Second Language Learning, Scores

A Comparative Analysis of Face to Face Instruction vs. Telegram Mobile Instruction in Terms of Narrative Writing

Peer reviewed
PDF on ERIC

Download full text

Heidari, Jamshid; Khodabandeh, Farzaneh; Soleimani, Hassan – JALT CALL Journal, 2018

The emergence of computer technology in English language teaching has paved the way for teachers' application of Mobile Assisted Language Learning (mall) and its advantages in teaching. This study aimed to compare the effectiveness of the face to face instruction with Telegram mobile instruction. Based on a toefl test, 60 English foreign language…

Descriptors: Comparative Analysis, Conventional Instruction, Teaching Methods, Computer Assisted Instruction

Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

Download full text

Haberman, Shelby J. – Educational Testing Service, 2011

Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

Descriptors: Writing Tests, Scoring, Essays, Language Tests

Rater Expertise in a Second Language Speaking Assessment: The Influence of Training and Experience

Direct link

Davis, Lawrence Edward – ProQuest LLC, 2012

Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…

Descriptors: Evaluators, Expertise, Scores, Second Language Learning

Toward Automated Multi-Trait Scoring of Essays: Investigating Links among Holistic, Analytic, and Text Feature Scores

Peer reviewed

Direct link

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – Applied Linguistics, 2010

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multi-trait) rating dimensions and their relationships to holistic scores and "e-rater"[R] essay feature variables in the context of the TOEFL[R] computer-based test (TOEFL CBT) writing assessment. Data analyzed in the study were holistic…

Descriptors: Writing Evaluation, Writing Tests, Scoring, Essays

How Do Raters from India Perform in Scoring the TOEFL iBT[TM] Speaking Section and What Kind of Training Helps? TOEFL iBT[TM] Research Report. RR-09-31

Download full text

Xi, Xiaoming; Mollaun, Pam – Educational Testing Service, 2009

This study investigated the scoring of the Test of English as a Foreign Language[TM] Internet-based Test (TOEFL iBT[TM]) Speaking section by bilingual or multilingual speakers of English and 1 or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the Speaking section for…

Descriptors: Foreign Countries, English (Second Language), Internet, Language Tests

Analytic Scoring of TOEFL® CBT Essays: Scores from Humans and "E-rater"®. TOEFL® Research Reports. RR-81. ETS RR-08-01

Peer reviewed
PDF on ERIC

Download full text

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – ETS Research Report Series, 2008

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multitrait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (CBT) writing assessment. Data analyzed in the study were analytic and holistic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scoring

Relationships between Direct and Indirect Measures of Writing Ability.

Download full text

Carlson, Sybil B.; Camp, Roberta – 1985

This paper reports on Educational Testing Service research studies investigating the parameters critical to reliability and validity in both the direct and indirect writing ability assessment of higher education applicants. The studies involved: (1) formulating an operational definition of writing competence; (2) designing and pretesting writing…

Descriptors: College Entrance Examinations, Computer Assisted Testing, English (Second Language), Essay Tests

Previous Page | Next Page »

Pages: 1 | 2

Lee, Yong-Won	4
Kantor, Robert	3
Attali, Yigal	2
Carlson, Sybil B.	2
Davis, Larry	2
Gentile, Claudia	2
Mollaun, Pam	2
Xi, Xiaoming	2
Ahmadi Shirazi, Masoumeh	1
Ahmadi, Alireza	1
Bejar, Isaac I.	1
Breland, Hunter M.	1
Bridgeman, Brent	1
Burstein, Jill	1
Buzick, Heather	1
Camp, Roberta	1
Casabianca, Jodi M.	1
Davis, Lawrence Edward	1
Fowles, Mary E.	1
Haberman, Shelby J.	1
Heidari, Jamshid	1
Hemat, Ramin	1
Henning, Grant	1
Keller, Stefan	1
More ▼