ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	11

Descriptor

Evaluators	13
Interrater Reliability	13
English (Second Language)	12
Language Tests	12
Second Language Learning	11
Computer Assisted Testing	8
Oral Language	7
Scoring	7
Language Proficiency	6
Scores	6
Foreign Countries	5
Scoring Rubrics	3
Statistical Analysis	3
Comparative Analysis	2
Context Effect	2
Correlation	2
Cues	2
Essay Tests	2
Essays	2
Language Teachers	2
Language Usage	2
Multiple Regression Analysis	2
Multivariate Analysis	2
Novices	2
Prompting	2

Source

ETS Research Report Series	4
Language Testing	2
JALT CALL Journal	1
Journal of Pan-Pacific…	1
Language Assessment Quarterly	1
SAGE Open	1
Taiwan Journal of TESOL	1

Publication Type

Reports - Research	13
Journal Articles	11
Tests/Questionnaires	2
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Postsecondary Education	3
High Schools	1
Secondary Education	1

Location

Iran	3
Germany	1
Japan (Tokyo)	1
Switzerland	1
United States	1

Assessments and Surveys

Test of English as a Foreign…	13
International English…	2
Test of English for…	1

Showing all 13 results

Rater Dominance in Discussion as a Resolution Method

Peer reviewed

Ahmadi, Alireza – Taiwan Journal of TESOL, 2020

Rater subjectivity has long been an intriguing topic. The use of discussion as a resolution method is a practical way to reduce this subjectivity. However, the efficacy of discussion depends on whether different raters get equally engaged in it or one rater tends to dominate others. This study investigated whether and how rater dominance occurs in…

Descriptors: Evaluators, Interrater Reliability, Discussion, Discourse Analysis

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

For a Greater Good: Bias Analysis in Writing Assessment

Peer reviewed

Ahmadi Shirazi, Masoumeh – SAGE Open, 2019

Threats to construct validity should be reduced to a minimum. If true, sources of bias, namely raters, items, tests as well as gender, age, race, language background, culture, and socio-economic status need to be spotted and removed. This study investigates raters' experience, language background, and the choice of essay prompt as potential…

Descriptors: Foreign Countries, Language Tests, Test Bias, Essay Tests

The Effect of Training and Rater Differences on Oral Proficiency Assessment

Peer reviewed

Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019

As a result of the fact that judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error because of rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…

Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Assessment Behavior and Perceptions of Raters in Paired and Group Oral Interaction

Peer reviewed

Negishi, Junko – Journal of Pan-Pacific Association of Applied Linguistics, 2015

The study considers the assessment of L2 English learners by trained raters in paired and group oral assessments in comparison to an individual, monologue assessment, to determine 1) the degree to which raters assign pairs/groups shared (the same) scores and the degree to which raters give individual members of pairs/groups higher or lower as…

Descriptors: Evaluators, English (Second Language), Second Language Learning, Scores

A Comparative Analysis of Face to Face Instruction vs. Telegram Mobile Instruction in Terms of Narrative Writing

Peer reviewed

Heidari, Jamshid; Khodabandeh, Farzaneh; Soleimani, Hassan – JALT CALL Journal, 2018

The emergence of computer technology in English language teaching has paved the way for teachers' application of Mobile Assisted Language Learning (MALL) and its advantages in teaching. This study aimed to compare the effectiveness of face-to-face instruction with Telegram mobile instruction. Based on a TOEFL test, 60 English foreign language…

Descriptors: Comparative Analysis, Conventional Instruction, Teaching Methods, Computer Assisted Instruction

The Role of Lexical Properties and Cohesive Devices in Text Integration and Their Effect on Human Ratings of Speaking Proficiency

Peer reviewed

Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014

There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…

Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)

Developing Analytic Rating Guides for "TOEFL iBT"® Integrated Speaking Tasks. "TOEFL iBT"® Research Report, TOEFL iBT-20. ETS Research Report. RR-13-13

Peer reviewed

Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013

Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…

Descriptors: Oral Language, Language Proficiency, Scaling, Scores

The Impact of Composition Medium on Essay Raters in Foreign Language Testing.

Manalo, Jonathan R.; Wolfe, Edward W. – 2000

Recently, the Test of English as a Foreign Language (TOEFL) changed by including a writing section that gives the examinee an option between computer and handwritten formats to compose their responses. Unfortunately, this may introduce several potential sources of error that might reduce the reliability and validity of the scores. The seriousness…

Descriptors: Computer Assisted Testing, Essay Tests, Evaluators, Handwriting

Toward an Understanding of the Role of Speech Recognition in Nonnative Speech Assessment. TOEFL iBT Research Report. TOEFL iBT-02. ETS RR-07-02

Peer reviewed

Zechner, Klaus; Bejar, Isaac I.; Hemat, Ramin – ETS Research Report Series, 2007

The increasing availability and performance of computer-based testing has prompted more research on the automatic assessment of language and speaking proficiency. In this investigation, we evaluated the feasibility of using an off-the-shelf speech-recognition system for scoring speaking prompts from the LanguEdge field test of 2002. We first…

Descriptors: Role, Computer Assisted Testing, Language Proficiency, Oral Language

Context Bias in the Test of English as a Foreign Language.

Angoff, William H. – 1989

This study was undertaken to test the hypothesis that items of the Test of English as a Foreign Language (TOEFL) containing reference to American people, places, customs, etc., tend to favor examinees who have spent some time living in the United States. Two samples of examinees were drawn from the March 1987 TOEFL administration, one tested in…

Descriptors: Context Effect, English (Second Language), Evaluators, Foreign Nationals

Investigating the Utility of Analytic Scoring for the TOEFL Academic Speaking Test (TAST). TOEFL iBT Research Report. TOEFL iBT-01. ETS RR-06-07

Peer reviewed

Xi, Xiaoming; Mollaun, Pam – ETS Research Report Series, 2006

This study explores the utility of analytic scoring for the TOEFL® Academic Speaking Test (TAST) in providing useful and reliable diagnostic information in three aspects of candidates' performance: delivery, language use, and topic development. G studies were used to investigate the dependability of the analytic scores, the distinctness of the…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language
