ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	22
Since 2006 (last 20 years)	48

Descriptor

Language Tests	53
Scoring	53
Second Language Learning	53
English (Second Language)	52
Computer Assisted Testing	21
Evaluators	20
Correlation	17
Scores	17
Essays	15
Foreign Countries	15
Writing Tests	14
Writing Evaluation	12
Accuracy	11
Comparative Analysis	11
Automation	10
Interrater Reliability	10
Oral Language	10
Language Proficiency	9
Speech Communication	9
Native Language	8
Test Validity	8
Computer Software	7
Prompting	7
Statistical Analysis	7
College Entrance Examinations	6

Publication Type

Journal Articles	49
Reports - Research	45
Tests/Questionnaires	13
Reports - Evaluative	4
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1

Education Level

Higher Education	15
Postsecondary Education	14
Secondary Education	3
Elementary Education	2
High Schools	2
Junior High Schools	2
Middle Schools	2
Adult Education	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
Intermediate Grades	1

Audience

Researchers

Location

Iran	4
Canada	3
China	3
Japan	3
Germany	2
India	2
Mexico	2
Australia	1
Brazil	1
California (Los Angeles)	1
Colombia	1
Ethiopia	1
Georgia	1
Hong Kong	1
Indiana	1
Iowa	1
Japan (Tokyo)	1
Jordan	1
Kenya	1
Michigan	1
Minnesota	1
New York	1
South Korea	1
Switzerland	1
Taiwan	1

Assessments and Surveys

Test of English as a Foreign…	53
Graduate Record Examinations	5
International English…	3
Graduate Management Admission…	2
Praxis Series	2
Computer Attitude Scale	1
Law School Admission Test	1
Medical College Admission Test	1
Michigan Test of English…	1
SAT (College Admission Test)	1
Test of English for…	1
Test of Written English	1

Showing 1 to 15 of 53 results

Complementary Strengths? Evaluation of a Hybrid Human-Machine Scoring Approach for a Test of Oral Academic English

Peer reviewed

Davis, Larry; Papageorgiou, Spiros – Assessment in Education: Principles, Policy & Practice, 2021

Human raters and machine scoring systems potentially have complementary strengths in evaluating language ability; specifically, it has been suggested that automated systems might be used to make consistent measurements of specific linguistic phenomena, whilst humans evaluate more global aspects of performance. We report on an empirical study that…

Descriptors: Scoring, English for Academic Purposes, Oral English, Speech Tests

Challenges and Opportunities for Spoken English Learning and Instruction Brought by Automated Speech Scoring in Large-Scale Speaking Tests: A Mixed-Method Investigation into the Washback of "SpeechRater" in TOEFL iBT

Peer reviewed

Gong, Kaixuan – Asian-Pacific Journal of Second and Foreign Language Education, 2023

The extensive use of automated speech scoring in large-scale speaking assessment can be revolutionary not only to test design and rating, but also to the learning and instruction of speaking based on how students and teachers perceive and react to this technology. However, its washback remained underexplored. This mixed-method study aimed to…

Descriptors: Second Language Learning, Language Tests, English (Second Language), Automation

Design Framework for the "TOEFL® Essentials"™ Test 2021. Research Memorandum. ETS RM-21-03

Papageorgiou, Spiros; Davis, Larry; Norris, John M.; Garcia Gomez, Pablo; Manna, Venessa F.; Monfils, Lora – Educational Testing Service, 2021

The "TOEFL® Essentials"™ test is a new English language proficiency test in the "TOEFL"® family of assessments. It measures foundational language skills and communication abilities in academic and general (daily life) contexts. The test covers the four language skills of reading, listening, writing, and speaking and is intended…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Language Proficiency

To Be with Artificial Intelligence in Oral Test or Not to Be: A Probe into the Traces of Success in Speaking Skill, Psychological Well-Being, Autonomy, and Academic Buoyancy

Peer reviewed

Biju Theruvil Sayed; Zein Bassam Bani Younes; Ahmad Alkhayyat; Iroda Adhamova; Habesha Teferi – Language Testing in Asia, 2024

There has been a surge in employing artificial intelligence (AI) in all areas of language pedagogy, not the least among them language testing and assessment. This study investigated the effects of AI-powered tools on English as a Foreign Language (EFL) learners' speaking skills, psychological well-being, autonomy, and academic buoyancy. Using a…

Descriptors: Artificial Intelligence, Language Tests, Success, Speech Skills

Automated Scoring of Speaking and Writing: Starting to Hit Its Stride

Peer reviewed
PDF on ERIC

Jones, Daniel Marc; Cheng, Liying; Tweedie, M. Gregory – Canadian Journal of Learning and Technology, 2022

This article reviews recent literature (2011-present) on the automated scoring (AS) of writing and speaking. Its purpose is to first survey the current research on automated scoring of language, then highlight how automated scoring impacts the present and future of assessment, teaching, and learning. The article begins by outlining the general…

Descriptors: Automation, Computer Assisted Testing, Scoring, Writing (Composition)

Auto-Scoring of Student Speech: Proprietary vs. Open-Source Solutions

Peer reviewed
PDF on ERIC

Daniels, Paul – TESL-EJ, 2022

This paper compares the speaking scores generated by two online systems that are designed to automatically grade student speech and provide personalized speaking feedback in an EFL context. The first system, "Speech Assessment for Moodle" ("SAM"), is an open-source solution developed by the author that makes use of Google's…

Descriptors: Speech Communication, Auditory Perception, Computer Uses in Education, Computer Assisted Testing

Applying Cognitive Theory to the Human Essay Rating Process

Peer reviewed

Finn, Bridgid; Arslan, Burcu; Walsh, Matthew – Applied Measurement in Education, 2020

To score an essay response, raters draw on previously trained skills and knowledge about the underlying rubric and score criterion. Cognitive processes such as remembering, forgetting, and skill decay likely influence rater performance. To investigate how forgetting influences scoring, we evaluated raters' scoring accuracy on TOEFL and GRE essays.…

Descriptors: Epistemology, Essay Tests, Evaluators, Cognitive Processes

Application of Best Linear Prediction and Penalized Best Linear Prediction to ETS Tests. Research Report. ETS RR-20-08

Peer reviewed
PDF on ERIC

Haberman, Shelby J. – ETS Research Report Series, 2020

Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].

Descriptors: Prediction, Scores, Tests, Testing Programs

Rater Dominance in Discussion as a Resolution Method

Peer reviewed
PDF on ERIC

Ahmadi, Alireza – Taiwan Journal of TESOL, 2020

Rater subjectivity has long been an intriguing topic. The use of discussion as a resolution method is a practical way to reduce this subjectivity. However, the efficacy of discussion depends on whether different raters get equally engaged in it or one rater tends to dominate others. This study investigated whether and how rater dominance occurs in…

Descriptors: Evaluators, Interrater Reliability, Discussion, Discourse Analysis

In Search of New Benchmarks: Using L2 Lexical Frequency and Contextual Diversity Indices to Assess Second Language Writing

Peer reviewed

Monteiro, Kátia R.; Crossley, Scott A.; Kyle, Kristopher – Applied Linguistics, 2020

Lexical items that are encountered more frequently and in varying contexts have important effects on second language (L2) development because frequent and contextually diverse words are learned faster and become more entrenched in a learner's lexicon (Ellis 2002a, b). Despite evidence that L2 learners are generally exposed to non-native input,…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Benchmarking

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed
PDF on ERIC

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Screener Tests Need Validation Too: Weighing an Argument for Test Use against Practical Concerns

Peer reviewed

Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018

In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…

Descriptors: Test Validity, Test Use, Test Construction, Language Tests

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

Prediction of Writing True Scores in Automated Scoring of Essays by Best Linear Predictors and Penalized Best Linear Predictors. Research Report. ETS RR-19-13

Peer reviewed
PDF on ERIC

Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019

Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…

Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring


Source

ETS Research Report Series	18
Language Testing	5
Applied Linguistics	2
Grantee Submission	2
JALT CALL Journal	2
Language Assessment Quarterly	2
Advances in Language and…	1
Applied Measurement in…	1
Asian-Pacific Journal of…	1
Assessing Writing	1
Assessment in Education:…	1
Canadian Journal of Learning…	1
College Entrance Examination…	1
Education and Information…	1
Educational Testing Service	1
English Language Teaching	1
Journal of Pan-Pacific…	1
Language Education &…	1
Language Learning	1
Language Learning in Higher…	1
Language Testing in Asia	1
ProQuest LLC	1
Reading Matrix: An…	1
SAGE Open	1
TESL-EJ	1

Author

Xi, Xiaoming	5
Kantor, Robert	4
Attali, Yigal	3
Bridgeman, Brent	3
Crossley, Scott A.	3
Davis, Larry	3
Lee, Yong-Won	3
Blanchard, Daniel	2
Gentile, Claudia	2
Guo, Liang	2
Haberman, Shelby J.	2
Higgins, Derrick	2
Kyle, Kristopher	2
McNamara, Danielle S.	2
Mollaun, Pam	2
Mollaun, Pamela	2
Papageorgiou, Spiros	2
Weigle, Sara Cushing	2
Zechner, Klaus	2
Ahmad Alkhayyat	1
Ahmadi Shirazi, Masoumeh	1
Ahmadi, Alireza	1
Allen, Laura K.	1
Arslan, Burcu	1