ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	12

Descriptor

Computer Assisted Testing	12
Essays	12
Scoring	11
English (Second Language)	10
Language Tests	10
Second Language Learning	9
Writing Tests	9
Writing Evaluation	8
Correlation	6
Accuracy	4
Evaluators	4
Automation	3
Computer Software	3
Evaluation Criteria	3
Prediction	3
Scores	3
Construct Validity	2
Cues	2
Essay Tests	2
Evaluation Methods	2
Factor Analysis	2
Foreign Countries	2
High Stakes Tests	2
Holistic Approach	2
Interrater Reliability	2
More ▼

Source

ETS Research Report Series	5
Educational Testing Service	2
Applied Linguistics	1
Assessing Writing	1
Canadian Journal of Learning…	1
Journal of Technology,…	1
Language Testing	1

Publication Type

Journal Articles	10
Reports - Research	6
Reports - Evaluative	4
Information Analyses	1
Reports - Descriptive	1

Education Level

Higher Education	3
Postsecondary Education	2
Elementary Secondary Education	1
High Schools	1
Secondary Education	1

Audience

Location

Canada	1
Germany	1
Switzerland	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	12
Graduate Record Examinations	3
Praxis Series	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Automated Scoring of Speaking and Writing: Starting to Hit Its Stride

Peer reviewed
PDF on ERIC

Download full text

Jones, Daniel Marc; Cheng, Liying; Tweedie, M. Gregory – Canadian Journal of Learning and Technology, 2022

This article reviews recent literature (2011-present) on the automated scoring (AS) of writing and speaking. Its purpose is to first survey the current research on automated scoring of language, then highlight how automated scoring impacts the present and future of assessment, teaching, and learning. The article begins by outlining the general…

Descriptors: Automation, Computer Assisted Testing, Scoring, Writing (Composition)

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Download full text

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

Prediction of Writing True Scores in Automated Scoring of Essays by Best Linear Predictors and Penalized Best Linear Predictors. Research Report. ETS RR-19-13

Peer reviewed
PDF on ERIC

Download full text

Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019

Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…

Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring

Shaping a Score: Complexity, Accuracy, and Fluency in Integrated Writing Performances

Peer reviewed

Direct link

Plakans, Lia; Gebril, Atta; Bilki, Zeynep – Language Testing, 2019

The present study investigates integrated writing assessment performances with regard to the linguistic features of complexity, accuracy, and fluency (CAF). Given the increasing presence of integrated tasks in large-scale and classroom assessments, validity evidence is needed for the claim that their scores reflect targeted language abilities.…

Descriptors: Accuracy, Language Tests, Scores, Writing Evaluation

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

English Language Learners and Automated Scoring of Essays: Critical Considerations

Peer reviewed

Direct link

Weigle, Sara Cushing – Assessing Writing, 2013

This article presents considerations for using automated scoring systems to evaluate second language writing. A distinction is made between English language learners in English-medium educational systems and those studying English in their own countries for a variety of purposes, and between learning-to-write and writing-to-learn in a second…

Descriptors: Scoring, Second Language Learning, Second Languages, English Language Learners

Evaluation of the "e-rater"® Scoring Engine for the "TOEFL"® Independent and Integrated Prompts. Research Report. ETS RR-12-06

Peer reviewed
PDF on ERIC

Download full text

Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012

Scoring models for the "e-rater"® system were built and evaluated for the "TOEFL"® exam's independent and integrated writing prompts. Prompt-specific and generic scoring models were built, and evaluation statistics, such as weighted kappas, Pearson correlations, standardized differences in mean scores, and correlations with…

Descriptors: Scoring, Prompting, Evaluators, Computer Software

Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

Download full text

Haberman, Shelby J. – Educational Testing Service, 2011

Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

Descriptors: Writing Tests, Scoring, Essays, Language Tests

Toward Automated Multi-Trait Scoring of Essays: Investigating Links among Holistic, Analytic, and Text Feature Scores

Peer reviewed

Direct link

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – Applied Linguistics, 2010

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multi-trait) rating dimensions and their relationships to holistic scores and "e-rater"[R] essay feature variables in the context of the TOEFL[R] computer-based test (TOEFL CBT) writing assessment. Data analyzed in the study were holistic…

Descriptors: Writing Evaluation, Writing Tests, Scoring, Essays

Performance of a Generic Approach in Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Bridgeman, Brent; Trapani, Catherine – Journal of Technology, Learning, and Assessment, 2010

A generic approach in automated essay scoring produces scores that have the same meaning across all prompts, existing or new, of a writing assessment. This is accomplished by using a single set of linguistic indicators (or features), a consistent way of combining and weighting these features into essay scores, and a focus on features that are not…

Descriptors: Writing Evaluation, Writing Tests, Scoring, Test Scoring Machines

Evaluating the Construct-Coverage of the e-rater[R] Scoring Engine. Research Report. ETS RR-09-01

Download full text

Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009

This report evaluates the construct coverage of the e-rater[R[ scoring engine. The matter of construct coverage depends on whether one defines writing skill, in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…

Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests

Analytic Scoring of TOEFL® CBT Essays: Scores from Humans and "E-rater"®. TOEFL® Research Reports. RR-81. ETS RR-08-01

Peer reviewed
PDF on ERIC

Download full text

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – ETS Research Report Series, 2008

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multitrait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (CBT) writing assessment. Data analyzed in the study were analytic and holistic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scoring

Attali, Yigal	2
Bridgeman, Brent	2
Gentile, Claudia	2
Haberman, Shelby J.	2
Kantor, Robert	2
Lee, Yong-Won	2
Bilki, Zeynep	1
Casabianca, Jodi M.	1
Cheng, Liying	1
Davey, Tim	1
Gebril, Atta	1
Higgins, Derrick	1
Jones, Daniel Marc	1
Keller, Stefan	1
Krüger, Maleika	1
Köller, Olaf	1
Plakans, Lia	1
Quinlan, Thomas	1
Ramineni, Chaitanya	1
Rupp, André A.	1
Sinharay, Sandip	1
Trapani, Catherine	1
Trapani, Catherine S.	1
Tweedie, M. Gregory	1
Weigle, Sara Cushing	1
More ▼