Publication Date
  In 2025: 0
  Since 2024: 0
  Since 2021 (last 5 years): 1
  Since 2016 (last 10 years): 3
  Since 2006 (last 20 years): 28
Descriptor
  Computer Assisted Testing: 35
  Essays: 25
  Scoring: 17
  Essay Tests: 15
  Writing Evaluation: 14
  Evaluation Methods: 13
  Writing Tests: 11
  Foreign Countries: 10
  Grading: 9
  Interrater Reliability: 9
  Computer Software: 8
Author
  Coniam, David: 2
  Davies, Phil: 2
  James, Cindy L.: 2
  Attali, Yigal: 1
  Barker, Trevor: 1
  Behizadeh, Nadia: 1
  Bridgeman, Brent: 1
  Brown, Gavin T. L.: 1
  Brown, Kevin: 1
  Burk, John: 1
  Burrows, Steven: 1
Publication Type
  Reports - Evaluative: 35
  Journal Articles: 30
  Speeches/Meeting Papers: 2
Education Level
  Higher Education: 14
  Postsecondary Education: 9
  Elementary Secondary Education: 7
  Secondary Education: 2
  Grade 11: 1
Assessments and Surveys
  Test of English as a Foreign Language: 5
  Graduate Record Examinations: 3
  National Assessment of…: 2
  SAT (College Admission Test): 2
  ACT Assessment: 1
  Praxis Series: 1
Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023
Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…
Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy
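The truncated abstract above names human-automated score agreement as the benchmark for machine-generated scores. In AES research that agreement is commonly reported as quadratic weighted kappa (QWK); the sketch below is a minimal illustration with invented scores, not the specific metric or data Doewes et al. use.

```python
# Minimal sketch of quadratic weighted kappa (QWK), a common
# human-machine agreement statistic in AES research.
# All scores below are invented for illustration.
import numpy as np

def quadratic_weighted_kappa(human, machine, min_score, max_score):
    """Agreement between two integer score vectors; 1.0 = perfect."""
    n = max_score - min_score + 1
    # Observed counts of (human score, machine score) pairs.
    observed = np.zeros((n, n))
    for h, m in zip(human, machine):
        observed[h - min_score, m - min_score] += 1
    # Expected counts if the raters were independent (outer product
    # of the marginal distributions, scaled to the sample size).
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0)) / len(human)
    # Quadratic weights: larger score gaps are penalized more heavily.
    idx = np.arange(n)
    weights = (idx[:, None] - idx[None, :]) ** 2 / (n - 1) ** 2
    return 1.0 - (weights * observed).sum() / (weights * expected).sum()

human_scores   = [2, 3, 4, 4, 1, 3, 5, 2]
machine_scores = [2, 3, 3, 4, 2, 3, 5, 3]
print(round(quadratic_weighted_kappa(human_scores, machine_scores, 1, 5), 3))
```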
Moncaleano, Sebastian; Russell, Michael – Journal of Applied Testing Technology, 2018
2017 marked a century since the development and administration of the first large-scale, group-administered standardized test. Since that time, both the importance of testing and the technology of testing have advanced significantly. This paper traces the technological advances that have led to the large-scale administration of educational tests in…
Descriptors: Technological Advancement, Standardized Tests, Computer Assisted Testing, Automation
Deane, Paul – Assessing Writing, 2013
This paper examines the construct measured by automated essay scoring (AES) systems. AES systems measure features of the text structure, linguistic structure, and conventional print form of essays; as such, the systems primarily measure text production skills. In the current state of the art, AES systems provide little direct evidence about such matters…
Descriptors: Scoring, Essays, Text Structure, Writing (Composition)
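As a toy illustration of the "text production" features Deane describes (text structure, linguistic structure, conventional print form), the sketch below computes a few surface proxies from an essay string. Real engines such as e-rater use far richer NLP pipelines; the feature names and thresholds here are hypothetical.

```python
# Toy surface-feature extractor of the kind Deane's construct
# critique targets: proxies for text production, not writing quality.
import re

def surface_features(essay: str) -> dict:
    words = re.findall(r"[A-Za-z']+", essay)
    sentences = [s for s in re.split(r"[.!?]+", essay) if s.strip()]
    return {
        "word_count": len(words),                       # fluency proxy
        "avg_sentence_length": len(words) / len(sentences),  # syntactic complexity proxy
        "type_token_ratio": len({w.lower() for w in words}) / len(words),  # vocabulary diversity
    }

print(surface_features("Essays vary. Longer essays often score higher."))
```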
Behizadeh, Nadia; Lynch, Tom Liam – Berkeley Review of Education, 2017
For the last century, the quality of large-scale assessment in the United States has been undermined by narrow educational theory and hindered by limitations in technology. As a result, poor assessment practices have encouraged low-level instructional practices that disparately affect students from the most disadvantaged communities and schools.…
Descriptors: Equal Education, Measurement, Educational Theories, Evaluation Methods
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Weigle, Sara Cushing – Assessing Writing, 2013
This article presents considerations for using automated scoring systems to evaluate second language writing. A distinction is made between English language learners in English-medium educational systems and those studying English in their own countries for a variety of purposes, and between learning-to-write and writing-to-learn in a second…
Descriptors: Scoring, Second Language Learning, Second Languages, English Language Learners
Elliott, Victoria – Changing English: Studies in Culture and Education, 2014
Automated essay scoring programs are becoming more common and more technically advanced. They provoke strong reactions from both their advocates and their detractors. Arguments tend to fall into two categories: technical and principled. This paper argues that since technical difficulties will be overcome with time, the debate ought to be held in…
Descriptors: English, English Instruction, Grading, Computer Assisted Testing
Brown, Gavin T. L. – Higher Education Quarterly, 2010
The use of timed, essay examinations is a well-established means of evaluating student learning in higher education. The reliability of essay scoring is highly problematic and it appears that essay examination grades are highly dependent on language and organisational components of writing. Computer-assisted scoring of essays makes use of language…
Descriptors: Higher Education, Essay Tests, Validity, Scoring
Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013
In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…
Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics
Cope, Bill; Kalantzis, Mary – Open Review of Educational Research, 2015
This article sets out to explore a shift in the sources of evidence-of-learning in the era of networked computing. One of the key features of recent developments has been popularly characterized as "big data". We begin by examining, in general terms, the frame of reference of contemporary debates on machine intelligence and the role of…
Descriptors: Data Analysis, Evidence, Computer Uses in Education, Artificial Intelligence
Brown, Kevin – CEA Forum, 2015
In this article, the author describes his project to take every standardized exam English majors take. During the summer and fall semesters of 2012, the author signed up for and took the GRE General Test, the Praxis Content Area Exam (English Language, Literature, and Composition: Content Knowledge), the Senior Major Field Tests in…
Descriptors: College Faculty, College English, Test Preparation, Standardized Tests
Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009
This report evaluates the construct coverage of the e-rater[R] scoring engine. The matter of construct coverage depends on whether one defines writing skill in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…
Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests
Barker, Trevor – Electronic Journal of e-Learning, 2011
The recent National Student Survey showed that feedback to students was an ongoing problem in Higher Education. This paper reports on the extension of our past research into the provision of automated feedback for objective testing. In the research presented here, the system has been further developed for marking practical and essay questions and…
Descriptors: Feedback (Response), Evaluation, Adaptive Testing, Objective Tests
McCurry, Doug – Assessing Writing, 2010
This article considers the claim that machine scoring of writing test responses agrees with human readers as much as humans agree with other humans. These claims about the reliability of machine scoring of writing are usually based on specific and constrained writing tasks, and there is reason for asking whether machine scoring of writing requires…
Descriptors: Writing Tests, Scoring, Interrater Reliability, Computer Assisted Testing
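McCurry's question is whether the oft-cited parity between machine-human and human-human agreement holds up beyond constrained writing tasks. A minimal sketch of the comparison itself, with invented scores:

```python
# Exact-agreement rates for a human-human pair vs a machine-human
# pair on the same essays. Scores are invented for illustration.
def exact_agreement(a, b):
    return sum(x == y for x, y in zip(a, b)) / len(a)

rater1  = [3, 4, 2, 5, 3, 4, 2, 3]
rater2  = [3, 3, 2, 5, 4, 4, 2, 3]
machine = [3, 4, 2, 4, 3, 4, 3, 3]

print("human-human:  ", exact_agreement(rater1, rater2))   # 0.75
print("machine-human:", exact_agreement(machine, rater1))  # 0.75
```

Identical headline rates like these are the kind of evidence McCurry argues is task-dependent rather than general.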
Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – Applied Linguistics, 2010
The main purpose of the study was to investigate the distinctness and reliability of analytic (or multi-trait) rating dimensions and their relationships to holistic scores and "e-rater"[R] essay feature variables in the context of the TOEFL[R] computer-based test (TOEFL CBT) writing assessment. Data analyzed in the study were holistic…
Descriptors: Writing Evaluation, Writing Tests, Scoring, Essays
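One way to picture the "relationships to holistic scores" the study investigates is a simple correlation between one analytic rating dimension and holistic ratings of the same essays. The scores below are invented, not TOEFL CBT data, and the dimension name is hypothetical.

```python
# Correlation between an analytic "organization" dimension and
# holistic scores for the same essays (invented data).
from statistics import correlation  # Python 3.10+

organization = [3, 4, 2, 5, 3, 4]  # analytic ratings (hypothetical)
holistic     = [3, 5, 2, 5, 3, 4]  # holistic scores for the same essays

print(round(correlation(organization, holistic), 2))  # ≈ 0.94
```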