ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	30
Since 2006 (last 20 years)	73

Descriptor

Essay Tests	217
Scoring	217
Writing Evaluation	86
Higher Education	53
Test Reliability	44
Computer Assisted Testing	41
Interrater Reliability	41
Writing Skills	39
Test Construction	38
Writing Tests	37
Test Validity	31
Automation	29
Scores	28
Holistic Evaluation	27
Writing (Composition)	27
Essays	26
Evaluators	26
Testing Programs	26
College Entrance Examinations	25
Evaluation Methods	24
High Schools	23
Student Evaluation	23
State Programs	22
Comparative Analysis	21
Correlation	21

Publication Type

Reports - Research	126
Journal Articles	97
Reports - Evaluative	38
Speeches/Meeting Papers	37
Reports - Descriptive	27
Tests/Questionnaires	20
Guides - Non-Classroom	14
Information Analyses	13
Numerical/Quantitative Data	6
Opinion Papers	4
Books	3
Guides - Classroom - Teacher	3
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Collected Works - Serials	1

Education Level

Higher Education	30
Postsecondary Education	21
Elementary Secondary Education	13
Secondary Education	11
Elementary Education	6
Intermediate Grades	4
Middle Schools	4
Grade 5	3
Grade 4	2
Grade 6	2
High Schools	2
Adult Education	1
Early Childhood Education	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 3	1
Grade 7	1
Grade 8	1
Grade 9	1
Junior High Schools	1
Primary Education	1

Audience

Researchers	13
Practitioners	12
Teachers	7
Administrators	1
Students	1

Location

California	7
Canada	6
Florida	6
Hong Kong	2
Iran	2
Japan	2
New Jersey	2
Nigeria	2
North Carolina	2
United Kingdom	2
Africa	1
Canada (Vancouver)	1
China	1
Georgia	1
Greece	1
India	1
Iowa	1
Massachusetts	1
Pennsylvania	1
South Korea	1
Taiwan	1
United Kingdom (England)	1

Showing 1 to 15 of 217 results

Employing a Hierarchical Rater Models for Automated Scoring: Scope Review on the Application in Educational Assessment

Peer reviewed

Akif Avcu – Malaysian Online Journal of Educational Technology, 2025

This scope-review presents the milestones of how Hierarchical Rater Models (HRMs) became operable for use in automated essay scoring (AES) to improve instructional evaluation. Although essay evaluations--a useful instrument for evaluating higher-order cognitive abilities--have always depended on human raters, concerns regarding rater bias,…

Descriptors: Automation, Scoring, Models, Educational Assessment

Exploring Rater Accuracy Using Unfolding Models Combined with Topic Models: Incorporating Supervised Latent Dirichlet Allocation

Peer reviewed

Wheeler, Jordan M.; Engelhard, George; Wang, Jue – Measurement: Interdisciplinary Research and Perspectives, 2022

Objectively scoring constructed-response items on educational assessments has long been a challenge due to the use of human raters. Even well-trained raters using a rubric can inaccurately assess essays. Unfolding models measure rater's scoring accuracy by capturing the discrepancy between criterion and operational ratings by placing essays on an…

Descriptors: Accuracy, Scoring, Statistical Analysis, Models

The Impact of Setting Scoring Expectations on Rater Scoring Rates and Accuracy

Peer reviewed

Wendler, Cathy; Glazer, Nancy; Bridgeman, Brent – Applied Measurement in Education, 2020

Efficient constructed response (CR) scoring requires both accuracy and speed from human raters. This study was designed to determine if setting scoring rate expectations would encourage raters to score at a faster pace, and if so, if there would be differential effects on scoring accuracy for raters who score at different rates. Three rater groups…

Descriptors: Scoring, Expectation, Accuracy, Time

An Error-Analysis Study from an EFL Writing Context: Human and Automated Essay Scoring Approaches

Peer reviewed

Almusharraf, Norah; Alotaibi, Hind – Technology, Knowledge and Learning, 2023

Evaluating written texts is believed to be a time-consuming process that can lack consistency and objectivity. Automated essay scoring (AES) can provide solutions to some of the limitations of human scoring. This research aimed to evaluate the performance of one AES system, Grammarly, in comparison to human raters. Both approaches' performances…

Descriptors: Writing Evaluation, Writing Tests, Essay Tests, Essays

Automated Topical Component Extraction Using Neural Network Attention Scores from Source-Based Essay Scoring

Peer reviewed

Zhang, Haoran; Litman, Diane – Grantee Submission, 2020

While automated essay scoring (AES) can reliably grade essays at scale, automated writing evaluation (AWE) additionally provides formative feedback to guide essay revision. However, a neural AES typically does not provide useful feature representations for supporting AWE. This paper presents a method for linking AWE and neural AES, by extracting…

Descriptors: Computer Assisted Testing, Scoring, Essay Tests, Writing Evaluation

Predictive Modeling of Rater Behavior: Implications for Quality Assurance in Essay Scoring

Peer reviewed

Bejar, Isaac I.; Li, Chen; McCaffrey, Daniel – Applied Measurement in Education, 2020

We evaluate the feasibility of developing predictive models of rater behavior, that is, "rater-specific" models for predicting the scores produced by a rater under operational conditions. In the present study, the dependent variable is the score assigned to essays by a rater, and the predictors are linguistic attributes of the essays…

Descriptors: Scoring, Essays, Behavior, Predictive Measurement

Application of an Automated Essay Scoring Engine to English Writing Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…

Descriptors: Computer Assisted Testing, Essays, Scoring, Scores

The Impact of Operational Scoring Experience and Additional Mentored Training on Raters' Essay Scoring Accuracy

Peer reviewed

Choi, Ikkyu; Wolfe, Edward W. – Applied Measurement in Education, 2020

Rater training is essential in ensuring the quality of constructed response scoring. Most of the current knowledge about rater training comes from experimental contexts with an emphasis on short-term effects. Few sources are available for empirical evidence on whether and how raters become more accurate as they gain scoring experiences or what…

Descriptors: Scoring, Accuracy, Training, Evaluators

Why IELTS Candidates Score Low in Writing: Investigating the Effects of Test Design and Scoring Criteria on Test-Takers' Grades in IELTS and World Englishes Essay Writing Tests

Peer reviewed

Arefsadr, Sajjad; Babaii, Esmat; Hashemi, Mohammad Reza – International Journal of Language Testing, 2022

This study explored possible reasons why IELTS candidates usually score low in writing by investigating the effects of two different test designs and scoring criteria on Iranian IELTS candidates' obtained grades in IELTS and World Englishes (WEs) essay writing tests. To this end, first, a WEs essay writing test was preliminarily designed. Then, 17…

Descriptors: English (Second Language), Second Language Learning, Language Tests, Writing Evaluation

Investigating a New Method for Standardising Essay Marking Using Levels-Based Mark Schemes

Peer reviewed

Greatorex, Jackie; Sutch, Tom; Werno, Magda; Bowyer, Jess; Dunn, Karen – International Journal of Assessment Tools in Education, 2019

Standardisation is a procedure used by Awarding Organisations to maximise marking reliability, by teaching examiners to consistently judge scripts using a mark scheme. However, research shows that people are better at comparing two objects than judging each object individually. Consequently, Oxford, Cambridge and RSA (OCR, a UK awarding…

Descriptors: Reliability, Achievement Rating, Standards, Scoring

Why Can't It Mark This One? A Qualitative Analysis of Student Writing Rejected by an Automated Essay Scoring System

Peer reviewed

Reinertsen, Nathanael – English in Australia, 2018

The difference in how humans read and how Automated Essay Scoring (AES) systems process written language leads to a situation where a portion of student responses will be comprehensible to human markers, but unable to be parsed by AES systems. This paper examines a number of pieces of student writing that were marked by trained human markers, but…

Descriptors: Qualitative Research, Writing Evaluation, Essay Tests, Computer Assisted Testing

Classification Accuracy and Efficiency of Writing Screening Using Automated Essay Scoring

Peer reviewed

Wilson, Joshua; Rodrigues, Jessica – Grantee Submission, 2020

The present study leveraged advances in automated essay scoring (AES) technology to explore a proof of concept for a writing screener using the "Project Essay Grade" (PEG) program. First, the study investigated the extent to which an AES-scored multi-prompt writing screener accurately classified students as at risk of failing a Common…

Descriptors: Writing Tests, Screening Tests, Classification, Accuracy

Applying Cognitive Theory to the Human Essay Rating Process

Peer reviewed

Finn, Bridgid; Arslan, Burcu; Walsh, Matthew – Applied Measurement in Education, 2020

To score an essay response, raters draw on previously trained skills and knowledge about the underlying rubric and score criterion. Cognitive processes such as remembering, forgetting, and skill decay likely influence rater performance. To investigate how forgetting influences scoring, we evaluated raters' scoring accuracy on TOEFL and GRE essays.…

Descriptors: Epistemology, Essay Tests, Evaluators, Cognitive Processes

Does the Time between Scoring Sessions Impact Scoring Accuracy? An Evaluation of Constructed-Response Essay Responses on the "GRE"® General Test. Research Report. ETS RR-18-31

Peer reviewed

Finn, Bridgid; Wendler, Cathy; Ricker-Pedley, Kathryn L.; Arslan, Burcu – ETS Research Report Series, 2018

This report investigates whether the time between scoring sessions has an influence on operational and nonoperational scoring accuracy. The study evaluates raters' scoring accuracy on constructed-response essay responses for the "GRE"® General Test. Binomial linear mixed-effect models are presented that evaluate how the effect of various…

Descriptors: Intervals, Scoring, Accuracy, Essay Tests

The Influence of Rater Effects in Training Sets on the Psychometric Quality of Automated Scoring for Writing Assessments

Peer reviewed

Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018

Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…

Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring

Source

ETS Research Report Series	12
Applied Measurement in…	9
Journal of Educational…	9
Assessing Writing	8
Educational Measurement:…	6
Educational Testing Service	4
Educational and Psychological…	4
International Journal of…	3
Journal of Technology,…	3
Educational Review	2
Grantee Submission	2
Higher Education Quarterly	2
Journal of Educational…	2
Language Testing	2
National Center for Research…	2
Academic Medicine	1
American Journal of…	1
Asia Pacific Education Review	1
Assessment in Education:…	1
Assessment in Higher Education	1
Berkeley Review of Education	1
British Journal of…	1
CEA Forum	1
Canadian Modern Language…	1
Clearing House	1

Author

Attali, Yigal	8
White, Edward M.	6
Breland, Hunter M.	5
Bridgeman, Brent	4
Ramineni, Chaitanya	4
Wolfe, Edward W.	4
Zhang, Mo	4
Baker, Eva L.	3
Bejar, Isaac I.	3
Braun, Henry I.	3
Chase, Clinton I.	3
Hughes, David C.	3
Matter, M. Kevin	3
Rupp, André A.	3
Williamson, David M.	3
Anderson, Paul S.	2
Arslan, Burcu	2
Barkaoui, Khaled	2
Bloom, Diane S.	2
Breyer, F. Jay	2
Brossell, Gordon	2
Burstein, Jill	2
Chen, Jing	2
Clariana, Roy B.	2

Assessments and Surveys

Test of English as a Foreign…	16
Graduate Record Examinations	12
General Educational…	5
SAT (College Admission Test)	5
College Level Academic Skills…	4
Test of Standard Written…	4
Advanced Placement…	3
National Assessment of…	3
Praxis Series	3
Graduate Management Admission…	2
International English…	2
Medical College Admission Test	2
New Jersey High School…	2
Test of Written English	2
ACT Assessment	1
Alberta Grade Twelve Diploma…	1
Flesch Reading Ease Formula	1
Iowa Tests of Basic Skills	1
Massachusetts Comprehensive…	1
Metropolitan Achievement Tests	1
National Teacher Examinations	1
New Jersey College Basic…	1
Student Descriptive…	1