Showing 1 to 15 of 16 results
Peer reviewed
PDF full text available on ERIC
Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023
Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In existing research on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…
Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy
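The abstract does not name a specific agreement statistic, but quadratically weighted kappa (QWK) is the metric most commonly used to quantify human-automated score agreement in AES research. A minimal sketch in Python, assuming integer holistic scores and hypothetical data:

```python
# Minimal sketch: quadratic weighted kappa (QWK) between human and
# machine-assigned essay scores. QWK is a common human-automated
# agreement metric in AES research; the scores below are illustrative.
from sklearn.metrics import cohen_kappa_score

human_scores = [3, 4, 2, 5, 3, 4, 1, 3]    # hypothetical human ratings
machine_scores = [3, 4, 3, 5, 2, 4, 1, 3]  # hypothetical AES outputs

# weights="quadratic" penalizes large score disagreements more heavily
qwk = cohen_kappa_score(human_scores, machine_scores, weights="quadratic")
print(f"QWK = {qwk:.3f}")
```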
Peer reviewed
Direct link
Zhang, Xiuyuan – AERA Online Paper Repository, 2019
The main purpose of the study is to evaluate the qualities of human essay ratings for a large-scale assessment using Rasch measurement theory. Specifically, Many-Facet Rasch Measurement (MFRM) was utilized to examine the rating scale category structure and provide important information about interpretations of ratings in the large-scale…
Descriptors: Essays, Evaluators, Writing Evaluation, Reliability
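For reference, the Many-Facet Rasch Measurement model named in the abstract is conventionally written in a three-facet rating-scale form (a standard textbook formulation, not quoted from this paper):

```latex
% Standard three-facet MFRM (rating-scale form), shown for reference;
% facet labels are generic, not specific to this study.
\log\!\left(\frac{P_{nijk}}{P_{nij(k-1)}}\right)
  = \theta_n - \delta_i - \alpha_j - \tau_k
% \theta_n: ability of examinee n
% \delta_i: difficulty of item/domain i
% \alpha_j: severity of rater j
% \tau_k:  difficulty of category k relative to category k-1
```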
Peer reviewed
PDF full text available on ERIC
Allen, Laura K.; Likens, Aaron D.; McNamara, Danielle S. – Grantee Submission, 2017
The current study examined the degree to which the quality and characteristics of students' essays could be modeled through dynamic natural language processing analyses. Undergraduate students (n = 131) wrote timed, persuasive essays in response to an argumentative writing prompt. Recurrent patterns of the words in the essays were then analyzed…
Descriptors: Writing Evaluation, Essays, Persuasive Discourse, Natural Language Processing
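The "recurrent patterns of the words" analysis suggests recurrence quantification applied to each essay's word sequence. A minimal sketch, assuming simple binary recurrence of identical words (the study's actual dynamic NLP features may differ):

```python
# Minimal sketch: recurrence rate of words in an essay's word sequence,
# in the spirit of recurrence quantification analysis (RQA). This is an
# illustration only; the study's feature set may differ.
def recurrence_rate(text: str) -> float:
    words = text.lower().split()
    n = len(words)
    if n < 2:
        return 0.0
    # Count off-diagonal recurrence points: positions i != j holding
    # the same word.
    recurrent = sum(
        1
        for i in range(n)
        for j in range(n)
        if i != j and words[i] == words[j]
    )
    return recurrent / (n * n - n)

essay = "the argument is strong because the argument uses strong evidence"
print(f"recurrence rate = {recurrence_rate(essay):.3f}")
```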
Peer reviewed
PDF full text available on ERIC
Roscoe, Rod D.; Crossley, Scott A.; Snow, Erica L.; Varner, Laura K.; McNamara, Danielle S. – Grantee Submission, 2014
Automated essay scoring tools are often criticized on the basis of construct validity. Specifically, it has been argued that computational scoring algorithms may not be aligned with higher-level indicators of quality writing, such as writers' demonstrated knowledge and understanding of the essay topics. In this paper, we consider how and whether the…
Descriptors: Correlation, Essays, Scoring, Writing Evaluation
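One way to probe the construct-validity concern described above is to correlate automated scores with an independent, higher-level indicator such as human-rated topic knowledge. A minimal sketch with hypothetical values (not data from this paper):

```python
# Minimal sketch: Pearson correlation between automated essay scores
# and human ratings of demonstrated topic knowledge. All values are
# hypothetical; this only illustrates the kind of alignment check the
# construct-validity critique calls for.
from scipy.stats import pearsonr

aes_scores = [2.5, 3.0, 3.5, 4.0, 4.5, 2.0, 3.5, 5.0]
topic_knowledge = [2, 3, 3, 4, 4, 3, 2, 5]  # hypothetical human ratings

r, p = pearsonr(aes_scores, topic_knowledge)
print(f"r = {r:.2f}, p = {p:.3f}")
```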
Peer reviewed Peer reviewed
Engelhard, George, Jr. – Journal of Educational Measurement, 1996
A new method for evaluating rater accuracy within the context of performance assessments is described. It uses an extended Rasch measurement model, FACETS, which is illustrated with 373 benchmark papers from the Georgia High School Graduation Writing Test rated by 20 operational raters and an expert panel. (SLD)
Descriptors: Essay Tests, Evaluation Methods, Evaluators, Performance Based Assessment
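Accuracy in this line of work means comparing each operational rater's scores against expert-panel ratings of the same benchmark papers. A minimal sketch of the raw agreement idea (the study itself embeds accuracy in an extended Rasch/FACETS model, which this does not reproduce):

```python
# Minimal sketch: per-rater exact-agreement accuracy against expert
# benchmark ratings. Scores are hypothetical.
expert = [4, 3, 5, 2, 4, 3]  # hypothetical expert-panel benchmark scores
rater_scores = {
    "rater_A": [4, 3, 5, 2, 3, 3],
    "rater_B": [4, 4, 4, 2, 4, 2],
}

for rater, scores in rater_scores.items():
    hits = sum(r == e for r, e in zip(scores, expert))
    print(f"{rater}: exact agreement = {hits / len(expert):.2f}")
```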
Manalo, Jonathan R.; Wolfe, Edward W. – 2000
Recently, the Test of English as a Foreign Language (TOEFL) changed to include a writing section that gives examinees the option of composing their responses in either computer or handwritten format. Unfortunately, this may introduce several potential sources of error that might reduce the reliability and validity of the scores. The seriousness…
Descriptors: Computer Assisted Testing, Essay Tests, Evaluators, Handwriting
Page, Ellis B.; Poggio, John P.; Keith, Timothy Z. – 1997
Most human gradings of essays are holistic, or "overall." Therefore, Project Essay Grade (PEG), an attempt to develop computerized grading of essays, has concentrated most of its research on overall grading. It has successfully simulated human judges. However, since computer grading is less expensive than human grading, PEG has also…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Essays, Evaluators
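PEG's classic approach predicted holistic human scores from measurable text features ("proxes") via regression. A minimal sketch with a few crude, hypothetical features (not PEG's actual feature set or data):

```python
# Minimal sketch: regression from surface text features to a holistic
# score, in the spirit of PEG's proxes-based approach. Features, essays,
# and scores are hypothetical.
from sklearn.linear_model import LinearRegression

def features(essay: str):
    words = essay.split()
    return [
        len(words),                                        # essay length
        sum(len(w) for w in words) / max(len(words), 1),   # mean word length
        essay.count(".") + essay.count("!") + essay.count("?"),  # sentence count
    ]

essays = [
    "Short essay. Few ideas.",
    "A longer essay with somewhat more developed ideas. It continues. It concludes.",
    "An essay of moderate length. It has several sentences. Reasonable vocabulary here.",
]
human_scores = [2, 4, 3]  # hypothetical holistic ratings

model = LinearRegression().fit([features(e) for e in essays], human_scores)
print(model.predict([features("Another essay to score. It is brief.")]))
```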
Wolfe, Edward W.; Kao, Chi-Wen – 1996
This paper reports the results of an analysis of the relationship between scorer behaviors and score variability. Thirty-six essay scorers were interviewed and asked to perform a think-aloud task as they scored 24 essays. Each comment made by a scorer was coded according to its content focus (i.e. appearance, assignment, mechanics, communication,…
Descriptors: Content Analysis, Educational Assessment, Essays, Evaluation Methods
Gyagenda, Ismail S.; Engelhard, George, Jr. – 1998
The purpose of this study was to examine rater, domain, and gender influences on the assessed quality of student writing using weighted and unweighted scores. Twenty raters were randomly selected from a group of 87 operational raters contracted to rate essays as part of the 1993 field test of the Georgia High School Writing Test. All of the raters…
Descriptors: Essay Tests, Evaluators, High School Students, High Schools
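The weighted/unweighted distinction in the abstract comes down to whether domain scores enter the composite with equal or differential weights. A minimal sketch, with hypothetical domains and weights (not the Georgia test's actual rubric):

```python
# Minimal sketch: unweighted vs weighted composite of domain scores for
# one essay. Domains and weights are hypothetical.
domain_scores = {"content": 4, "style": 3, "conventions": 2, "organization": 4}
weights = {"content": 2.0, "style": 1.0, "conventions": 1.0, "organization": 1.5}

unweighted = sum(domain_scores.values())
weighted = sum(weights[d] * s for d, s in domain_scores.items())
print(f"unweighted = {unweighted}, weighted = {weighted}")
```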
Gyagenda, Ismail S.; Engelhard, George, Jr. – 1998
The purpose of this study was to describe the Rasch model for measurement and apply the model to examine the relationship between raters, domains of written compositions, and student writing ability. Twenty raters were randomly selected from a group of 87 operational raters contracted to rate essays as part of the 1993 field test of the Georgia…
Descriptors: Difficulty Level, Essay Tests, Evaluators, High School Students
Wolfe, Edward W.; Feltovich, Brian – 1994
This paper presents a model of scorer cognition that incorporates two types of mental models: models of performance (i.e., the criteria for judging performance) and models of scoring (i.e., the procedural scripts for scoring an essay). In Study 1, six novice and five experienced scorers wrote definitions of three levels of a 6-point holistic…
Descriptors: Cognitive Processes, Criteria, Essays, Evaluation Methods
Lunz, Mary E.; Stahl, John A. – 1990
Three examinations administered to medical students were analyzed to determine differences in judge severity across grading periods. The examinations included essay, clinical, and oral forms of the tests. Twelve judges graded the three essays for 32 examinees during a 4-day grading session, which was divided into eight…
Descriptors: Clinical Diagnosis, Comparative Testing, Difficulty Level, Essay Tests
Linacre, John M. – 1990
Rank ordering examinees is an easier task for judges than is awarding numerical ratings. A measurement model for rankings based on Rasch's objectivity axioms provides linear, sample-independent and judge-independent measures. Estimates of examinee measures are obtained from the data set of rankings, along with standard errors and fit statistics.…
Descriptors: Comparative Analysis, Error of Measurement, Essay Tests, Evaluators
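Rank-order models of this kind are usually understood through the Rasch paired-comparison form into which a ranking decomposes; in the standard formulation (shown for reference, not quoted from the paper), the probability that examinee n is ranked above examinee m depends only on the difference of their measures:

```latex
% Standard Rasch paired-comparison form underlying rank-order models,
% shown for reference. \theta_n and \theta_m are examinee measures.
P(n \text{ ranked above } m)
  = \frac{\exp(\theta_n - \theta_m)}{1 + \exp(\theta_n - \theta_m)}
```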
Ferrara, Steven F. – 1987
The necessity of controlling the order in which trained essay raters for a statewide writing assessment program receive student essays was studied. The underlying theoretical question concerns possible rater bias caused by raters reading long strings of essays of homogeneous quality; this problem is usually referred to as context effect or…
Descriptors: Context Effect, Essay Tests, Evaluators, Graduation Requirements
Steele, Joe M. – 1979
The College Outcome Measures Project/American College Testing Program (COMP/ACT) Writing Assessment is described, and issues of validity and reliability in the assessment of writing samples using qualitative rating scales are explored. COMP/ACT is composed of three role-playing tasks in the social sciences, natural sciences, and arts, which are…
Descriptors: Adults, Essay Tests, Evaluators, Higher Education