Atilgan, Hakan – Eurasian Journal of Educational Research, 2019
Purpose: This study intended to examine the generalizability and reliability of essay ratings within the scope of the generalizability (G) theory. Specifically, the effect of raters on the generalizability and reliability of students' essay ratings was examined. Furthermore, variations of the generalizability and reliability coefficients with…
Descriptors: Foreign Countries, Essay Tests, Test Reliability, Interrater Reliability
Herridge, Michelle – ProQuest LLC, 2021
Evaluation of student written work during summative assessments is an important and critical task for instructors at all educational levels. Nevertheless, few research studies exist that provide insights into how different instructors approach this task. Chemistry faculty (FIs) and graduate student instructors (GSIs) regularly engage in the…
Descriptors: Science Instruction, Chemistry, College Faculty, Teaching Assistants
Badjadi, Nour El Imane – Online Submission, 2013
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability
Patience, Wayne; Auchter, Joan – 1988
A central aim in any assessment program is to ensure fair and stable scoring from administration to administration. When administrations are decentralized, not only in location, but in frequency and in logistical configuration, it is imperative to construct training, certifying, and monitoring systems that provide continuity between the original…
Descriptors: Equivalency Tests, Essay Tests, Scoring, Secondary Education

Spaulding, Cheryl L. – Journal of Reading, 1989
Reviews "Written Language Assessment" (WLA), a new standardized test that evaluates children's and adolescents' written language competence by having students write essays instead of answering multiple-choice questions. Finds problems with the WLA in terms of interrater reliability. (RS)
Descriptors: Elementary Secondary Education, Essay Tests, Interrater Reliability, Standardized Tests

Mitchell, Karen; Anderson, Judy – Educational and Psychological Measurement, 1986
This study examined the reliability of holistic scoring for a sample of essays written during the Spring 1985 MCAT administration. Analysis of variance techniques were used to estimate the reliability of scoring and to partition score variance into that due to level differences between papers and that due to context-specific factors. (Author/LMO)
Descriptors: Analysis of Variance, Essay Tests, Holistic Evaluation, Medical Education
Fishman, Judith – Writing Program Administration, 1984
Examines the CUNY-WAT program and questions many aspects of it, especially the choice and phrasing of topics. (FL)
Descriptors: Essay Tests, Higher Education, Test Format, Test Items

Ruth, Leo; Murphy, Sandra – College Composition and Communication, 1984
Discusses the problems of readers' different responses to essay test passages and the implications for the subsequent writing assignment. Describes a model of the writing assessment episode and discusses four generalizations about writing task interpretation. (HTH)
Descriptors: Essay Tests, Evaluation Methods, Higher Education, Reader Response
Swartz, Richard; And Others – 1985
In preparation for adding an essay test to the General Educational Development (GED) test, the GED Testing Service undertook a series of studies to establish (1) whether acceptable reading reliabilities were attainable in decentralized holistic scoring sessions often involving no more than a dozen papers; (2) whether essay readers in a variety of…
Descriptors: Essay Tests, High School Equivalency Programs, Scoring, Test Reliability
Crocker, Linda – New Directions for Community Colleges, 1987
Examines reasons for using essay tests in the direct assessment of writing ability. Reviews the steps in developing a large-scale testing program; e.g., creating a pool of topics or prompts; developing scoring procedures; training raters; field-testing the system; scoring writing samples; assessing reliability; and assessing validity. (DMM)
Descriptors: Essay Tests, Postsecondary Education, Scoring, Test Construction

Meredith, Vana Hutto; Williams, Paula L. – Educational Measurement: Issues and Practice, 1984
The accuracy issues involved in detecting student writing skill deficiencies, factors to be dealt with or controlled, and procedural and statistical methods for controlling these factors are the focus of this article. The accuracy needed for effective instructional planning requires stability in writing assessment programs. (DWH)
Descriptors: Elementary Secondary Education, Essay Tests, Scoring, Test Reliability

Branthwaite, Alan; And Others – Educational Review, 1981
In this naturalistic study of essay marking, 15 university lecturers graded an examination paper and completed the Eysenck Personality Questionnaire. A significant positive correlation was found between the marks given and the grader's lie score, indicating possible effects of staff-student interactions or social desirability on biases in grading.…
Descriptors: Essay Tests, Experimenter Characteristics, Higher Education, Personality Traits
Paden, Patricia A. – 1986
Two factors which may affect the ratings assigned to an essay test are investigated: (1) context effects; and (2) score level effects. Context effects exist in essay scoring if an essay is rated higher when preceded by poor quality essays than when preceded by high quality essays. A score level effect is defined as a change in the score (value)…
Descriptors: Context Effect, Essay Tests, Holistic Evaluation, Interrater Reliability
Swartz, Richard; Whitney, Douglas R. – 1985
The primary purpose of this study was to examine the relationship between scores on the multiple-choice General Educational Development (GED) Writing Skills test and scores on holistically graded essays. Secondary purposes included the following: (1) examining the relationship of essay scores to scores on the multiple-choice GED Reading Skills…
Descriptors: Essay Tests, High School Equivalency Programs, Reading Tests, Test Reliability
Ackerman, Terry A.; Davey, Tim C. – 1989
This study examines differences and similarities in the information provided by direct and indirect measures of writing from the Collegiate Assessment of Academic Proficiency (CAAP). The indirect measure was a 72-item multiple-choice test, while the direct measure involved responding to two essay prompts. The 40-minute multiple-choice test can be…
Descriptors: College Entrance Examinations, Essay Tests, Higher Education, Latent Trait Theory