Showing 1 to 15 of 23 results
Peer reviewed
Atilgan, Hakan – Eurasian Journal of Educational Research, 2019
Purpose: This study intended to examine the generalizability and reliability of essay ratings within the scope of the generalizability (G) theory. Specifically, the effect of raters on the generalizability and reliability of students' essay ratings was examined. Furthermore, variations of the generalizability and reliability coefficients with…
Descriptors: Foreign Countries, Essay Tests, Test Reliability, Interrater Reliability
Michelle Herridge – ProQuest LLC, 2021
Evaluation of student written work during summative assessments is an important and critical task for instructors at all educational levels. Nevertheless, few research studies exist that provide insights into how different instructors approach this task. Chemistry faculty (FIs) and graduate student instructors (GSIs) regularly engage in the…
Descriptors: Science Instruction, Chemistry, College Faculty, Teaching Assistants
Peer reviewed
Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017
Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…
Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment
Scharf, Davida – ProQuest LLC, 2013
Purpose: The goal of the study was to test an intervention using a brief essay as an instrument for evaluating higher-order information literacy skills in college students, while accounting for prior conditions such as socioeconomic status and prior academic achievement, and identify other predictors of information literacy through an evaluation…
Descriptors: Information Literacy, Intervention, Student Evaluation, College Students
Peer reviewed
Jackson, E. A. – European Journal of Engineering Education, 1988
Investigates the marker-marker reliability of an examination for a third-year degree course in circuit theory. Reports that the coefficient of correlation between markers fell within the range 0.94 to 0.98. (YP)
Descriptors: College Science, Engineering Education, Essay Tests, Interrater Reliability
Peer reviewed
Spaulding, Cheryl L. – Journal of Reading, 1989
Reviews "Written Language Assessment" (WLA), a new standardized test to evaluate children's and adolescents' written language competence by having students write essays instead of answer multiple choice questions. Finds problems with the WLA in terms of interrater reliability. (RS)
Descriptors: Elementary Secondary Education, Essay Tests, Interrater Reliability, Standardized Tests
Weare, Jane; And Others – 1987
This annotated bibliography was developed upon noting a deficiency of information in the literature regarding the training of raters for establishing agreement. The ERIC descriptor, "Interrater Reliability", was used to locate journal articles. Some of the 33 resulting articles focus on mathematical concepts and present formulas for computing…
Descriptors: Annotated Bibliographies, Cloze Procedure, Correlation, Essay Tests
Paden, Patricia A. – 1986
Two factors which may affect the ratings assigned to an essay test are investigated: (1) context effects; and (2) score level effects. Context effects exist in essay scoring if an essay is rated higher when preceded by poor quality essays than when preceded by high quality essays. A score level effect is defined as a change in the score (value)…
Descriptors: Context Effect, Essay Tests, Holistic Evaluation, Interrater Reliability
Bunch, Michael B.; Littlefair, Wendy – 1988
A total of 2,000 essays written by 1,000 students were submitted to generalizability analyses for domain-referenced tests. Each student had written one essay on each of two prompts representing two modes of discourse. Each essay was read by six readers and judged on a scale of 1 to 4. No reader read essays from both prompts. Reader agreement…
Descriptors: Cutting Scores, Essay Tests, Generalizability Theory, Interrater Reliability
Ackerman, Terry A. – 1986
The purpose of this paper is to compare the precision of direct and indirect measures of writing assessment using the test information functions from a graded response Item Response Theory (IRT) model. Subjects were 192 sophomore English students from a parochial high school in Wisconsin. Both direct and indirect measures of writing ability were…
Descriptors: Correlation, Essay Tests, High Schools, Interrater Reliability
Mitchell, Karen J.; Anderson, Judith A. – 1987
The Association of American Medical Colleges is conducting research to develop, implement, and evaluate a Medical College Admission Test (MCAT) essay testing program. Essay administration in the spring and fall of 1985 and 1986 suggested that additional research was needed on the development of topics which elicit similar skills and meet standard…
Descriptors: College Entrance Examinations, Essay Tests, Estimation (Mathematics), Generalizability Theory
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Peterson, Gary W. – 1983
Even though several national testing firms have developed measures to evaluate the effectiveness of baccalaureate education, there continues to be a general reluctance on the part of faculty in colleges and universities to accept these measures as criteria on which to evaluate educational programs. Some of the resistance appears to lie in the lack…
Descriptors: Bachelors Degrees, Cognitive Processes, Difficulty Level, Essay Tests
Breland, Hunter M.; And Others – 1987
Six university English departments collaborated in this examination of the differences between multiple-choice and essay tests in evaluating writing skills. The study also investigated ways the two tools can complement one another, ways to improve cost effectiveness of essay testing, and ways to integrate assessment and the educational process.…
Descriptors: Comparative Testing, Efficiency, Essay Tests, Higher Education
Breland, Hunter M. – 1983
Direct assessment of writing skill, usually considered to be synonymous with assessment by means of writing samples, is reviewed in terms of its history and with respect to evidence of its reliability and validity. Reliability is examined as it is influenced by reader inconsistency, domain sampling, and other sources of error. Validity evidence is…
Descriptors: Essay Tests, Evaluation Needs, Higher Education, Interrater Reliability