Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Essay Tests | 20 |
Scoring | 9 |
Higher Education | 6 |
Writing Skills | 6 |
Correlation | 5 |
Multiple Choice Tests | 5 |
Writing Evaluation | 5 |
Test Validity | 4 |
Interrater Reliability | 3 |
Scores | 3 |
Test Construction | 3 |
Source
Journal of Educational… | 20 |
Author
Chase, Clinton I. | 3 |
Bridgeman, Brent | 2 |
Hughes, David C. | 2 |
Akeju, S. A. | 1 |
Benton, Stephen L. | 1 |
Biesbrock, Edieann F. | 1 |
Blok, H. | 1 |
Breland, Hunter M. | 1 |
Cross, Lawrence H. | 1 |
DeCarlo, Lawrence T. | 1 |
Eiting, Mindert H. | 1 |
Publication Type
Journal Articles | 17 |
Reports - Research | 15 |
Reports - Evaluative | 2 |
Speeches/Meeting Papers | 2 |
Audience
Researchers | 2 |
Assessments and Surveys
Test of Standard Written… | 2 |
Advanced Placement… | 1 |
Flesch Reading Ease Formula | 1 |
Graduate Record Examinations | 1 |
Metropolitan Achievement Tests | 1 |
National Teacher Examinations | 1 |
DeCarlo, Lawrence T.; Kim, YoungKoung; Johnson, Matthew S. – Journal of Educational Measurement, 2011
The hierarchical rater model (HRM) recognizes the hierarchical structure of data that arises when raters score constructed response items. In this approach, raters' scores are not viewed as being direct indicators of examinee proficiency but rather as indicators of essay quality; the (latent categorical) quality of an examinee's essay in turn…
Descriptors: Responses, Essay Tests, Models, Scores
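A minimal sketch of the hierarchical structure this abstract describes, in generic notation chosen here for illustration rather than taken from the article: rater r's score X_{jr} for examinee j depends only on the essay's latent quality category \xi_j, which in turn depends on the examinee's proficiency \theta_j, so that

  P(X_{j1}, ..., X_{jR} | \theta_j) = \sum_c P(\xi_j = c | \theta_j) \prod_r P(X_{jr} | \xi_j = c).

The rating-stage probabilities P(X_{jr} | \xi_j) can take whatever form the authors specify; the point of the structure is that raters inform proficiency only through the latent quality of the essay.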

Englehard, George, Jr. – Journal of Educational Measurement, 1996
A new method for evaluating rater accuracy within the context of performance assessments is described. It uses an extended Rasch measurement model, FACETS, which is illustrated with 373 benchmark papers from the Georgia High School Graduation Writing Test rated by 20 operational raters and an expert panel. (SLD)
Descriptors: Essay Tests, Evaluation Methods, Evaluators, Performance Based Assessment
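For orientation, one common form of an extended (many-facet) Rasch model of the kind FACETS implements, written generically here rather than exactly as in the article: the log-odds that examinee n receives category k rather than k-1 from rater j on task i is

  log[ P_{nijk} / P_{nij(k-1)} ] = \theta_n - \delta_i - \lambda_j - \tau_k,

where \theta_n is examinee ability, \delta_i is task difficulty, \lambda_j is rater severity, and \tau_k is the step difficulty of rating category k. The accuracy analysis in the article adds a comparison of operational raters' scores against the expert panel's benchmark ratings on top of a model of this kind.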

Veal, L. Ramon; Biesbrock, Edieann F. – Journal of Educational Measurement, 1971
Reports the development of an experimental essay test for use with young children, based on the STEP Essay Tests. The child's essay on an assigned topic is rated in comparison with previously rated and scaled essays which serve as performance models. (Author)
Descriptors: Essay Tests, Grade 2, Grade 3, Test Construction

Hogan, Thomas P.; Mishler, Carol – Journal of Educational Measurement, 1980
The relationship between scores on objective tests of language skills and on free writing tasks was analyzed for third and eighth graders. Correlations between scores were of the same magnitude as reported for college students. Differences in relationships between free-writing performance and objective test scores are discussed. (Author/RD)
Descriptors: Correlation, Elementary Education, Essay Tests, Language Skills

Bridgeman, Brent; Morgan, Rick; Wang, Ming-mei – Journal of Educational Measurement, 1997
Test results of 915 high school students taking a history examination with a choice of topics show that students were generally able to pick the topic on which they could get the highest score. Implications for fair scoring when topic choice is allowed are discussed. (SLD)
Descriptors: Essay Tests, High School Students, History, Performance Factors

Hales, Loyde W.; Tokar, Edward – Journal of Educational Measurement, 1975
Investigates the effect of initial blocks of either very good or poor essay question responses on the grades assigned to subsequent essay responses. (Author/DEP)
Descriptors: Adaptation Level Theory, Essay Tests, Grading, Performance

Blok, H. – Journal of Educational Measurement, 1985
Raters judged essays on two occasions making it possible to address the question of whether multiple ratings, however obtained, represent the same true scores. Multiple ratings of a given rater did represent the same true scores, but ratings of different raters did not. Reliability, validity, and invalidity coefficients were computed. (Author/DWH)
Descriptors: Analysis of Variance, Elementary Education, Essay Tests, Evaluators
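In classical test theory terms (a standard reading, used here only to unpack the abstract), write rating occasion o by rater r as

  X_{ro} = T_r + e_{ro}.

Two ratings "represent the same true score" when their T components are identical. The finding is that this holds for repeated ratings by the same rater (T_r is stable across occasions) but not across raters (T_r and T_{r'} differ), so a coefficient of the form \sigma^2_T / (\sigma^2_T + \sigma^2_e) computed within one rater overstates agreement between raters, which is presumably why validity and invalidity coefficients are reported alongside reliability.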

Hughes, David C.; Keeling, Brian – Journal of Educational Measurement, 1984
Several studies have shown that essays receive higher marks when preceded by poor quality scripts than when preceded by good quality scripts. This study investigated the effectiveness of providing scorers with model essays to reduce the influence of context. Context effects persisted despite the scoring procedures used. (Author/EGS)
Descriptors: Context Effect, Essay Tests, Essays, High Schools

Hughes, David C.; And Others – Journal of Educational Measurement, 1980
The effect of context on the scoring of essays was examined by arranging that the scoring of the criterion essay would be preceded either by five superior essays or by five inferior essays. The contrast in essay quality had the hypothesized effect. Other effects were not significant. (CTM)
Descriptors: Essay Tests, High Schools, Holistic Evaluation, Scoring

Chase, Clinton I. – Journal of Educational Measurement, 1983
Proposition analysis was used to equate the text base of two essays with different readability levels. Easier reading essays were given higher scores than difficult reading essays. The results appear to identify another noncontent influence on essay test scores, leaving increasingly less variance for differences in content. (Author/PN)
Descriptors: Content Analysis, Difficulty Level, Essay Tests, Higher Education

Akeju, S. A. – Journal of Educational Measurement, 1972
This study evaluated the extent to which the West African Examinations Council's marking procedures have ensured high reader reliability for the English Language Essay examination, a test designed to measure writing ability. (Author)
Descriptors: Essay Tests, Examiners, Foreign Countries, Multiple Choice Tests

Powers, Donald E.; Fowles, Mary E. – Journal of Educational Measurement, 1996
Approximately 300 prospective graduate students each wrote two essays for the Graduate Record Examinations in 40-minute and 60-minute time periods. Analysis revealed that performance was, on average, significantly better with the 60-minute limit. There was no interaction between self-described test-taking style (fast versus slow) and time limits.…
Descriptors: College Entrance Examinations, College Students, Essay Tests, Higher Education

van den Bergh, Huub; Eiting, Mindert H. – Journal of Educational Measurement, 1989
A method of assessing rater reliability via a design of overlapping rater teams is presented. Covariances or correlations of ratings can be analyzed with LISREL models. Models in which the rater reliabilities are congeneric, tau-equivalent, or parallel can be tested. Two examples based on essay ratings are presented. (TJH)
Descriptors: Analysis of Covariance, Computer Simulation, Correlation, Elementary Secondary Education
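The three model classes named in the abstract differ only in the constraints placed on a common single-factor model; in simplified generic notation (not the article's), rater j's rating of a given essay is

  X_j = \lambda_j T + e_j,

where T is the essay's true quality. Congeneric: the loadings \lambda_j and the error variances are free. Tau-equivalent: all \lambda_j are equal, but error variances may differ. Parallel: all \lambda_j are equal and all error variances are equal. Fitting these nested constraints to the raters' covariance or correlation matrix in LISREL and comparing fit indicates which level of equivalence the raters satisfy.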

Benton, Stephen L.; Kiewra, Kenneth A. – Journal of Educational Measurement, 1986
This paper assessed the relationships among holistic writing ability, the Test of Standard Written English, and four tests of organizational ability. Findings showed a significant correlation between writing ability and the tests. It was concluded that tests assessing organizational strategies ought to be included in assessments of writing…
Descriptors: Correlation, Essay Tests, Higher Education, Holistic Evaluation

Chase, Clinton I. – Journal of Educational Measurement, 1986
It is hypothesized that the readers of an essay respond to a variable in terms of its context with other variables. Sex, race, reader expectation, and quality of handwriting were crossed to study their interaction effects. Results showed complex interactions of expectations, writing, and sex within race. (Author/LMO)
Descriptors: Analysis of Variance, Elementary Secondary Education, Essay Tests, Handwriting