ERIC - Search Results

Source

Author

Barrett, Thomas J.	1
Crehan, Kevin D.	1
Dirir, Mohamed A.	1
Griffin, Patrick	1
Johnson, Eugene G.	1
Kao, Chi-Wen	1
Kaplan, Bruce A.	1
Kuhlemeir, Hans	1
Vansickle, Timothy R.	1
Wolfe, Edward W.	1

Publication Type

Speeches/Meeting Papers	8
Reports - Evaluative	5
Reports - Research	3
Tests/Questionnaires	1

Education Level

Audience

Location

Australia	1
Netherlands	1

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

The Relationship between Scoring Procedures and Focus and the Reliability of Direct Writing Assessment Scores.

Download full text

Wolfe, Edward W.; Kao, Chi-Wen – 1996

This paper reports the results of an analysis of the relationship between scorer behaviors and score variability. Thirty-six essay scorers were interviewed and asked to perform a think-aloud task as they scored 24 essays. Each comment made by a scorer was coded according to its content focus (i.e. appearance, assignment, mechanics, communication,…

Descriptors: Content Analysis, Educational Assessment, Essays, Evaluation Methods

Generalizability of Writing Tasks at Fourth Grade in the Riverside Unified School District.

Download full text

Barrett, Thomas J. – 1994

Students at grades four and five were administered a writing assessment that was developed to correspond to the California Learning Assessment System (CLAS) writing tasks at grade four. Teachers were trained to score the CLAS-like tasks according to the rubric developed by the State for CLAS. In addition, 164 students at three schools in the…

Descriptors: Evaluation Methods, Grade 4, Intermediate Grades, Student Evaluation

A Discussion of Analytic Scoring for Writing Performance Assessments.

Download full text

Crehan, Kevin D. – 1997

Writing fits well within the realm of outcomes suitable for observation by performance assessments. Studies of the reliability of performance assessments have suggested that interrater reliability can be consistently high. Scoring consistency, however, is only one aspect of quality in decisions based on assessment results. Another is…

Descriptors: Evaluation Methods, Feedback, Generalizability Theory, Interrater Reliability

Characteristics of the Test Components of the IELTS Battery: Australian Trial Data.

Download full text

Griffin, Patrick – 1990

Results of the International English Language Testing System (IELTS) battery trials in Australia are reported. The IELTS tests of productive language skills use direct assessment strategies and subjective scoring according to detailed guidelines. The receptive skills tests use indirect assessment strategies and clerical scoring procedures.…

Descriptors: English (Second Language), Foreign Countries, Grammar, Interrater Reliability

Construction of Parallel Test Forms Using Optimal Test Designs.

Download full text

Dirir, Mohamed A. – 1995

The effectiveness of an optimal item selection method in designing parallel test forms was studied during the development of two forms that were parallel to an existing form for each of three language arts tests for fourth graders used in the Connecticut Mastery Test. Two listening comprehension forms, two reading comprehension forms, and two…

Descriptors: Elementary School Students, Grade 4, Intermediate Grades, Item Banks

Reliability of Professionally Scored Data: NAEP-Related Issues.

Kaplan, Bruce A.; Johnson, Eugene G. – 1992

Across the field of educational assessment the case has been made for alternatives to the multiple-choice item type. Most of the alternative types of items require a subjective evaluation by a rater. The reliability of this subjective rating is a key component of these types of alternative items. In this paper, measures of reliability are…

Descriptors: Educational Assessment, Elementary Secondary Education, Estimation (Mathematics), Evaluators

Multilevel Factor Analysis Applied to National Assessment Data.

Download full text

Kuhlemeir, Hans; And Others – 1995

In the Dutch Educational Assessment Program, the students' language proficiency is measured in grade 9, at age 15. Writing performance is measured through several performance-based writing tasks, rated on numerous aspects such as content, style, organization, punctuation, spelling and grammar. As a consequence, national performance levels are…

Descriptors: Educational Assessment, Factor Analysis, Factor Structure, Foreign Countries

Work Keys: Developing a Usable Scale for Multi-Level, Criterion-Referenced Assessments.

Download full text

Vansickle, Timothy R. – 1992

The scaling of a new assessment is a significant undertaking. The scaling of a new assessment designed as a multiple-level, criterion-referenced assessment is even more so. A Guttman approach to scaling was used with the Work Keys selected-response assessments, Reading for Information and Applied Mathematics. Assessments in development in the Work…

Descriptors: Criterion Referenced Tests, Employment Qualifications, High School Students, High Schools

Test Reliability	8
Writing Tests	8
Test Construction	4
Educational Assessment	3
Evaluation Methods	3
Interrater Reliability	3
Performance Based Assessment	3
Scores	3
Scoring	3
Test Use	3
Evaluators	2
Foreign Countries	2
Grade 4	2
High School Students	2
High Schools	2
Intermediate Grades	2
Listening Comprehension Tests	2
National Surveys	2
Reading Tests	2
Test Validity	2
Testing Problems	2
Writing (Composition)	2
Writing Evaluation	2
Content Analysis	1
Criterion Referenced Tests	1
More ▼