Showing 1 to 15 of 22 results
Peer reviewed
Conti, Maria; LaMance, Rachel; Miller-Cochran, Susan – Composition Forum, 2017
To address the needs and interests of primary stakeholders in a writing program, this article presents a model of "grassroots" assessment that involves instructors from all ranks as well as students in the development, facilitation, and interpretation of assessment results. The authors describe two assessment plans that measured student…
Descriptors: Writing Improvement, Needs Assessment, Stakeholders, Student Needs
Peer reviewed
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012
Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance, whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…
Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory
Peer reviewed
Albertson, Bonnie – Research in the Teaching of English, 2007
The primary purpose of this study was to investigate the efficacy of formulaic writing such as the five-paragraph theme (FPT) or essay for the purpose of earning high scores on high-stakes writing assessments. This qualitative descriptive study analyzed more than 1000 essays from Delaware Grade 8 and 10 writers, written for a statewide…
Descriptors: Grade 8, Grade 10, Testing Programs, Essays
Peer reviewed
Moon, Tonya R.; Hughes, Kevin R. – Educational Measurement: Issues and Practice, 2002
Examined a scoring anomaly that became apparent in a state-mandated writing assessment. Results for 3,660 essays by sixth graders show that using a spiral model for training raters and scoring papers results in higher mean ratings than does using a sequential model for training and scoring. Findings demonstrate the importance of making decisions…
Descriptors: Elementary School Students, Essay Tests, Intermediate Grades, Scoring
Peer reviewed
Congdon, Peter J.; McQueen, Joy – Journal of Educational Measurement, 2000
Studied the stability of rater severity over an extended rating period by applying multifaceted Rasch analysis to ratings of 16 raters of writing performances of 8,285 elementary school students. Findings cast doubt on the practice of using a single calibration of rater severity as the basis for adjustment of person measures. (SLD)
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Interrater Reliability
Gyagenda, Ismail S.; Engelhard, George, Jr. – 1998
The purpose of this study was to examine rater, domain, and gender influences on the assessed quality of student writing using weighted and unweighted scores. Twenty raters were randomly selected from a group of 87 operational raters contracted to rate essays as part of the 1993 field test of the Georgia High School Writing Test. All of the raters…
Descriptors: Essay Tests, Evaluators, High School Students, High Schools
Gyagenda, Ismail S.; Engelhard, George, Jr. – 1998
The purpose of this study was to describe the Rasch model for measurement and apply the model to examine the relationship between raters, domains of written compositions, and student writing ability. Twenty raters were randomly selected from a group of 87 operational raters contracted to rate essays as part of the 1993 field test of the Georgia…
Descriptors: Difficulty Level, Essay Tests, Evaluators, High School Students
Peer reviewed
Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007
This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…
Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills
Crehan, Kevin D.; Curfman, Mary – 1999
The effect of rapid feedback for a state writing assessment on subsequent writing performance was investigated. In addition, the agreement between teachers' scores for the state writing assessment and state department scores was analyzed. Eighth grade English teachers (n=8) were trained in analytic trait scoring of writing assessments. They then…
Descriptors: Elementary School Teachers, English, Feedback, Junior High Schools
Wolf, Shelby A.; McIver, Monette C. – 1998
In 1990, the state of Kentucky created a new school system through the Kentucky Education Reform Act (KERA). While KERA mandates wide-ranging progressive reform, testing through the Kentucky Instructional Results Information System (KIRIS) makes sure teachers get the job done. Though all Kentucky teachers are involved in writing, those at the…
Descriptors: Case Studies, Educational Change, Grade 7, Junior High Schools
Zhang, Liru – 2000
This study investigated possible reasons for the low performance on the text-based writing assessment of the Delaware Student Testing Program (DSTP) in 2000, especially for grades 3 and 5, and considered ways to improve classroom instruction. In the first part of the study, a panel of teachers reviewed the anchor papers from the assessment and…
Descriptors: Academic Achievement, Elementary Education, Elementary School Students, Low Achievement
Peer reviewed
Gabrielson, Stephen; And Others – Applied Measurement in Education, 1995
The effects of presenting a choice of writing tasks on the quality of essays produced by eleventh graders were studied with 34,200 students in Georgia. The choice condition had no substantive effect on the quality of essays, but race, gender, and the writing task variable did. (SLD)
Descriptors: Essay Tests, Grade 11, High School Students, High Schools
Yen, Shu Jing; Ochieng, Charles; Michaels, Hillary; Friedman, Greg – Online Submission, 2005
Year-to-year rater variation may result in constructed response (CR) parameter changes, making CR items inappropriate to use in anchor sets for linking or equating. This study demonstrates how rater severity affected the writing and reading scores. Rater adjustments were made to statewide results using an item response theory (IRT) methodology…
Descriptors: Test Items, Writing Tests, Reading Tests, Measures (Individuals)
Welch, Catherine; And Others – 1989
The differential performance of black and white college freshmen on a direct measure of writing skills, the essay portion of the American College Testing Program's Collegiate Assessment of Academic Proficiency (CAAP), was studied. The data consisted of responses of a sample of 998 black and 3,727 white examinees to two individual prompts for…
Descriptors: Black Students, College Freshmen, Comparative Testing, Essay Tests