NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 10 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Patrick, Helen; French, Brian F.; Mantzicopoulos, Panayota – Journal of Psychoeducational Assessment, 2020
We evaluated the score stability of the Framework for Teaching (FFT), a prominent observation instrument used for teacher evaluation. Three raters each scored 200 reading and mathematics lessons taught by 20 kindergarten teachers. Using Generalizability theory analyses, we decomposed the FFT's Classroom Environment, Instruction, and Total scores…
Descriptors: Teacher Evaluation, Observation, Scores, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
D'Agostino, Jerome V.; Rodgers, Emily; Winkler, Christa; Johnson, Tracy; Berenbon, Rebecca – Reading Psychology, 2021
Running Records provide a standardized method for recording and assessing students' oral reading behaviors and are excellent formative assessment tools to guide instructional decision-making. This study expands on prior Running Record reliability work by evaluating the extent to which external raters and teachers consistently assessed students'…
Descriptors: Accuracy, Oral Reading, Generalizability Theory, Error Correction
Peer reviewed Peer reviewed
Direct linkDirect link
Briggs, Derek C.; Alzen, Jessica L. – Educational and Psychological Measurement, 2019
Observation protocol scores are commonly used as status measures to support inferences about teacher practices. When multiple observations are collected for the same teacher over the course of a year, some portion of a teacher's score on each occasion may be attributable to the rater, lesson, and the time of year of the observation. All three of…
Descriptors: Observation, Inferences, Generalizability Theory, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Wilson, Joshua; Chen, Dandan; Sandbank, Micheal P.; Hebert, Michael – Journal of Educational Psychology, 2019
The present study examined issues pertaining to the reliability of writing assessment in the elementary grades, and among samples of struggling and nonstruggling writers. The present study also extended nascent research on the reliability and the practical applications of automated essay scoring (AES) systems in Response to Intervention frameworks…
Descriptors: Computer Assisted Testing, Automation, Scores, Writing Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Keller-Margulis, Milena A.; Mercer, Sterett H.; Thomas, Erin L. – School Psychology Quarterly, 2016
The purpose of this study was to examine the reliability of written expression curriculum-based measurement (WE-CBM) in the context of universal screening from a generalizability theory framework. Students in second through fifth grade (n = 145) participated in the study. The sample included 54% female students, 49% White students, 23% African…
Descriptors: Generalizability Theory, Reliability, Written Language, Curriculum Based Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013
Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…
Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores
Bloom, Howard S.; Porter, Kristin E. – Society for Research on Educational Effectiveness, 2012
In recent years, the regression discontinuity design (RDD) has gained widespread recognition as a quasi-experimental method that when used correctly, can produce internally valid estimates of causal effects of a treatment, a program or an intervention (hereafter referred to as treatment effects). In an RDD study, subjects or groups of subjects…
Descriptors: Regression (Statistics), Research Design, Computation, Generalizability Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Volpe, Robert J.; McConaughy, Stephanie H.; Hintze, John M. – School Psychology Review, 2009
The present study used generalizability theory to investigate the dependability of systematic observations of students' problem behavior and on-task behavior in classrooms. The Direct Observation Form (McConaughy & Achenbach, 2009) was used with a sample of 24, 6- to-11-year-old children attending 18 different elementary schools. The participants…
Descriptors: Generalizability Theory, Behavior Problems, Student Behavior, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Bergeron, Renee; Floyd, Randy G.; McCormack, Allison C.; Farmer, William L. – School Psychology Review, 2008
The dependability of externalizing behavior composites and subscale scores from the Behavior Assessment System for Children, Second Edition, Teacher Rating Scale-Child (Reynolds & Kamphaus, 2004) and the Achenbach System of Empirically Based Assessment, Teacher's Report Form for Ages 6-18 (Achenbach & Rescorla, 2001) was investigated.…
Descriptors: Generalizability Theory, Scores, Rating Scales, Error of Measurement