Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013
Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…
Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores
Bloom, Howard S.; Porter, Kristin E. – Society for Research on Educational Effectiveness, 2012
In recent years, the regression discontinuity design (RDD) has gained widespread recognition as a quasi-experimental method that, when used correctly, can produce internally valid estimates of causal effects of a treatment, a program or an intervention (hereafter referred to as treatment effects). In an RDD study, subjects or groups of subjects…
Descriptors: Regression (Statistics), Research Design, Computation, Generalizability Theory
Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014
Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…
Descriptors: Observation, Teacher Evaluation, Reliability, Validity
Harsch, Claudia; Rupp, Andre Alexander – Language Assessment Quarterly, 2011
The "Common European Framework of Reference" (CEFR; Council of Europe, 2001) provides a competency model that is increasingly used as a point of reference to compare language examinations. Nevertheless, aligning examinations to the CEFR proficiency levels remains a challenge. In this article, we propose a new, level-centered approach to…
Descriptors: Language Tests, Writing Tests, Test Construction, Test Items
Newton, Xiaoxia A. – Studies in Educational Evaluation, 2010
This paper reported results from a generalizability study that examined the process of developing classroom practice indicators used to evaluate the impact of a school district's mathematics reform initiative. The study utilized classroom observational data from 32 second, fourth, eighth, and tenth grade teachers. The study addresses important…
Descriptors: Generalizability Theory, Theory Practice Relationship, Program Effectiveness, Grade 10
Way, Walter D.; Murphy, Daniel; Powers, Sonya; Keng, Leslie – Pearson, 2012
Significant momentum exists for next-generation assessments to increasingly utilize technology to develop and deliver performance-based assessments. Many traditional challenges with this assessment approach still apply, including psychometric concerns related to performance-based tasks (PBTs), which include low reliability, efficiency of…
Descriptors: Task Analysis, Performance Based Assessment, Technology Uses in Education, Models