ERIC - Search Results

Showing all 6 results

An Application of Generalizability Theory to Evaluate the Technical Quality of an Alternate Assessment

Peer reviewed

Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013

Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…

Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores

Assessing the Generalizability of Estimates of Causal Effects from Regression Discontinuity Designs

Bloom, Howard S.; Porter, Kristin E. – Society for Research on Educational Effectiveness, 2012

In recent years, the regression discontinuity design (RDD) has gained widespread recognition as a quasi-experimental method that, when used correctly, can produce internally valid estimates of causal effects of a treatment, a program or an intervention (hereafter referred to as treatment effects). In an RDD study, subjects or groups of subjects…

Descriptors: Regression (Statistics), Research Design, Computation, Generalizability Theory

The Effect of Observation Length and Presentation Order on the Reliability and Validity of an Observational Measure of Teaching Quality

Peer reviewed

Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014

Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…

Descriptors: Observation, Teacher Evaluation, Reliability, Validity

Designing and Scaling Level-Specific Writing Tasks in Alignment with the CEFR: A Test-Centered Approach

Peer reviewed

Harsch, Claudia; Rupp, Andre Alexander – Language Assessment Quarterly, 2011

The "Common European Framework of Reference" (CEFR; Council of Europe, 2001) provides a competency model that is increasingly used as a point of reference to compare language examinations. Nevertheless, aligning examinations to the CEFR proficiency levels remains a challenge. In this article, we propose a new, level-centered approach to…

Descriptors: Language Tests, Writing Tests, Test Construction, Test Items

Developing Indicators of Classroom Practice to Evaluate the Impact of District Mathematics Reform Initiative: A Generalizability Analysis

Peer reviewed

Newton, Xiaoxia A. – Studies in Educational Evaluation, 2010

This paper reported results from a generalizability study that examined the process of developing classroom practice indicators used to evaluate the impact of a school district's mathematics reform initiative. The study utilized classroom observational data from 32 second, fourth, eighth, and tenth grade teachers. The study addresses important…

Descriptors: Generalizability Theory, Theory Practice Relationship, Program Effectiveness, Grade 10

The Case for Performance-Based Tasks without Equating

Way, Walter D.; Murphy, Daniel; Powers, Sonya; Keng, Leslie – Pearson, 2012

Significant momentum exists for next-generation assessments to increasingly utilize technology to develop and deliver performance-based assessments. Many traditional challenges with this assessment approach still apply, including psychometric concerns related to performance-based tasks (PBTs), which include low reliability, efficiency of…

Descriptors: Task Analysis, Performance Based Assessment, Technology Uses in Education, Models