Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 4 |
Descriptor
Generalizability Theory | 12 |
Performance Based Assessment | 12 |
Test Validity | 12 |
Test Reliability | 7 |
Interrater Reliability | 5 |
Student Evaluation | 4 |
Educational Assessment | 3 |
Evaluation Methods | 3 |
Statistical Analysis | 3 |
Academic Achievement | 2 |
Accountability | 2 |
More ▼ |
Source
Journal of Educational… | 2 |
Advances in Physiology… | 1 |
Chemistry Education Research… | 1 |
Educational and Psychological… | 1 |
Journal of Special Education | 1 |
National Academy of Education | 1 |
Pearson | 1 |
Author
Shavelson, Richard J. | 2 |
Abedi, Jamal | 1 |
Baker, Eva L. | 1 |
Barbera, Jack | 1 |
Garcia, Raymond E. | 1 |
Geller, Josh P. | 1 |
Keng, Leslie | 1 |
Kim, Joshua M. | 1 |
Krilowicz, Beverly L. | 1 |
Lane, Suzanne | 1 |
Moore, Alan D. | 1 |
More ▼ |
Publication Type
Journal Articles | 6 |
Reports - Evaluative | 6 |
Reports - Research | 6 |
Speeches/Meeting Papers | 3 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 2 |
Grade 10 | 1 |
Postsecondary Education | 1 |
Audience
Location
Colorado | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Teacher Performance… | 1 |
edTPA (Teacher Performance… | 1 |
What Works Clearinghouse Rating
Peck, Charles A.; Young, Maia Goodman; Zhang, Wenqi – National Academy of Education, 2021
In this paper the authors examine the uses of teaching performance assessments (TPAs) as resources for learning, program evaluation, and improvement in teacher education. The authors begin by outlining their conceptual framing and related research questions about the uses of TPAs as resources for program evaluation and improvement. They describe…
Descriptors: Performance Based Assessment, Preservice Teachers, Teacher Evaluation, Program Evaluation
Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014
Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…
Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory
Tindal, Gerald; Yovanoff, Paul; Geller, Josh P. – Journal of Special Education, 2010
Students with significant disabilities must participate in large-scale assessments, often using an alternate assessment judged against alternate achievement standards. The development and administration of this type of assessment must necessarily balance meaningful participation with accurate measurement. In this study, generalizability theory is…
Descriptors: Generalizability Theory, Alternative Assessment, Disabilities, Severe Mental Retardation
Way, Walter D.; Murphy, Daniel; Powers, Sonya; Keng, Leslie – Pearson, 2012
Significant momentum exists for next-generation assessments to increasingly utilize technology to develop and deliver performance-based assessments. Many traditional challenges with this assessment approach still apply, including psychometric concerns related to performance-based tasks (PBTs), which include low reliability, efficiency of…
Descriptors: Task Analysis, Performance Based Assessment, Technology Uses in Education, Models
Oh, Deborah M.; Kim, Joshua M.; Garcia, Raymond E.; Krilowicz, Beverly L. – Advances in Physiology Education, 2005
There is increasing pressure, both from institutions central to the national scientific mission and from regional and national accrediting agencies, on natural sciences faculty to move beyond course examinations as measures of student performance and to instead develop and use reliable and valid authentic assessment measures for both individual…
Descriptors: Evaluation Methods, Biochemistry, Natural Sciences, Generalizability Theory
Moore, Alan D.; Young, Suzanne – 1997
As schools move toward performance assessment, there is increasing discussion of using these assessments for accountability purposes. When used for making decisions, performance assessments must meet high standards of validity and reliability. One major source of unreliability in performance assessments is interrater disagreement. In this paper,…
Descriptors: Accountability, Correlation, Elementary Secondary Education, Generalizability Theory
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Reckase, Mark D. – 1997
This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…
Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability

Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1993
Evidence is presented on the generalizability and convergent validity of performance assessments using data from six studies of student achievement that sampled a wide range of measurement facets and methods. Results at individual and school levels indicate that task-sampling variability is the major source of measurement error. (SLD)
Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Generalizability Theory

Lane, Suzanne; And Others – Journal of Educational Measurement, 1996
Evidence from test results of 3,604 sixth and seventh graders is provided for the generalizability and validity of the Quantitative Understanding: Amplifying Student Achievement and Reasoning (QUASAR) Cognitive Assessment Instrument, which is designed to measure program outcomes and growth in mathematics. (SLD)
Descriptors: Achievement Tests, Cognitive Processes, Elementary Education, Elementary School Students

Abedi, Jamal; Baker, Eva L. – Educational and Psychological Measurement, 1995
Results from a performance assessment in which 68 high school students wrote essays support the use of latent variable modeling for estimating reliability, concurrent validity, and generalizability of a scoring rubric. The latent variable modeling approach overcomes the limitations of certain conventional statistical techniques in handling…
Descriptors: Criteria, Essays, Estimation (Mathematics), Generalizability Theory
Shavelson, Richard J.; And Others – 1993
In this paper, performance assessments are cast within a sampling framework. A performance assessment score is viewed as a sample of student performance drawn from a complex universe defined by a combination of all possible tasks, occasions, raters, and measurement methods. Using generalizability theory, the authors present evidence bearing on the…
Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Evaluators