ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	7

Descriptor

Generalizability Theory	16
Performance Based Assessment	16
Test Reliability	16
Interrater Reliability	7
Test Validity	7
Error of Measurement	6
Evaluation Methods	5
Scores	5
Student Evaluation	5
Educational Assessment	3
Higher Education	3
Test Construction	3
Decision Making	2
Estimation (Mathematics)	2
Foreign Countries	2
Psychometrics	2
Scoring	2
Standards	2
Statistical Analysis	2
Summative Evaluation	2
Task Analysis	2
Academic Achievement	1
Allied Health Occupations…	1
Alternative Assessment	1
Biochemistry	1

Source

Advances in Health Sciences…	1
Advances in Physiology…	1
Alberta Journal of…	1
Asian Journal of Education…	1
Chemistry Education Research…	1
Educational Measurement:…	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Journal of Special Education	1
Pearson	1
Research & Practice in…	1

Publication Type

Journal Articles	10
Reports - Research	9
Reports - Evaluative	6
Speeches/Meeting Papers	4
Reports - Descriptive	1

Education Level

Higher Education	4
Postsecondary Education	2
Grade 10	1

Location

Canada	1
Colorado	1
Oklahoma	1
Turkey (Ankara)	1

Showing 1 to 15 of 16 results

Examining the Reliability of Scores from a Performance Assessment of Practice-Based Competencies

Peer reviewed

Roduta Roberts, Mary; Alves, Cecilia Brito; Werther, Karin; Bahry, Louise M. – Journal of Psychoeducational Assessment, 2019

The purpose of this study was to examine the reliability and sources of score variation from a performance assessment of practice competencies within an occupational therapy program. Data from 99 students who participated in a practical exam were examined. A generalizability analysis of analytic, total, and overall holistic scores was completed…

Descriptors: Performance Based Assessment, Test Reliability, Scores, Occupational Therapy

Using Generalizability Theory to Assess the Score Reliability of Communication Skills of Dentistry Students

Peer reviewed

Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018

The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…

Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability

Psychometric Analysis of the Thermochemistry Concept Inventory

Peer reviewed

Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014

Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…

Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory

Using Multivariate Generalizability Theory to Assess the Effect of Content Stratification on the Reliability of a Performance Assessment

Peer reviewed

Keller, Lisa A.; Clauser, Brian E.; Swanson, David B. – Advances in Health Sciences Education, 2010

In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates…

Descriptors: Generalizability Theory, Test Reliability, Performance Based Assessment, Error of Measurement

Generalizability of Student Writing across Multiple Tasks: A Challenge for Authentic Assessment

Peer reviewed

Hathcoat, John D.; Penn, Jeremy D. – Research & Practice in Assessment, 2012

Critics of standardized testing have recommended replacing standardized tests with more authentic assessment measures, such as classroom assignments, projects, or portfolios rated by a panel of raters using common rubrics. Little research has examined the consistency of scores across multiple authentic assignments or the implications of this…

Descriptors: Generalizability Theory, Performance Based Assessment, Writing Across the Curriculum, Standardized Tests

Generalizability Theory Applied to Reading Assessments for Students with Significant Cognitive Disabilities

Peer reviewed

Tindal, Gerald; Yovanoff, Paul; Geller, Josh P. – Journal of Special Education, 2010

Students with significant disabilities must participate in large-scale assessments, often using an alternate assessment judged against alternate achievement standards. The development and administration of this type of assessment must necessarily balance meaningful participation with accurate measurement. In this study, generalizability theory is…

Descriptors: Generalizability Theory, Alternative Assessment, Disabilities, Severe Mental Retardation

The Case for Performance-Based Tasks without Equating

Way, Walter D.; Murphy, Daniel; Powers, Sonya; Keng, Leslie – Pearson, 2012

Significant momentum exists for next-generation assessments to increasingly utilize technology to develop and deliver performance-based assessments. Many traditional challenges with this assessment approach still apply, including psychometric concerns related to performance-based tasks (PBTs), which include low reliability, efficiency of…

Descriptors: Task Analysis, Performance Based Assessment, Technology Uses in Education, Models

The Conventional Wisdom about Group Mean Scores.

Peer reviewed

Brennan, Robert L. – Journal of Educational Measurement, 1995

Generalizability theory is used to show that the assumption that reliability for groups is greater than that for persons (and that error variance for groups is less than that for persons) is not necessarily true. Examples are provided from course evaluation and performance test literature. (SLD)

Descriptors: Course Evaluation, Decision Making, Equations (Mathematics), Generalizability Theory

Error Sources Influencing Performance Assessment Reliability or Generalizability: A Meta Analysis.

Jiang, Ying Hong; And Others – 1997

As performance-based assessments have gained wider use, there are increasing concerns about their dependability. This study is a synthesis of existing studies regarding the reliability or generalizability of performance assessments. The meta-analysis involves summarizing, examining, and evaluating research findings. Articles on the dependability…

Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Judges

A Discussion of Analytic Scoring for Writing Performance Assessments.

Crehan, Kevin D. – 1997

Writing fits well within the realm of outcomes suitable for observation by performance assessments. Studies of the reliability of performance assessments have suggested that interrater reliability can be consistently high. Scoring consistency, however, is only one aspect of quality in decisions based on assessment results. Another is…

Descriptors: Evaluation Methods, Feedback, Generalizability Theory, Interrater Reliability

Valid and Reliable Authentic Assessment of Culminating Student Performance in the Biomedical Sciences

Peer reviewed

Oh, Deborah M.; Kim, Joshua M.; Garcia, Raymond E.; Krilowicz, Beverly L. – Advances in Physiology Education, 2005

There is increasing pressure, both from institutions central to the national scientific mission and from regional and national accrediting agencies, on natural sciences faculty to move beyond course examinations as measures of student performance and to instead develop and use reliable and valid authentic assessment measures for both individual…

Descriptors: Evaluation Methods, Biochemistry, Natural Sciences, Generalizability Theory

Technical Issues in Large-Scale Performance Assessment.

Phillips, Gary W., Ed. – 1996

Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…

Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics

Statistical Test Specifications for Performance Assessments: Is This an Oxymoron?

Reckase, Mark D. – 1997

This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…

Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability

Generalizability of Performance Assessments.

Peer reviewed

Brennan, Robert L.; Johnson, Eugene G. – Educational Measurement: Issues and Practice, 1995

The application of generalizability theory to the reliability and error variance estimation for performance assessment scores is discussed. Decision makers concerned with performance assessment need to realize the restrictions that limit generalizability such as limitations that lead to reductions in the number of tasks possible, rater quality,…

Descriptors: Decision Making, Educational Assessment, Error of Measurement, Estimation (Mathematics)

Generalizability of Written-Response Scores for the Alberta Education English 30 Diploma Examination.

Peer reviewed

Gierl, Mark J. – Alberta Journal of Educational Research, 1998

Examined the generalizability of written-response scores on the English 30 diploma examination administered to Alberta 12th-grade students. Student scores differed as a function of rater, but this variance component was small across two tasks and two administrations; score generalizability was high using a two-rater system; and scale variability…

Descriptors: Error of Measurement, Foreign Countries, Generalizability Theory, High School Seniors

Author

Brennan, Robert L.	2
Aktas, Mehtap	1
Alves, Cecilia Brito	1
Asiret, Semih	1
Bahry, Louise M.	1
Barbera, Jack	1
Clauser, Brian E.	1
Crehan, Kevin D.	1
Garcia, Raymond E.	1
Geller, Josh P.	1
Gierl, Mark J.	1
Hathcoat, John D.	1
Jiang, Ying Hong	1
Johnson, Eugene G.	1
Keller, Lisa A.	1
Keng, Leslie	1
Kim, Joshua M.	1
Krilowicz, Beverly L.	1
Murphy, Daniel	1
Oh, Deborah M.	1
Penn, Jeremy D.	1
Phillips, Gary W., Ed.	1
Powers, Sonya	1
Reckase, Mark D.	1