Roduta Roberts, Mary; Alves, Cecilia Brito; Werther, Karin; Bahry, Louise M. – Journal of Psychoeducational Assessment, 2019
The purpose of this study was to examine the reliability and sources of score variation from a performance assessment of practice competencies within an occupational therapy program. Data from 99 students who participated in a practical exam were examined. A generalizability analysis of analytic, total, and overall holistic scores was completed…
Descriptors: Performance Based Assessment, Test Reliability, Scores, Occupational Therapy
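The generalizability analyses described in entries like this one rest on estimating variance components and combining them into a generalizability (G) coefficient. As a hypothetical illustration (not code from any of the listed studies), a minimal one-facet persons × raters G-study can be computed from the two-way ANOVA expected mean squares:

```python
def g_study_pxr(scores):
    """One-facet persons x raters G-study (fully crossed, no replication).

    scores: list of rows, one row of rater scores per person.
    Returns (var_p, var_r, var_pr): estimated variance components for
    persons, raters, and person-x-rater interaction confounded with error.
    """
    n_p, n_r = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n_p * n_r)
    p_means = [sum(row) / n_r for row in scores]
    r_means = [sum(scores[p][r] for p in range(n_p)) / n_p for r in range(n_r)]

    ms_p = n_r * sum((m - grand) ** 2 for m in p_means) / (n_p - 1)
    ms_r = n_p * sum((m - grand) ** 2 for m in r_means) / (n_r - 1)
    ss_res = sum((scores[p][r] - p_means[p] - r_means[r] + grand) ** 2
                 for p in range(n_p) for r in range(n_r))
    ms_pr = ss_res / ((n_p - 1) * (n_r - 1))

    # Solve the expected-mean-square equations for the crossed p x r design
    var_pr = ms_pr
    var_p = max((ms_p - ms_pr) / n_r, 0.0)
    var_r = max((ms_r - ms_pr) / n_p, 0.0)
    return var_p, var_r, var_pr

def g_coefficient(var_p, var_pr, n_raters):
    """Relative G coefficient for a D-study averaging over n_raters raters."""
    return var_p / (var_p + var_pr / n_raters)

# Additive toy data: person effect + rater effect, no interaction
data = [[0.0, 1.0], [1.0, 2.0], [2.0, 3.0]]
var_p, var_r, var_pr = g_study_pxr(data)
```

With purely additive toy data the interaction component is zero, so the relative G coefficient is 1.0; real rating data would yield a nonzero person × rater component and a coefficient below 1.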
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of dentistry students' performance scores on communication skills and to examine scoring reliability using generalizability theory with balanced random- and fixed-facet (mixed-design) data, also considering student, rater, and task interactions. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Martínez, José Felipe; Kloser, Matt; Srinivasan, Jayashri; Stecher, Brian; Edelman, Amanda – Educational and Psychological Measurement, 2022
Adoption of new instructional standards in science demands high-quality information about classroom practice. Teacher portfolios can be used to assess instructional practice and support teacher self-reflection anchored in authentic evidence from classrooms. This study investigated a new type of electronic portfolio tool that allows efficient…
Descriptors: Science Instruction, Academic Standards, Instructional Innovation, Electronic Publishing
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Schmidgall, Jonathan – Applied Measurement in Education, 2017
This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…
Descriptors: Scores, Reliability, Validity, Generalizability Theory
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Harik, Polina; Clauser, Brian E.; Grabovsky, Irina; Nungester, Ronald J.; Swanson, Dave; Nandakumar, Ratna – Journal of Educational Measurement, 2009
The present study examined the long-term usefulness of estimated parameters used to adjust the scores from a performance assessment to account for differences in rater stringency. Ratings from four components of the USMLE® Step 2 Clinical Skills Examination data were analyzed. A generalizability-theory framework was used to examine the extent to…
Descriptors: Generalizability Theory, Performance Based Assessment, Performance Tests, Clinical Experience
Hathcoat, John D.; Penn, Jeremy D. – Research & Practice in Assessment, 2012
Critics of standardized testing have recommended replacing standardized tests with more authentic assessment measures, such as classroom assignments, projects, or portfolios rated by a panel of raters using common rubrics. Little research has examined the consistency of scores across multiple authentic assignments or the implications of this…
Descriptors: Generalizability Theory, Performance Based Assessment, Writing Across the Curriculum, Standardized Tests

Brennan, Robert L. – Journal of Educational Measurement, 1995
Generalizability theory is used to show that the assumption that reliability for groups is greater than that for persons (and that error variance for groups is less than that for persons) is not necessarily true. Examples are provided from course evaluation and performance test literature. (SLD)
Descriptors: Course Evaluation, Decision Making, Equations (Mathematics), Generalizability Theory
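Brennan's point can be shown with back-of-the-envelope arithmetic. In the sketch below the variance components are hypothetical values chosen for illustration, not taken from the paper: when the between-group component is small relative to the within-group (person) component, the generalizability coefficient for group means falls well below the coefficient for individual scores.

```python
# Hypothetical variance components (illustrative only, not from Brennan):
var_g = 0.05     # between-group variance
var_pg = 0.95    # persons-within-groups variance
var_res = 1.00   # person-x-item interaction / residual variance
n_items = 20     # items per person
n_persons = 10   # persons per group

# G coefficient for individual scores: persons (ignoring groups) crossed
# with items; between-person variance is var_g + var_pg.
person_rel = (var_g + var_pg) / (var_g + var_pg + var_res / n_items)

# G coefficient for group means: persons and items are both sampled,
# so var_pg and var_res both count as error for group-level decisions.
group_rel = var_g / (var_g + var_pg / n_persons
                     + var_res / (n_persons * n_items))

print(round(person_rel, 3))  # 0.952
print(round(group_rel, 3))   # 0.333
```

Despite averaging over ten persons per group, group-mean reliability here is far below person-level reliability, because almost all score variance lies within rather than between groups.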
Brennan, Robert L. – 1993
Not infrequently, investigators assume that reliability for groups is greater than reliability for persons, or that the error variance for groups is less than that for persons. Using generalizability theory, it is shown that this "conventional wisdom" is not necessarily true. Examples are provided from the course-evaluation and the…
Descriptors: Comparative Analysis, Course Evaluation, Generalizability Theory, Measurement Techniques
Kim, Sungsook C. – 2000
The generalizability of scores from different scales in performance assessment was studied. First, a concept map of teachers' and raters' perceptions about various scores and scales was constructed using multidimensional scaling analysis. Then, a generalizability study using a random, partially nested design was conducted to analyze the…
Descriptors: Concept Mapping, Foreign Countries, Generalizability Theory, Middle School Students

Cronbach, Lee J.; And Others – Educational and Psychological Measurement, 1997
Through the standard error, rather than a reliability coefficient, generalizability theory provides an indicator of the uncertainty attached to school and individual scores on performance assessments. Recommendations are made to apply generalizability theory to current performance assessments, emphasizing practices that differ from usual…
Descriptors: Academic Achievement, Error of Measurement, Generalizability Theory, Performance Based Assessment
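Cronbach and colleagues' recommendation to report a standard error rather than a reliability coefficient amounts to summing the error-relevant variance components, scaled by the D-study sample sizes, and taking the square root. A minimal sketch for a persons × raters design, using hypothetical component values (not from the article):

```python
import math

# Hypothetical p x r variance components (illustrative values only)
var_r = 0.20    # rater main effect
var_pr = 0.60   # person-x-rater interaction / residual
n_raters = 3

# Absolute error variance: every non-person component contributes,
# each divided by the number of raters averaged over in the D-study.
abs_error_var = var_r / n_raters + var_pr / n_raters
sem_absolute = math.sqrt(abs_error_var)

# Relative error variance: only components that interact with persons.
rel_error_var = var_pr / n_raters
sem_relative = math.sqrt(rel_error_var)
```

The absolute standard error is the one relevant to reporting a score's uncertainty on its own scale (as Cronbach et al. recommend for school and individual performance-assessment scores); the relative error applies only to rank-order comparisons.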

Ruiz-Primo, Maria Araceli; And Others – Journal of Educational Measurement, 1993
The stability of scores on 2 types of performance assessments, an observed hands-on investigation and a notebook surrogate, was investigated for 29 sixth graders on 2 occasions. Results indicate that student performance and procedures changed and that generalizability across occasions was moderate. Implications for assessment are discussed. (SLD)
Descriptors: Educational Assessment, Elementary School Students, Error of Measurement, Generalizability Theory
Shavelson, Richard J.; And Others – 1993
In this paper, performance assessments are cast within a sampling framework. A performance assessment score is viewed as a sample of student performance drawn from a complex universe defined by a combination of all possible tasks, occasions, raters, and measurement methods. Using generalizability theory, the authors present evidence bearing on the…
Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Evaluators