Showing all 15 results
Peer reviewed
Direct link
Martínez, José Felipe; Kloser, Matt; Srinivasan, Jayashri; Stecher, Brian; Edelman, Amanda – Educational and Psychological Measurement, 2022
Adoption of new instructional standards in science demands high-quality information about classroom practice. Teacher portfolios can be used to assess instructional practice and support teacher self-reflection anchored in authentic evidence from classrooms. This study investigated a new type of electronic portfolio tool that allows efficient…
Descriptors: Science Instruction, Academic Standards, Instructional Innovation, Electronic Publishing
Peer reviewed
Direct link
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Peer reviewed
Direct link
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Peer reviewed
Direct link
Keller, Lisa A.; Clauser, Brian E.; Swanson, David B. – Advances in Health Sciences Education, 2010
In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates…
Descriptors: Generalizability Theory, Test Reliability, Performance Based Assessment, Error of Measurement
Peer reviewed
PDF on ERIC
Hathcoat, John D.; Penn, Jeremy D. – Research & Practice in Assessment, 2012
Critics of standardized testing have recommended replacing standardized tests with more authentic assessment measures, such as classroom assignments, projects, or portfolios rated by a panel of raters using common rubrics. Little research has examined the consistency of scores across multiple authentic assignments or the implications of this…
Descriptors: Generalizability Theory, Performance Based Assessment, Writing Across the Curriculum, Standardized Tests
Betebenner, Damian W. – 1998
The zeitgeist for reform in education precipitated a number of changes in assessment. Among these are performance assessments, sometimes linked to "high stakes" accountability decisions. In some instances, the trustworthiness of these decisions is based on variance components and error variances derived through generalizability theory.…
Descriptors: Accountability, Educational Change, Error of Measurement, Generalizability Theory
Miller, M. David – 2002
In 1994 the State Collaborative on Assessment and Student Standards of the Council of Chief State School Officers began a study to examine the generalizability of performance-based assessments (PBAs) for state-mandated assessment programs. The intent was to examine the major sources of error associated with PBAs and the generalizability and…
Descriptors: Elementary Secondary Education, Error of Measurement, Generalizability Theory, Performance Based Assessment
Jiang, Ying Hong; And Others – 1997
As performance-based assessments have gained wider use, there are increasing concerns about their dependability. This study is a synthesis of existing studies regarding the reliability or generalizability of performance assessments. The meta-analysis involves summarizing, examining, and evaluating research findings. Articles on the dependability…
Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Judges
Peer reviewed
Cronbach, Lee J.; And Others – Educational and Psychological Measurement, 1997
Through the standard error, rather than a reliability coefficient, generalizability theory provides an indicator of the uncertainty attached to school and individual scores on performance assessments. Recommendations are made to apply generalizability theory to current performance assessments, emphasizing practices that differ from usual…
Descriptors: Academic Achievement, Error of Measurement, Generalizability Theory, Performance Based Assessment
Peer reviewed
Brennan, Robert L.; Johnson, Eugene G. – Educational Measurement: Issues and Practice, 1995
The application of generalizability theory to the reliability and error variance estimation for performance assessment scores is discussed. Decision makers concerned with performance assessment need to realize the restrictions that limit generalizability such as limitations that lead to reductions in the number of tasks possible, rater quality,…
Descriptors: Decision Making, Educational Assessment, Error of Measurement, Estimation (Mathematics)
Peer reviewed
Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1993
Evidence is presented on the generalizability and convergent validity of performance assessments using data from six studies of student achievement that sampled a wide range of measurement facets and methods. Results at individual and school levels indicate that task-sampling variability is the major source of measurement error. (SLD)
Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Generalizability Theory
Peer reviewed
Gierl, Mark J. – Alberta Journal of Educational Research, 1998
Examined the generalizability of written-response scores on the English 30 diploma examination administered to Alberta 12th-grade students. Student scores differed as a function of rater, but this variance component was small across two tasks and two administrations; score generalizability was high using a two-rater system; and scale variability…
Descriptors: Error of Measurement, Foreign Countries, Generalizability Theory, High School Seniors
Peer reviewed
Ruiz-Primo, Maria Araceli; And Others – Journal of Educational Measurement, 1993
The stability of scores on 2 types of performance assessments, an observed hands-on investigation and a notebook surrogate, was investigated for 29 sixth graders on 2 occasions. Results indicate that student performance and procedures changed and that generalizability across occasions was moderate. Implications for assessment are discussed. (SLD)
Descriptors: Educational Assessment, Elementary School Students, Error of Measurement, Generalizability Theory
Linn, Robert L. – 1994
The New Standards Project conducted a pilot test of a series of performance-based assessment tasks in mathematics and English language arts at Grades 4 and 8 in the spring of 1993. This paper reports the results of a series of generalizability analyses conducted for a subset of the 1993 pilot study data in mathematics. Generalizability analyses…
Descriptors: Decision Making, Educational Assessment, Elementary Education, Elementary School Students
Shavelson, Richard J.; And Others – 1993
In this paper, performance assessments are cast within a sampling framework. A performance assessment score is viewed as a sample of student performance drawn from a complex universe defined by a combination of all possible tasks, occasions, raters, and measurement methods. Using generalizability theory, the authors present evidence bearing on the…
Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Evaluators
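Most of the studies listed above rest on generalizability theory's core move: decomposing observed score variance into variance components (persons, tasks, raters, interactions) and using them to estimate error variance and score dependability. As an illustrative sketch only, not drawn from any of the listed studies, the following Python snippet estimates variance components for a hypothetical crossed persons × tasks design from ANOVA mean squares and computes a generalizability coefficient; all data are simulated.

```python
import numpy as np

# Simulated scores: 10 persons (rows) x 4 tasks (columns), one rating per cell.
# (Hypothetical data; effect sizes chosen only for illustration.)
rng = np.random.default_rng(0)
person_effect = rng.normal(0, 1.0, size=(10, 1))   # true person differences
task_effect = rng.normal(0, 0.5, size=(1, 4))      # task difficulty
noise = rng.normal(0, 0.8, size=(10, 4))           # person-x-task interaction + error
scores = 5 + person_effect + task_effect + noise

n_p, n_t = scores.shape
grand = scores.mean()
person_means = scores.mean(axis=1)
task_means = scores.mean(axis=0)

# ANOVA mean squares for the crossed p x t design (one observation per cell)
ms_p = n_t * ((person_means - grand) ** 2).sum() / (n_p - 1)
ms_t = n_p * ((task_means - grand) ** 2).sum() / (n_t - 1)
resid = scores - person_means[:, None] - task_means[None, :] + grand
ms_res = (resid ** 2).sum() / ((n_p - 1) * (n_t - 1))

# Variance-component estimates via the expected-mean-square equations
var_pt = ms_res                          # person-x-task interaction + error
var_p = max((ms_p - ms_res) / n_t, 0.0)  # universe-score (person) variance
var_t = max((ms_t - ms_res) / n_p, 0.0)  # task variance

# Generalizability coefficient for a mean over n_t tasks (relative decisions)
g_coef = var_p / (var_p + var_pt / n_t)
print(f"var_p={var_p:.3f}, var_t={var_t:.3f}, var_pt={var_pt:.3f}, G={g_coef:.3f}")
```

The large person × task component relative to the person component is exactly the task-specificity problem the Shavelson, Keller, and Brennan entries describe: adding tasks (increasing n_t in the denominator) raises the generalizability coefficient faster than adding raters when task sampling dominates error.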