ERIC - Search Results

Showing all 15 results

Extended Multivariate Generalizability Theory with Complex Design Structures

Peer reviewed

Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022

This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…

Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction

Assessing Reading Comprehension in Adolescent Low Achievers: Subskills Identification and Task Specificity

Peer reviewed

van Steensel, Roel; Oostdam, Ron; van Gelderen, Amos – Language Testing, 2013

On the basis of a validation study of a new test for assessing low-achieving adolescents' reading comprehension skills--the SALT-reading--we analyzed two issues relevant to the field of reading test development. Using the test results of 200 seventh graders, we examined the possibility of identifying reading comprehension subskills and the effects…

Descriptors: Adolescents, Low Achievement, Reading Comprehension, Reading Tests

An Application of Generalizability Theory to Evaluate the Technical Quality of an Alternate Assessment

Peer reviewed

Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013

Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…

Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores

Measurement Precision of the Sex-Role Egalitarianism Scale: A Generalizability Analysis.

Peer reviewed

King, Daniel W.; King, Lynda A. – Educational and Psychological Measurement, 1983

A three-facet (items, forms, and testing occasions) random effects generalizability analysis was used to evaluate the precision of each of the five domain measures of the Sex-Role Egalitarianism Scale. The recently developed scale measures attitudes toward the equality of males and females. (Author/PN)

Descriptors: Adults, Attitude Measures, Generalizability Theory, Rating Scales

One Iota Fills the Quota: A Paradox in Multifacet Reliability Coefficients.

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 1983

A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)

Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length

Technical Issues in Large-Scale Performance Assessment.

Phillips, Gary W., Ed. – 1996

Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…

Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics

Statistical Test Specifications for Performance Assessments: Is This an Oxymoron?

Reckase, Mark D. – 1997

This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…

Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability

Generalizability of Performance Assessments.

Peer reviewed

Brennan, Robert L.; Johnson, Eugene G. – Educational Measurement: Issues and Practice, 1995

The application of generalizability theory to the reliability and error variance estimation for performance assessment scores is discussed. Decision makers concerned with performance assessment need to realize the restrictions that limit generalizability such as limitations that lead to reductions in the number of tasks possible, rater quality,…

Descriptors: Decision Making, Educational Assessment, Error of Measurement, Estimation (Mathematics)

A Generalizability Theory Approach To Examining Teaching Evaluation Instruments Completed by Students.

Huang, Chi-yu; And Others – 1995

Generalizability theory is used to examine the sources of variability present in a teacher and course evaluation instrument. Two studies were conducted. In the first study, four different forms commonly used by one specific college of a large midwestern university were examined using responses of 915 students. The analysis of variance performed on…

Descriptors: Analysis of Variance, College Students, Course Evaluation, Evaluation Methods

On-the-Job Training: Development and Assessment of a Methodology for Generating Task Proficiency Evaluation Instruments.

Warm, Ronnie; And Others – 1986

This document describes the development and assessment of a methodology for generating on-the-job-training (OJT) task proficiency assessment instruments. The Task Evaluation Form (TEF) development procedures were derived to address previously identified deficiencies in the evaluation of OJT task proficiency. The TEF development procedures allow…

Descriptors: Adults, Correlation, Data Collection, Evaluation Methods

Handbook on Measurement, Assessment, and Evaluation in Higher Education

Secolsky, Charles, Ed.; Denison, D. Brian, Ed. – Routledge, Taylor & Francis Group, 2011

Increased demands for colleges and universities to engage in outcomes assessment for accountability purposes have accelerated the need to bridge the gap between higher education practice and the fields of measurement, assessment, and evaluation. The "Handbook on Measurement, Assessment, and Evaluation in Higher Education" provides higher…

Descriptors: Generalizability Theory, Higher Education, Institutional Advancement, Teacher Effectiveness

Testing Pronunciation: An Application of Generalizability Theory.

van Weeren, J.; Theunissen, T. J. J. M. – 1986

Pronunciation is regarded as a valuable subskill in foreign language teaching and testing. Its quality is commonly assessed in a global way by having examinees read aloud. An atomistic test is a more systematic and explicit approach. Such a test would consist of about 40 items, use recorded performances, and draw on an inventory of pronunciation…

Descriptors: Audiotape Recordings, Error Patterns, French, Generalizability Theory

Establishing the Reliability of the Florida Performance Measurement System's Research Based Observation Instrument.

Micceri, Theodore – 1984

This paper investigates the reliability of the Florida Performance Measurement System's Summative Observation instrument. Developed for the Florida Beginning Teacher Evaluation Program, it provides behavioral ratings for teachers in a classroom setting. Data came from ratings of videotapes of nine teachers conducting actual lessons by nine teams…

Descriptors: Analysis of Variance, Classroom Observation Techniques, Elementary Secondary Education, Evaluation Methods

Content Specifications of a Test and Generalizability Theory.

Gonzalez-Tamayo, Eulogio – 1987

The concepts of universe of admissible observation and universe of generalization from the generalizability theory were applied to calculate the intraclass correlation coefficient of a licensure test. The internal consistency coefficient of a dichotomously scored test is identical to the intraclass correlation coefficient of a two-facet design.…

Descriptors: Adults, Analysis of Variance, Content Validity, Criterion Referenced Tests

Quality Assurance in Teachers' Assessment.

Gipps, Caroline V. – 1994

The teacher assessment that is the subject of this paper is an essentially informal activity. The teacher assesses the student by posing questions, observing activities, and evaluating work in a planned or ad hoc way. The information obtained may be partial or fragmented, but repeating such assessments over time will allow the buildup of a solid…

Descriptors: Academic Achievement, Educational Assessment, Elementary Secondary Education, Evaluation Methods
