ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	13

Descriptor

Generalizability Theory	17
Statistical Analysis	17
Test Reliability	17
Test Validity	7
Foreign Countries	5
Interrater Reliability	5
Scores	4
Teacher Effectiveness	4
Teacher Evaluation	4
Test Construction	4
Item Analysis	3
Item Response Theory	3
Observation	3
Teaching Methods	3
Academic Achievement	2
College Students	2
Decision Making	2
Disabilities	2
Educational Assessment	2
English (Second Language)	2
Evaluation Methods	2
Evidence	2
Goodness of Fit	2
Higher Education	2
Language Tests	2
More ▼

Source

Applied Measurement in…	1
Assessing Writing	1
Assessment for Effective…	1
Canadian Journal of…	1
Chemistry Education Research…	1
Counseling Psychologist	1
EURASIA Journal of…	1
Educational Assessment	1
Educational Sciences: Theory…	1
Journal of Special Education…	1
Practical Assessment,…	1
ProQuest LLC	1
Routledge, Taylor & Francis…	1
School Effectiveness and…	1
Topics in Early Childhood…	1
More ▼

Publication Type

Journal Articles	13
Reports - Research	10
Reports - Evaluative	3
Speeches/Meeting Papers	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Information Analyses	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Higher Education	5
Elementary Education	4
Postsecondary Education	3
Early Childhood Education	2
Elementary Secondary Education	2
Intermediate Grades	2
Middle Schools	2
Secondary Education	2
Grade 10	1
Grade 3	1
Grade 5	1
Grade 6	1
Grade 8	1
High Schools	1
Junior High Schools	1
Preschool Education	1
Primary Education	1
Two Year Colleges	1
More ▼

Audience

Location

Cyprus	2
California	1
Canada	1
Colorado	1
Idaho	1
Turkey	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Strengths and Difficulties…

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

(In)Stability of Test Scores

Peer reviewed
PDF on ERIC

Download full text

Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022

Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…

Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores

Generalizability Theory in R

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Lucht, Marissa – Practical Assessment, Research & Evaluation, 2019

Generalizability theory is a modern, powerful, and broad framework used to assess the reliability, or dependability, of measurements. While there exist classic works that explain the basic concepts and mathematical foundations of the method, there is currently a lack of resources addressing computational resources for those researchers wishing to…

Descriptors: Generalizability Theory, Test Reliability, Computer Software, Statistical Analysis

The Stability of Kindergarten Teachers' Effectiveness: A Generalizability Study Comparing the Framework for Teaching and the Classroom Assessment Scoring System

Peer reviewed

Direct link

Mantzicopoulos, Panayota; French, Brian F.; Patrick, Helen; Watson, J. Samuel; Ahn, Inok – Educational Assessment, 2018

To meet recent accountability mandates, school districts are implementing assessment frameworks to document teachers' effectiveness. Observational assessments play a key role in this process, albeit without compelling evidence of their psychometric rigor. Using a sample of kindergarten teachers, we employed Generalizability theory to investigate…

Descriptors: Preschool Teachers, Kindergarten, Teacher Effectiveness, Generalizability Theory

Generalizability Theory Research on Developing a Scoring Rubric to Assess Primary School Students' Problem Posing Skills

Peer reviewed

Direct link

Cankoy, Osman; Özder, Hasan – EURASIA Journal of Mathematics, Science & Technology Education, 2017

The aim of this study is to develop a scoring rubric to assess primary school students' problem posing skills. The rubric including five dimensions namely solvability, reasonability, mathematical structure, context and language was used. The raters scored the students' problem posing skills both with and without the scoring rubric to test the…

Descriptors: Generalizability Theory, Elementary School Students, Foreign Countries, Problem Solving

Exploring the Reliability of Generic and Content-Specific Instructional Aspects in Physical Education Lessons

Peer reviewed

Direct link

Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017

Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…

Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability

The Effects of Testlets on Reliability and Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Teker, Gulsen Tasdelen; Dogan, Nuri – Educational Sciences: Theory and Practice, 2015

Reliability and differential item functioning (DIF) analyses were conducted on testlets displaying local item dependence in this study. The data set employed in the research was obtained from the answers given by 1,500 students to the 20 items included in six testlets given in English Proficiency Exam by the School of Foreign Languages of a state…

Descriptors: Foreign Countries, Test Items, Test Bias, Item Response Theory

Measuring Rater Reliability on a Special Education Observation Tool

Peer reviewed

Direct link

Semmelroth, Carrie Lisa; Johnson, Evelyn – Assessment for Effective Intervention, 2014

This study used generalizability theory to measure reliability on the Recognizing Effective Special Education Teachers (RESET) observation tool designed to evaluate special education teacher effectiveness. At the time of this study, the RESET tool included three evidence-based instructional practices (direct, explicit instruction; whole-group…

Descriptors: Observation, Special Education Teachers, Teacher Effectiveness, Teacher Evaluation

The Student Risk Screening Scale for Early Childhood: An Initial Validation Study

Peer reviewed

Direct link

Lane, Kathleen Lynne; Oakes, Wendy Peia; Menzies, Holly Mariah; Major, Rebecca; Allegra, Laurie; Powers, Lisa; Schatschneider, Chris – Topics in Early Childhood Special Education, 2015

We report findings of two exploratory validation studies of a revised instrument: the "Student Risk Screening Scale for Early Childhood" version (SRSS-EC). The SRSS-EC was modified to reflect characteristics of externalizing and internalizing behaviors manifested by preschool-age children. In Study 1, we explored the reliability of…

Descriptors: Screening Tests, At Risk Students, Early Childhood Education, Rating Scales

Psychometric Analysis of the Thermochemistry Concept Inventory

Peer reviewed

Direct link

Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014

Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…

Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory

An Application of Generalizability Theory to Evaluate the Technical Quality of an Alternate Assessment

Peer reviewed

Direct link

Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013

Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…

Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores

Examining Interrater Agreement Analyses of a Pilot Special Education Observation Tool

Peer reviewed
PDF on ERIC

Download full text

Johnson, Evelyn S.; Semmelroth, Carrie L. – Journal of Special Education Apprenticeship, 2012

This paper reports the results of interrater agreement analyses on a pilot special education teacher evaluation instrument, the Recognizing Effective Special Education Teachers (RESET) Observation Tool (OT). Using evidence-based instructional practices as the basis for the evaluation, the RESET OT is designed for the spectrum of different…

Descriptors: Interrater Reliability, Pilot Projects, Special Education, Special Education Teachers

Validation of an Academic Listening Test: Effects of "Breakdown" Tests and Test Takers' Cognitive Awareness of Listening Processes

Direct link

Chi, Youngshin – ProQuest LLC, 2011

This study investigated the breakdown effect of a listening comprehension test, whether test takers are affected in comprehending lectures by impediments, and collected test takers' cognitive awareness on test tasks which contain listening breakdown factors how they perceived these impediments. In this context of the study, a "Breakdown" is a test…

Descriptors: Generalizability Theory, Listening Comprehension, Intervals, Second Languages

Dependability of Measurement in Counseling Psychology: An Introduction to Generalizability Theory.

Peer reviewed

Hoyt, William T.; Melby, Janet N. – Counseling Psychologist, 1999

Addresses generalizability theory (GT), which offers a flexible framework for assessing dependability of measurement. GT allows for consideration of multiple sources of error, allowing investigators to assess the overall impact of measurement error. Illustrative analyses demonstrate the special advantages of GT for planning studies in which…

Descriptors: Counseling Psychology, Generalizability Theory, Measurement, Research Design

A Comparison of Generalizability Theory and Many-Facet Rasch Measurement in an Analysis of College Sophomore Writing

Peer reviewed

Direct link

Sudweeks, Richard R.; Reeve, Suzanne; Bradshaw, William S. – Assessing Writing, 2004

A pilot study was conducted to evaluate and improve the rating procedure proposed for use in a research effort designed to assess the essay writing ability of college sophomores. Generalizability theory and the Many-Facet Rasch Model were each used to (a) estimate potential sources of error in the rating, (b) to obtain reliability estimates, and…

Descriptors: Generalizability Theory, College Students, Writing Ability, Writing Evaluation

Statistical Test Specifications for Performance Assessments: Is This an Oxymoron?

Download full text

Reckase, Mark D. – 1997

This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…

Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability

Previous Page | Next Page »

Pages: 1 | 2

Ahn, Inok	1
Allegra, Laurie	1
Barbera, Jack	1
Bradshaw, William S.	1
Cankoy, Osman	1
Charalambous, Charalambos Y.	1
Chi, Youngshin	1
Denison, D. Brian, Ed.	1
Dogan, Nuri	1
French, Brian F.	1
Gipps, Caroline V.	1
Hoyt, William T.	1
Huebner, Alan	1
Johnson, Evelyn	1
Johnson, Evelyn S.	1
Klinger, Don A.	1
Kyriakides, Ermis	1
Kyriakides, Leonidas	1
Lane, Kathleen Lynne	1
Lucht, Marissa	1
Major, Rebecca	1
Mantzicopoulos, Panayota	1
Melby, Janet N.	1
Menzies, Holly Mariah	1
Merchant, Stefan	1
More ▼