Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 6 |
Descriptor
Generalizability Theory | 10 |
Statistical Analysis | 10 |
Test Validity | 10 |
Test Reliability | 7 |
Interrater Reliability | 3 |
Item Analysis | 3 |
Models | 3 |
Performance Based Assessment | 3 |
Test Construction | 3 |
Educational Assessment | 2 |
Evaluation Methods | 2 |
More ▼ |
Source
Author
Abedi, Jamal | 1 |
Allegra, Laurie | 1 |
Anum Khushal | 1 |
Baker, Eva L. | 1 |
Barbera, Jack | 1 |
Brian A. Couch | 1 |
Charalambous, Charalambos Y. | 1 |
Chi, Youngshin | 1 |
Denison, D. Brian, Ed. | 1 |
Gipps, Caroline V. | 1 |
Hopkins, Kenneth D. | 1 |
More ▼ |
Publication Type
Journal Articles | 6 |
Reports - Research | 6 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 2 |
Books | 1 |
Collected Works - General | 1 |
Dissertations/Theses -… | 1 |
Education Level
Higher Education | 3 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Two Year Colleges | 1 |
Audience
Researchers | 1 |
Location
California | 1 |
Colorado | 1 |
Cyprus | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Strengths and Difficulties… | 1 |
What Works Clearinghouse Rating
Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025
Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…
Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques
Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017
Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…
Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability
Lane, Kathleen Lynne; Oakes, Wendy Peia; Menzies, Holly Mariah; Major, Rebecca; Allegra, Laurie; Powers, Lisa; Schatschneider, Chris – Topics in Early Childhood Special Education, 2015
We report findings of two exploratory validation studies of a revised instrument: the "Student Risk Screening Scale for Early Childhood" version (SRSS-EC). The SRSS-EC was modified to reflect characteristics of externalizing and internalizing behaviors manifested by preschool-age children. In Study 1, we explored the reliability of…
Descriptors: Screening Tests, At Risk Students, Early Childhood Education, Rating Scales
Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014
Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…
Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory
Chi, Youngshin – ProQuest LLC, 2011
This study investigated the breakdown effect of a listening comprehension test, whether test takers are affected in comprehending lectures by impediments, and collected test takers' cognitive awareness on test tasks which contain listening breakdown factors how they perceived these impediments. In this context of the study, a "Breakdown" is a test…
Descriptors: Generalizability Theory, Listening Comprehension, Intervals, Second Languages

Hopkins, Kenneth D. – American Educational Research Journal, 1984
In behavior research using cognitive and affective measures, there is often incongruity between the statistical analysis employed and the intended inference. This paper argues that incorporating items as levels of a random facet via generalizability theory allows the statistical examination of the inferential question in the desired universe of…
Descriptors: Affective Measures, Analysis of Variance, Behavioral Science Research, Cognitive Measurement
Reckase, Mark D. – 1997
This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…
Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability
Secolsky, Charles, Ed.; Denison, D. Brian, Ed. – Routledge, Taylor & Francis Group, 2011
Increased demands for colleges and universities to engage in outcomes assessment for accountability purposes have accelerated the need to bridge the gap between higher education practice and the fields of measurement, assessment, and evaluation. The "Handbook on Measurement, Assessment, and Evaluation in Higher Education" provides higher…
Descriptors: Generalizability Theory, Higher Education, Institutional Advancement, Teacher Effectiveness

Abedi, Jamal; Baker, Eva L. – Educational and Psychological Measurement, 1995
Results from a performance assessment in which 68 high school students wrote essays support the use of latent variable modeling for estimating reliability, concurrent validity, and generalizability of a scoring rubric. The latent variable modeling approach overcomes the limitations of certain conventional statistical techniques in handling…
Descriptors: Criteria, Essays, Estimation (Mathematics), Generalizability Theory
Gipps, Caroline V. – 1994
The teacher assessment that is the subject of this paper is an essentially informal activity. The teacher assesses the student by posing questions, observing activities, and evaluating work in a planned or ad hoc way. The information obtained may be partial or fragmented, but repeating such assessments over time will allow the buildup of a solid…
Descriptors: Academic Achievement, Educational Assessment, Elementary Secondary Education, Evaluation Methods