ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	15

Descriptor

Generalizability Theory	15
Test Reliability	15
Foreign Countries	5
Error of Measurement	4
Test Validity	4
Elementary School Students	3
Evaluation Methods	3
Interrater Reliability	3
Psychometrics	3
Reading Tests	3
Scores	3
Writing Tests	3
Decision Making	2
Educational Research	2
Factor Analysis	2
Factor Structure	2
Goodness of Fit	2
Item Response Theory	2
Mathematics Tests	2
Test Items	2
Test Theory	2
Academic Achievement	1
Accuracy	1
Adoption (Ideas)	1
Allied Health Personnel	1
More ▼

Source

Educational and Psychological…	2
Annenberg Institute for…	1
Canadian Journal of…	1
Center for Research Use in…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Psychoeducational…	1
Journal of Speech, Language,…	1
Practical Assessment,…	1
Reading Psychology	1
School Psychology	1
Society for Research on…	1
Structural Equation Modeling:…	1
More ▼

Publication Type

Reports - Research	15
Journal Articles	12
Numerical/Quantitative Data	1

Education Level

Elementary Education	6
Middle Schools	4
Junior High Schools	3
Secondary Education	3
Early Childhood Education	2
Higher Education	2
Postsecondary Education	2
Primary Education	2
Grade 1	1
Grade 3	1
Grade 6	1
Grade 7	1
Grade 8	1
Intermediate Grades	1
More ▼

Audience

Location

Canada	1
Mexico (Mexico City)	1
Michigan	1
Norway	1
South Korea	1
Turkey	1
Turkey (Ankara)	1
Turkey (Istanbul)	1

Laws, Policies, & Programs

Assessments and Surveys

Dynamic Indicators of Basic…	1
Stanford Binet Intelligence…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Direct Discrepancy Dynamic Fit Index Cutoffs for Arbitrary Covariance Structure Models

Peer reviewed

Direct link

Daniel McNeish; Melissa G. Wolf – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Despite the popularity of traditional fit index cutoffs like RMSEA [less than or equal to] 0.06 and CFI [greater than or equal to] 0.95, several studies have noted issues with overgeneralizing traditional cutoffs. Computational methods have been proposed to avoid overgeneralization by deriving cutoffs specifically tailored to the characteristics…

Descriptors: Structural Equation Models, Cutting Scores, Generalizability Theory, Error of Measurement

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

How Not to Fool Ourselves about Heterogeneity of Treatment Effects. EdWorkingPaper No. 25-1116

Download full text

Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025

Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…

Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences

Extended Multivariate Generalizability Theory with Complex Design Structures

Peer reviewed

Direct link

Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022

This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…

Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction

A Short Note on Optimizing Cost-Generalizability via a Machine-Learning Approach

Peer reviewed

Direct link

Jiang, Zhehan; Shi, Dexin; Distefano, Christine – Educational and Psychological Measurement, 2021

The costs of an objective structured clinical examination (OSCE) are of concern to health profession educators globally. As OSCEs are usually designed under generalizability theory (G-theory) framework, this article proposes a machine-learning-based approach to optimize the costs, while maintaining the minimum required generalizability…

Descriptors: Artificial Intelligence, Generalizability Theory, Objective Tests, Foreign Countries

Comparison of G and Phi Coefficients Estimated in Generalizability Theory with Real Cases

Peer reviewed
PDF on ERIC

Download full text

Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021

This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…

Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability

Quantile Reliability: Beyond Global Estimates of Internal Consistency

Peer reviewed

Direct link

Jeffrey Shero; Jessica Logan – Society for Research on Educational Effectiveness, 2024

Background/Context: Previous research in educational assessment has consistently emphasized the importance of reliability as a cornerstone of test quality. Traditional measures of reliability, such as test-retest and split-half reliability, offer a broad view of how internally consistent a measure is but overlook the variability in this internal…

Descriptors: Educational Assessment, Special Education, Students with Disabilities, Learning Disabilities

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Mastery Measurement in Mathematics and the Goldilocks Effect

Peer reviewed

Direct link

Solomon, Benjamin G.; VanDerHeyden, Amanda M.; Solomon, Emily C.; Korzeniewski, Erika R.; Payne, Lexy L.; Campaña, Kayla V.; Dillon, Chasen R. – School Psychology, 2022

Math curriculum-based measurement (CBM) is an essential tool for multi-tiered systems of support decision making, but the reliability of math CBMs has received little research, particularly using more rigorous methods such as generalizability (G) theory. Math CBM is historically organized into two domains: mastery measures and general outcome…

Descriptors: Mathematics Tests, Mathematics Skills, Mathematics Achievement, Curriculum Based Assessment

(In)Stability of Test Scores

Peer reviewed
PDF on ERIC

Download full text

Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022

Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…

Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores

Structural Validity, Internal Consistency, and Rater Reliability of the Modified Barium Swallow Impairment Profile: Breaking Ground on a 52,726-Patient, Clinical Data Set

Peer reviewed

Direct link

Clain, Alex E.; Alkhuwaiter, Munirah; Davidson, Kate; Martin-Harris, Bonnie – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The purpose of this study was to extend the assessment of the psychometric properties of the Modified Barium Swallow Impairment Profile (MBSImP). Here, we re-examined structural validity and internal consistency using a large clinical-registry data set and formally examined rater reliability in a smaller data set. Method: This study…

Descriptors: Diagnostic Tests, Disability Identification, Physical Disabilities, Eating Disorders

The Generalizability of Running Record Accuracy and Self-Correction Scores

Peer reviewed

Direct link

D'Agostino, Jerome V.; Rodgers, Emily; Winkler, Christa; Johnson, Tracy; Berenbon, Rebecca – Reading Psychology, 2021

Running Records provide a standardized method for recording and assessing students' oral reading behaviors and are excellent formative assessment tools to guide instructional decision-making. This study expands on prior Running Record reliability work by evaluating the extent to which external raters and teachers consistently assessed students'…

Descriptors: Accuracy, Oral Reading, Generalizability Theory, Error Correction

Validation of the Child Observation Record Advantage 1.5 Assessment Tool for Preschool Children: A Multilevel Bifactor Modeling Approach

Peer reviewed

Direct link

Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023

This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…

Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques

Survey of Evidence in Education for Schools (SEE-S) Technical Report

Download full text

May, Henry; Blackman, Horatio; Van Horne, Sam; Tilley, Katherine; Farley-Ripple, Elizabeth N.; Shewchuk, Samantha; Agboh, Darren; Micklos, Deborah Amsden – Center for Research Use in Education, 2022

In this technical report, the Center for Research Use in Education (CRUE) presents the methodological design of a large-scale quantitative investigation of research use by school-based practitioners through the "Survey of Evidence in Education for Schools (SEE-S)." It documents the major technical aspects of the development of SEE-S,…

Descriptors: Surveys, Schools, Educational Research, Research Utilization

Psychometric Properties of MATE: A Study Focused on Testing the Generalizability of the Measure of Acceptance of the Theory of Evolution

Peer reviewed

Direct link

Sya'bandari, Yustika; Rachmatullah, Arif; Ha, Minsu – International Journal of Science Education, 2021

The Measure of Acceptance of the Theory of Evolution (MATE) has been extensively used in science education research for more than two decades. This study examines the fairness of MATE items based on religious convictions and academic majors. The multidimensional item response theory and differential item functioning analyses were run on data…

Descriptors: Attitude Measures, Scientific Attitudes, Evolution, Adoption (Ideas)

Agboh, Darren	1
Akaeze, Hope O.	1
Aksu, Gökhan	1
Alkhuwaiter, Munirah	1
Berenbon, Rebecca	1
Blackman, Horatio	1
Brendan A. Schuetze	1
Brennan, Robert L.	1
Campaña, Kayla V.	1
Clain, Alex E.	1
D'Agostino, Jerome V.	1
Daniel McNeish	1
Davidson, Kate	1
Deniz, Kaan Zulfikar	1
Dillon, Chasen R.	1
Distefano, Christine	1
Eser, Mehmet Taha	1
Farley-Ripple, Elizabeth N.	1
Ha, Minsu	1
Huebner, Alan	1
Ilican, Emel	1
Jeffrey Shero	1
Jessica Logan	1
Jiang, Zhehan	1
Johnson, Tracy	1
More ▼