ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	13
Since 2016 (last 10 years)	39
Since 2006 (last 20 years)	102

Descriptor

Generalizability Theory	146
Scores	146
Reliability	52
Error of Measurement	39
Test Reliability	33
Interrater Reliability	26
English (Second Language)	21
Scoring	21
Correlation	18
Foreign Countries	18
Comparative Analysis	16
Measures (Individuals)	16
Second Language Learning	15
Statistical Analysis	15
Language Tests	14
Measurement	14
Performance Based Assessment	14
Validity	14
Writing Tests	14
Evaluators	13
Psychometrics	13
Test Items	13
Models	12
Test Validity	12
Elementary School Students	11
More ▼

Publication Type

Journal Articles	111
Reports - Research	101
Reports - Evaluative	34
Speeches/Meeting Papers	21
Reports - Descriptive	7
Numerical/Quantitative Data	4
Tests/Questionnaires	3
Dissertations/Theses -…	2
Information Analyses	2
Opinion Papers	2
Books	1
Guides - Non-Classroom	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Education Level

Higher Education	27
Postsecondary Education	19
Elementary Education	10
Secondary Education	9
Elementary Secondary Education	8
Middle Schools	7
Early Childhood Education	5
Grade 8	5
Junior High Schools	5
Preschool Education	5
Grade 3	4
High Schools	4
Grade 10	3
Grade 5	3
Grade 6	3
Primary Education	3
Grade 7	2
Grade 9	2
Intermediate Grades	2
Grade 1	1
Grade 11	1
Grade 12	1
Grade 4	1
Kindergarten	1
More ▼

Audience

Researchers

Location

Canada	5
Turkey	4
Florida	3
New York	2
South Korea	2
Turkey (Ankara)	2
Australia	1
Belgium	1
California (Los Angeles)	1
China (Beijing)	1
Colorado (Denver)	1
Denmark	1
Egypt	1
Georgia	1
Hong Kong	1
Idaho	1
Indiana	1
Iowa	1
Japan	1
Kenya	1
Mexico	1
Netherlands	1
New York (New York)	1
North Carolina (Charlotte)	1
Norway	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Test of English as a Foreign…	6
Childrens Depression Inventory	2
Test of English for…	2
Advanced Placement…	1
Behavior Assessment System…	1
Big Five Inventory	1
Classroom Assessment Scoring…	1
Florida Comprehensive…	1
Group Assessment of Logical…	1
Motivated Strategies for…	1
Myers Briggs Type Indicator	1
Preschool Language Scale	1
SAT (College Admission Test)	1
Teacher Performance…	1
Teacher Rating Scale	1
United States Medical…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 146 results Save | Export

The Implications of Propensity Score Augmentation for Generalization

Peer reviewed

Direct link

Wendy Chan; Jimin Oh; Chen Li; Jiexuan Huang; Yeran Tong – Society for Research on Educational Effectiveness, 2023

Background: The generalizability of a study's results continues to be at the forefront of concerns in evaluation research in education (Tipton & Olsen, 2018). Over the past decade, statisticians have developed methods, mainly based on propensity scores, to improve generalizations in the absence of random sampling (Stuart et al., 2011; Tipton,…

Descriptors: Generalizability Theory, Probability, Scores, Sampling

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Direct link

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

Integrating Bifactor Models into a Generalizability Theory Based Structural Equation Modeling Framework

Peer reviewed

Direct link

Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023

Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…

Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores

Indices of Subscore Utility for Individuals and Subgroups Based on Multivariate Generalizability Theory

Peer reviewed

Direct link

Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020

Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…

Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability

Standard Errors of Variance Components, Measurement Errors and Generalizability Coefficients for Crossed Designs

Peer reviewed

Direct link

Almehrizi, Rashid S. – Journal of Educational Measurement, 2021

Estimates of various variance components, universe score variance, measurement error variances, and generalizability coefficients, like all statistics, are subject to sampling variability, particularly in small samples. Such variability is quantified traditionally through estimated standard errors and/or confidence intervals. The paper derived new…

Descriptors: Error of Measurement, Statistics, Design, Generalizability Theory

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Download full text

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

The Affectability of Writing Assessment Scores: A G-Theory Analysis of Rater, Task, and Scoring Method Contribution

Peer reviewed

Direct link

Khodi, Ali – Language Testing in Asia, 2021

The present study attempted to to investigate factors which affect EFL writing scores through using generalizability theory (G-theory). To this purpose, one hundred and twenty students participated in one independent and one integrated writing tasks. Proceeding, their performances were scored by six raters: one self-rating, three peers,-rating and…

Descriptors: Writing Tests, Scores, Generalizability Theory, English (Second Language)

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Direct link

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Extended Multivariate Generalizability Theory with Complex Design Structures

Peer reviewed

Direct link

Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022

This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…

Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction

Leveraging the Power of Observations: Locating the Sources of Error in the Individualized Classroom Assessment Scoring System

Peer reviewed

Direct link

Carbonneau, Kira J.; Van Orman, Dustin S. J.; Lemberger-Truelove, Matthew E.; Atencio, David J. – Early Education and Development, 2020

Research Findings: Given the variable nature of early childhood settings, practitioners and researchers need better guidance on what conditions influence observations conducted within early childhood settings (National Research Council, 2008). Using 230 observations from 23 three- and four-year-old children, we conducted a Generalizability study…

Descriptors: Classroom Environment, Observation, Preschool Children, Influences

Validity. Improving Literacy Brief: Understanding Screening

Direct link

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Validity is broadly defined as how well something measures what it's supposed to measure. The reliability and validity of scores from assessments are two concepts that are closely knit together and feed into each other.

Descriptors: Screening Tests, Scores, Test Validity, Test Reliability

(In)Stability of Test Scores

Peer reviewed
PDF on ERIC

Download full text

Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022

Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…

Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores

The Reliability of Framework for Teaching Scores in Kindergarten

Peer reviewed
PDF on ERIC

Download full text

Direct link

Patrick, Helen; French, Brian F.; Mantzicopoulos, Panayota – Journal of Psychoeducational Assessment, 2020

We evaluated the score stability of the Framework for Teaching (FFT), a prominent observation instrument used for teacher evaluation. Three raters each scored 200 reading and mathematics lessons taught by 20 kindergarten teachers. Using Generalizability theory analyses, we decomposed the FFT's Classroom Environment, Instruction, and Total scores…

Descriptors: Teacher Evaluation, Observation, Scores, Test Reliability

Thematic Content Analysis of Studies Using Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Teker, Gülsen Tasdelen; Güler, Nese – International Journal of Assessment Tools in Education, 2019

One of the important theories in education and psychology is Generalizability (G) Theory and various properties distinguish it from the other measurement theories. To better understand methodological trends of G theory, a thematic content analysis was conducted. This study analyzes the studies using generalizability theory in the field of…

Descriptors: Generalizability Theory, Content Analysis, Foreign Countries, Education

The Generalizability of Running Record Accuracy and Self-Correction Scores

Peer reviewed

Direct link

D'Agostino, Jerome V.; Rodgers, Emily; Winkler, Christa; Johnson, Tracy; Berenbon, Rebecca – Reading Psychology, 2021

Running Records provide a standardized method for recording and assessing students' oral reading behaviors and are excellent formative assessment tools to guide instructional decision-making. This study expands on prior Running Record reliability work by evaluating the extent to which external raters and teachers consistently assessed students'…

Descriptors: Accuracy, Oral Reading, Generalizability Theory, Error Correction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational and Psychological…	12
Advances in Health Sciences…	8
Applied Measurement in…	7
Journal of Educational…	7
Language Testing	7
Society for Research on…	4
Assessing Writing	3
ETS Research Report Series	3
Educational Sciences: Theory…	3
Language Assessment Quarterly	3
Measurement and Evaluation in…	3
School Psychology Review	3
Asia Pacific Education Review	2
Assessment for Effective…	2
Educational Measurement:…	2
Intelligence	2
Journal of Consulting and…	2
Journal of Educational and…	2
Journal of Psychoeducational…	2
Language Testing in Asia	2
ProQuest LLC	2
American Journal of Evaluation	1
Applied Linguistics	1
Asian Journal of Education…	1
Canadian Journal of…	1
More ▼

Lee, Guemin	8
Lee, Yong-Won	7
Brennan, Robert L.	5
Kantor, Robert	4
Bordage, Georges	3
Floyd, Randy G.	3
Huang, Jinyan	3
Yudkowsky, Rachel	3
Attali, Yigal	2
Bergeron, Renee	2
Boyd, Donald	2
Chang, Lei	2
Clauser, Brian E.	2
Crowley, Susan L.	2
French, Brian F.	2
Frisbie, David A.	2
Gebril, Atta	2
Guler, Nese	2
Johnson, Robert L.	2
Kreiter, Clarence D.	2
Lankford, Hamilton	2
Li, Min	2
Loeb, Susanna	2
Mantzicopoulos, Panayota	2
More ▼