Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 12 |
Descriptor
Generalizability Theory | 17 |
Multivariate Analysis | 17 |
Reliability | 7 |
Language Tests | 5 |
Scores | 5 |
Test Items | 4 |
Comparative Analysis | 3 |
Correlation | 3 |
Foreign Countries | 3 |
Psychometrics | 3 |
Scoring | 3 |
Author
Gibbons, Robert D. | 2 |
Hyeri Hong | 2 |
Hyeryung Lee | 2 |
Walter P. Vispoel | 2 |
Atilgan, Hakan | 1 |
Blikstein, Paulo | 1 |
Brennan, Robert L. | 1 |
Chon, Kyong Hee | 1 |
Clauser, Brian E. | 1 |
DeBrock, Lindsay | 1 |
Ewert, Doreen | 1 |
Publication Type
Reports - Research | 17 |
Journal Articles | 15 |
Tests/Questionnaires | 2 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 4 |
Postsecondary Education | 4 |
Elementary Education | 2 |
Grade 6 | 1 |
Intermediate Grades | 1 |
Kindergarten | 1 |
Middle Schools | 1 |
Location
Australia | 1 |
Canada | 1 |
China (Beijing) | 1 |
Hong Kong | 1 |
Iowa | 1 |
Mexico | 1 |
North Carolina | 1 |
Taiwan | 1 |
Turkey | 1 |
United States | 1 |
Assessments and Surveys
Rosenberg Self Esteem Scale | 1 |
Test of English as a Foreign… | 1 |
United States Medical… | 1 |
Walter P. Vispoel; Hyeryung Lee; Hyeri Hong – Structural Equation Modeling: A Multidisciplinary Journal, 2024
We demonstrate how to analyze complete multivariate generalizability theory (GT) designs within structural equation modeling frameworks that encompass both individual subscale scores and composites formed from those scores. Results from numerous analyses of observed scores obtained from respondents who completed the recently updated form of the…
Descriptors: Structural Equation Models, Multivariate Analysis, Generalizability Theory, College Students
Walter P. Vispoel; Hyeri Hong; Hyeryung Lee – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Although generalizability theory (GT) designs typically are analyzed using analysis of variance (ANOVA) procedures, they also can be integrated into structural equation models (SEMs). In this tutorial, we review basic concepts for conducting univariate and multivariate GT analyses and demonstrate advantages of doing such analyses within SEM…
Descriptors: Structural Equation Models, Self Concept Measures, Self Esteem, Generalizability Theory
Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020
Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…
Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability
Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022
This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…
Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction
Sung, Kyung Hee; Noh, Eun Hee; Chon, Kyong Hee – Asia Pacific Education Review, 2017
With increased use of constructed response items in large scale assessments, the cost of scoring has been a major consideration (Noh et al. in KICE Report RRE 2012-6, 2012; Wainer and Thissen in "Applied Measurement in Education" 6:103-118, 1993). In response to the scoring cost issues, various forms of automated system for scoring…
Descriptors: Automation, Scoring, Social Studies, Test Items
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Shin, Sun-Young; Ewert, Doreen – Language Testing, 2015
Reading-to-write (RTW) tasks are becoming increasingly popular and have already been used in several high-stakes English proficiency exams, either replacing or complementing a prompt-based essay test. However, it is still not clear that what accounts for successful or unsuccessful performance on an integrated reading-writing task is owing to the…
Descriptors: English (Second Language), Language Tests, Language Proficiency, Test Items
Atilgan, Hakan – Eurasian Journal of Educational Research, 2013
Problem Statement: Reliability, which refers to the degree to which measurement results are free from measurement errors, as well as its estimation, is an important issue in psychometrics. Several methods for estimating reliability have been suggested by various theories in the field of psychometrics. One of these theories is the generalizability…
Descriptors: Sample Size, Generalizability Theory, Mathematical Formulas, Measurement Techniques
Worsley, Marcelo; Blikstein, Paulo – Journal of Learning Analytics, 2014
Learning analytics and educational data mining are introducing a number of new techniques and frameworks for studying learning. The scalability and complexity of these novel techniques has afforded new ways for enacting education research and has helped scholars gain new insights into human cognition and learning. Nonetheless, there remain some…
Descriptors: Data Analysis, Data Collection, Engineering, Design
Heilmann, John; DeBrock, Lindsay; Riley-Tillman, T. Chris – American Journal of Speech-Language Pathology, 2013
Purpose: The purpose of this study was to examine the reliability of, and sources of variability in, language measures from interviews collected from young school-age children. Method: Two 10-min interviews were collected from 20 at-risk kindergarten children by an examiner using a standardized set of questions. Test-retest reliability…
Descriptors: Measures (Individuals), Structured Interviews, Reliability, Kindergarten
Gebril, Atta – Assessing Writing, 2010
Integrated tasks are currently employed in a number of L2 exams since they are perceived as an addition to the writing-only task type. Given this trend, the current study investigates composite score generalizability of both reading-to-write and writing-only tasks. For this purpose, a multivariate generalizability analysis is used to investigate…
Descriptors: Scoring, Scores, Second Language Instruction, Writing Evaluation
Gibbons, Robert D.; And Others – 1990
The probability integral of the multivariate normal distribution (ND) has received considerable attention since W. F. Sheppard's (1900) and K. Pearson's (1901) seminal work on the bivariate ND. This paper evaluates the formula that represents the "n x n" correlation matrix of the "chi(sub i)" and the standardized multivariate…
Descriptors: Algorithms, Equations (Mathematics), Estimation (Mathematics), Generalizability Theory
Gibbons, Robert D.; And Others – 1990
In the process of developing a conditionally-dependent item response theory (IRT) model, the problem arose of modeling an underlying multivariate normal (MVN) response process with general correlation among the items. Without the assumption of conditional independence, for which the underlying MVN cdf takes on comparatively simple forms and can be…
Descriptors: Equations (Mathematics), Estimation (Mathematics), Generalizability Theory, Item Response Theory

Marcoulides, George A. – Educational and Psychological Measurement, 1994
Effects of different weighting schemes on selecting the optimal number of observations in multivariate-multifacet generalizability designs are studied when cost constraints are imposed. Comparison of four schemes through simulation indicates that all four produce similar optimal values and that reliability should be similar. (SLD)
Descriptors: Budgeting, Comparative Analysis, Costs, Factor Analysis
Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005
Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests