ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	12

Descriptor

Generalizability Theory	17
Multivariate Analysis	17
Reliability	7
Language Tests	5
Scores	5
Test Items	4
Comparative Analysis	3
Correlation	3
Foreign Countries	3
Psychometrics	3
Scoring	3
Writing Tests	3
College Students	2
Computation	2
English (Second Language)	2
Equations (Mathematics)	2
Error of Measurement	2
Estimation (Mathematics)	2
Interrater Reliability	2
Mathematical Models	2
Performance Based Assessment	2
Rating Scales	2
Sample Size	2
Second Language Learning	2
Structural Equation Models	2

Source

Educational and Psychological…	3
Structural Equation Modeling:…	2
American Journal of…	1
Asia Pacific Education Review	1
Assessing Writing	1
ETS Research Report Series	1
Eurasian Journal of…	1
Evaluation and Program…	1
Journal of Educational…	1
Journal of Learning Analytics	1
Language Assessment Quarterly	1
Language Testing	1

Publication Type

Reports - Research	17
Journal Articles	15
Tests/Questionnaires	2
Speeches/Meeting Papers	1

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Education	2
Grade 6	1
Intermediate Grades	1
Kindergarten	1
Middle Schools	1

Location

Australia	1
Canada	1
China (Beijing)	1
Hong Kong	1
Iowa	1
Mexico	1
North Carolina	1
Taiwan	1
Turkey	1
United States	1

Assessments and Surveys

Rosenberg Self Esteem Scale	1
Test of English as a Foreign…	1
United States Medical…	1

Showing 1 to 15 of 17 results

Analyzing Multivariate Generalizability Theory Designs within Structural Equation Modeling Frameworks

Peer reviewed

Walter P. Vispoel; Hyeryung Lee; Hyeri Hong – Structural Equation Modeling: A Multidisciplinary Journal, 2024

We demonstrate how to analyze complete multivariate generalizability theory (GT) designs within structural equation modeling frameworks that encompass both individual subscale scores and composites formed from those scores. Results from numerous analyses of observed scores obtained from respondents who completed the recently updated form of the…

Descriptors: Structural Equation Models, Multivariate Analysis, Generalizability Theory, College Students

Benefits of Doing Generalizability Theory Analyses within Structural Equation Modeling Frameworks: Illustrations Using the Rosenberg Self-Esteem Scale

Peer reviewed

Walter P. Vispoel; Hyeri Hong; Hyeryung Lee – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Although generalizability theory (GT) designs typically are analyzed using analysis of variance (ANOVA) procedures, they also can be integrated into structural equation models (SEMs). In this tutorial, we review basic concepts for conducting univariate and multivariate GT analyses and demonstrate advantages of doing such analyses within SEM…

Descriptors: Structural Equation Models, Self Concept Measures, Self Esteem, Generalizability Theory

Indices of Subscore Utility for Individuals and Subgroups Based on Multivariate Generalizability Theory

Peer reviewed

Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020

Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…

Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability

Extended Multivariate Generalizability Theory with Complex Design Structures

Peer reviewed

Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022

This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…

Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction

Multivariate Generalizability Analysis of Automated Scoring for Short Answer Items of Social Studies in Large-Scale Assessment

Peer reviewed

Sung, Kyung Hee; Noh, Eun Hee; Chon, Kyong Hee – Asia Pacific Education Review, 2017

With increased use of constructed response items in large scale assessments, the cost of scoring has been a major consideration (Noh et al. in KICE Report RRE 2012-6, 2012; Wainer and Thissen in "Applied Measurement in Education" 6:103-118, 1993). In response to the scoring cost issues, various forms of automated system for scoring…

Descriptors: Automation, Scoring, Social Studies, Test Items

Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

Peer reviewed

Han, Chao – Language Assessment Quarterly, 2016

As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…

Descriptors: Foreign Countries, Scores, English, Chinese

What Accounts for Integrated Reading-to-Write Task Scores?

Peer reviewed

Shin, Sun-Young; Ewert, Doreen – Language Testing, 2015

Reading-to-write (RTW) tasks are becoming increasingly popular and have already been used in several high-stakes English proficiency exams, either replacing or complementing a prompt-based essay test. However, it is still not clear that what accounts for successful or unsuccessful performance on an integrated reading-writing task is owing to the…

Descriptors: English (Second Language), Language Tests, Language Proficiency, Test Items

Sample Size for Estimation of G and Phi Coefficients in Generalizability Theory

Peer reviewed

Atilgan, Hakan – Eurasian Journal of Educational Research, 2013

Problem Statement: Reliability, which refers to the degree to which measurement results are free from measurement errors, as well as its estimation, is an important issue in psychometrics. Several methods for estimating reliability have been suggested by various theories in the field of psychometrics. One of these theories is the generalizability…

Descriptors: Sample Size, Generalizability Theory, Mathematical Formulas, Measurement Techniques

Analyzing Engineering Design through the Lens of Computation

Peer reviewed

Worsley, Marcelo; Blikstein, Paulo – Journal of Learning Analytics, 2014

Learning analytics and educational data mining are introducing a number of new techniques and frameworks for studying learning. The scalability and complexity of these novel techniques has afforded new ways for enacting education research and has helped scholars gain new insights into human cognition and learning. Nonetheless, there remain some…

Descriptors: Data Analysis, Data Collection, Engineering, Design

Stability of Measures from Children's Interviews: The Effects of Time, Sample Length, and Topic

Peer reviewed

Heilmann, John; DeBrock, Lindsay; Riley-Tillman, T. Chris – American Journal of Speech-Language Pathology, 2013

Purpose: The purpose of this study was to examine the reliability of, and sources of variability in, language measures from interviews collected from young school-age children. Method: Two 10-min interviews were collected from 20 at-risk kindergarten children by an examiner using a standardized set of questions. Test-retest reliability…

Descriptors: Measures (Individuals), Structured Interviews, Reliability, Kindergarten

Bringing Reading-to-Write and Writing-Only Assessment Tasks Together: A Generalizability Analysis

Peer reviewed

Gebril, Atta – Assessing Writing, 2010

Integrated tasks are currently employed in a number of L2 exams since they are perceived as an addition to the writing-only task type. Given this trend, the current study investigates composite score generalizability of both reading-to-write and writing-only tasks. For this purpose, a multivariate generalizability analysis is used to investigate…

Descriptors: Scoring, Scores, Second Language Instruction, Writing Evaluation

Approximating Multivariate Normal Orthant Probabilities. ONR Technical Report. [Biometric Lab Report No. 90-1.]

Gibbons, Robert D.; And Others – 1990

The probability integral of the multivariate normal distribution (ND) has received considerable attention since W. F. Sheppard's (1900) and K. Pearson's (1901) seminal work on the bivariate ND. This paper evaluates the formula that represents the "n x n" correlation matrix of the "chi(sub i)" and the standardized multivariate…

Descriptors: Algorithms, Equations (Mathematics), Estimation (Mathematics), Generalizability Theory

Multivariate Generalizations of Student's t-Distribution. ONR Technical Report. [Biometric Lab Report No. 90-3.]

Gibbons, Robert D.; And Others – 1990

In the process of developing a conditionally-dependent item response theory (IRT) model, the problem arose of modeling an underlying multivariate normal (MVN) response process with general correlation among the items. Without the assumption of conditional independence, for which the underlying MVN cdf takes on comparatively simple forms and can be…

Descriptors: Equations (Mathematics), Estimation (Mathematics), Generalizability Theory, Item Response Theory

Selecting Weighting Schemes in Multivariate Generalizability Studies.

Peer reviewed

Marcoulides, George A. – Educational and Psychological Measurement, 1994

Effects of different weighting schemes on selecting the optimal number of observations in multivariate-multifacet generalizability designs are studied when cost constraints are imposed. Comparison of four schemes through simulation indicates that all four produce similar optimal values and that reliability should be similar. (SLD)

Descriptors: Budgeting, Comparative Analysis, Costs, Factor Analysis

Dependability of New ESL Writing Test Scores: Evaluating Prototype Tasks and Alternative Rating Schemes. TOEFL® Monograph Series. MS-31. ETS RR-05-14

Peer reviewed

Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005

Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests

Author

Gibbons, Robert D.	2
Hyeri Hong	2
Hyeryung Lee	2
Walter P. Vispoel	2
Atilgan, Hakan	1
Blikstein, Paulo	1
Brennan, Robert L.	1
Chon, Kyong Hee	1
Clauser, Brian E.	1
DeBrock, Lindsay	1
Ewert, Doreen	1
Gebril, Atta	1
Green, Rex S.	1
Han, Chao	1
Harik, Polina	1
Heilmann, John	1
Jerrell, Jeanette M.	1
Jiang, Zhehan	1
Kantor, Robert	1
Kim, Stella Y.	1
Lee, Won-Chan	1
Lee, Yong-Won	1
Marcoulides, George A.	1
Margolis, Melissa J.	1