NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Type
Reports - Research17
Journal Articles15
Tests/Questionnaires2
Speeches/Meeting Papers1
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 17 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Walter P. Vispoel; Hyeryung Lee; Hyeri Hong – Structural Equation Modeling: A Multidisciplinary Journal, 2024
We demonstrate how to analyze complete multivariate generalizability theory (GT) designs within structural equation modeling frameworks that encompass both individual subscale scores and composites formed from those scores. Results from numerous analyses of observed scores obtained from respondents who completed the recently updated form of the…
Descriptors: Structural Equation Models, Multivariate Analysis, Generalizability Theory, College Students
Peer reviewed Peer reviewed
Direct linkDirect link
Walter P. Vispoel; Hyeri Hong; Hyeryung Lee – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Although generalizability theory (GT) designs typically are analyzed using analysis of variance (ANOVA) procedures, they also can be integrated into structural equation models (SEMs). In this tutorial, we review basic concepts for conducting univariate and multivariate GT analyses and demonstrate advantages of doing such analyses within SEM…
Descriptors: Structural Equation Models, Self Concept Measures, Self Esteem, Generalizability Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020
Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…
Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022
This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…
Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Sung, Kyung Hee; Noh, Eun Hee; Chon, Kyong Hee – Asia Pacific Education Review, 2017
With increased use of constructed response items in large scale assessments, the cost of scoring has been a major consideration (Noh et al. in KICE Report RRE 2012-6, 2012; Wainer and Thissen in "Applied Measurement in Education" 6:103-118, 1993). In response to the scoring cost issues, various forms of automated system for scoring…
Descriptors: Automation, Scoring, Social Studies, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Peer reviewed Peer reviewed
Direct linkDirect link
Shin, Sun-Young; Ewert, Doreen – Language Testing, 2015
Reading-to-write (RTW) tasks are becoming increasingly popular and have already been used in several high-stakes English proficiency exams, either replacing or complementing a prompt-based essay test. However, it is still not clear that what accounts for successful or unsuccessful performance on an integrated reading-writing task is owing to the…
Descriptors: English (Second Language), Language Tests, Language Proficiency, Test Items
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Atilgan, Hakan – Eurasian Journal of Educational Research, 2013
Problem Statement: Reliability, which refers to the degree to which measurement results are free from measurement errors, as well as its estimation, is an important issue in psychometrics. Several methods for estimating reliability have been suggested by various theories in the field of psychometrics. One of these theories is the generalizability…
Descriptors: Sample Size, Generalizability Theory, Mathematical Formulas, Measurement Techniques
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Worsley, Marcelo; Blikstein, Paulo – Journal of Learning Analytics, 2014
Learning analytics and educational data mining are introducing a number of new techniques and frameworks for studying learning. The scalability and complexity of these novel techniques has afforded new ways for enacting education research and has helped scholars gain new insights into human cognition and learning. Nonetheless, there remain some…
Descriptors: Data Analysis, Data Collection, Engineering, Design
Peer reviewed Peer reviewed
Direct linkDirect link
Heilmann, John; DeBrock, Lindsay; Riley-Tillman, T. Chris – American Journal of Speech-Language Pathology, 2013
Purpose: The purpose of this study was to examine the reliability of, and sources of variability in, language measures from interviews collected from young school-age children. Method: Two 10-min interviews were collected from 20 at-risk kindergarten children by an examiner using a standardized set of questions. Test-retest reliability…
Descriptors: Measures (Individuals), Structured Interviews, Reliability, Kindergarten
Peer reviewed Peer reviewed
Direct linkDirect link
Gebril, Atta – Assessing Writing, 2010
Integrated tasks are currently employed in a number of L2 exams since they are perceived as an addition to the writing-only task type. Given this trend, the current study investigates composite score generalizability of both reading-to-write and writing-only tasks. For this purpose, a multivariate generalizability analysis is used to investigate…
Descriptors: Scoring, Scores, Second Language Instruction, Writing Evaluation
Gibbons, Robert D.; And Others – 1990
The probability integral of the multivariate normal distribution (ND) has received considerable attention since W. F. Sheppard's (1900) and K. Pearson's (1901) seminal work on the bivariate ND. This paper evaluates the formula that represents the "n x n" correlation matrix of the "chi(sub i)" and the standardized multivariate…
Descriptors: Algorithms, Equations (Mathematics), Estimation (Mathematics), Generalizability Theory
Gibbons, Robert D.; And Others – 1990
In the process of developing a conditionally-dependent item response theory (IRT) model, the problem arose of modeling an underlying multivariate normal (MVN) response process with general correlation among the items. Without the assumption of conditional independence, for which the underlying MVN cdf takes on comparatively simple forms and can be…
Descriptors: Equations (Mathematics), Estimation (Mathematics), Generalizability Theory, Item Response Theory
Peer reviewed Peer reviewed
Marcoulides, George A. – Educational and Psychological Measurement, 1994
Effects of different weighting schemes on selecting the optimal number of observations in multivariate-multifacet generalizability designs are studied when cost constraints are imposed. Comparison of four schemes through simulation indicates that all four produce similar optimal values and that reliability should be similar. (SLD)
Descriptors: Budgeting, Comparative Analysis, Costs, Factor Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005
Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests
Previous Page | Next Page »
Pages: 1  |  2