ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Generalizability Theory	11
Test Construction	11
Test Items	11
Test Reliability	4
Language Tests	3
Scores	3
Second Language Learning	3
Difficulty Level	2
English (Second Language)	2
Error Patterns	2
Error of Measurement	2
Item Response Theory	2
Multiple Choice Tests	2
Multivariate Analysis	2
Test Format	2
Test Length	2
Test Theory	2
Writing Tests	2
Achievement Tests	1
Adults	1
Analysis of Variance	1
Audiotape Recordings	1
Communication (Thought…	1
Content Validity	1
Criterion Referenced Tests	1
More ▼

Source

Educational and Psychological…	3
ETS Research Report Series	1
Journal of Educational…	1
Language Assessment Quarterly	1
Language Testing	1

Author

Brennan, Robert L.	1
Chang, Lei	1
Colton, Dean A.	1
Conger, Anthony J.	1
Gonzalez-Tamayo, Eulogio	1
Harsch, Claudia	1
Kantor, Robert	1
Kim, Stella Y.	1
Lee, Won-Chan	1
Lee, Yong-Won	1
Li, Feifei	1
Mollaun, Pam	1
Rupp, Andre Alexander	1
Theunissen, T. J. J. M.	1
Webb, Noreen M.	1
Zhang, Su	1
van Weeren, J.	1
More ▼

Publication Type

Reports - Research	8
Journal Articles	7
Reports - Evaluative	3
Information Analyses	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Grade 10	1
Grade 3	1
Grade 8	1
Grade 9	1
Secondary Education	1

Audience

Location

Germany

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	2
ACT Assessment	1
Test of English for…	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Extended Multivariate Generalizability Theory with Complex Design Structures

Peer reviewed

Direct link

Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022

This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…

Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

Designing and Scaling Level-Specific Writing Tasks in Alignment with the CEFR: A Test-Centered Approach

Peer reviewed

Direct link

Harsch, Claudia; Rupp, Andre Alexander – Language Assessment Quarterly, 2011

The "Common European Framework of Reference" (CEFR; Council of Europe, 2001) provides a competency model that is increasingly used as a point of reference to compare language examinations. Nevertheless, aligning examinations to the CEFR proficiency levels remains a challenge. In this article, we propose a new, level-centered approach to…

Descriptors: Language Tests, Writing Tests, Test Construction, Test Items

One Iota Fills the Quota: A Paradox in Multifacet Reliability Coefficients.

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 1983

A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)

Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length

Connotatively Consistent and Reversed Connotatively Inconsistent Items Are Not Fully Equivalent: Generalizability Study.

Peer reviewed

Chang, Lei – Educational and Psychological Measurement, 1995

Items previously described as "negatively worded" are redefined as "connotatively inconsistent" because this term has a broader base for generalization. Using generalizability theory with a sample of 102 graduate students, the study showed that connotatively consistent and reversed connotatively inconsistent items were not…

Descriptors: Generalizability Theory, Graduate Students, Graduate Study, Likert Scales

A Domain-Referenced Approach to Diagnostic Testing Using Generalizability Theory.

Peer reviewed

Webb, Noreen M.; And Others – Journal of Educational Measurement, 1987

This paper describes a four-step approach to constructing diagnostic test profiles that provide precise but practical information on students' instructional needs. A test of pronoun use was constructed to represent 32 categories of usage defined by different combinations of five factors in a domain. (Author/LMO)

Descriptors: Diagnostic Tests, Estimation (Mathematics), Generalizability Theory, Intermediate Grades

Investigating the Relative Effects of Persons, Items, Sections, and Languages on TOEIC Score Dependability

Peer reviewed

Direct link

Zhang, Su – Language Testing, 2006

This study applied generalizability theory to investigate the contributions of persons, items, sections, and language backgrounds to the score dependability of the Test of English for International Communication (TOEIC). I replicated and extended Brown's (1999) study of the Test of English as a Foreign Language (TOEFL), using data from two…

Descriptors: Communication (Thought Transfer), Generalizability Theory, English (Second Language), Scores

Score Reliability as an Essential Prerequisite for Validating New Writing and Speaking Tasks for TOEFL.

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002

This paper reports the results of generalizability theory (G) analyses done for new writing and speaking tasks for the Test of English as a Foreign Language (TOEFL). For writing, a special focus was placed on evaluating the impact on the reliability of the number of raters (or ratings) per essay (one or two) and the number of tasks (one, two, or…

Descriptors: English (Second Language), Generalizability Theory, Reliability, Scores

A Multivariate Generalizability Analysis of the 1989 and 1990 AAP Mathematics Test Forms with Respect to the Table of Specifications.

Download full text

Colton, Dean A. – 1993

Tables of specifications are used to guide test developers in sampling items and maintaining consistency from form to form. This paper is a generalizability study of the American College Testing Program (ACT) Achievement Program Mathematics Test (AAP), with the content areas of the table of specifications representing multiple dependent variables.…

Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Generalizability Theory

Testing Pronunciation: An Application of Generalizability Theory.

van Weeren, J.; Theunissen, T. J. J. M. – 1986

Pronunciation is regarded as a valuable subskill in foreign language teaching and testing. Its quality is commonly assessed in a global way by having examinees read aloud. An atomistic test is a more systematic and explicit approach. Such a test would consist of about 40 items, use recorded performances, and draw on an inventory of pronunciation…

Descriptors: Audiotape Recordings, Error Patterns, French, Generalizability Theory

Content Specifications of a Test and Generalizability Theory.

Gonzalez-Tamayo, Eulogio – 1987

The concepts of universe of admissible observation and universe of generalization from the generalizability theory were applied to calculate the intraclass correlation coefficient of a licensure test. The internal consistency coefficient of a dichotomously scored test is identical to the intraclass correlation coefficient of a two-facet design.…

Descriptors: Adults, Analysis of Variance, Content Validity, Criterion Referenced Tests