ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	3

Descriptor

Generalizability Theory	4
Interrater Reliability	4
Accuracy	2
Data Analysis	2
Evaluators	2
Performance Based Assessment	2
Scores	2
Achievement Tests	1
Certification	1
Comparative Analysis	1
Cutting Scores	1
English	1
English (Second Language)	1
Error of Measurement	1
Essays	1
Evaluation Methods	1
Expertise	1
Factor Analysis	1
Factor Structure	1
Feedback (Response)	1
Foreign Countries	1
Group Testing	1
High Stakes Tests	1
Investigations	1
Japanese	1

Source

Language Testing

Author

Attali, Yigal	1
Kozaki, Y.	1
Lin, Chih-Kai	1
Van Moere, Alistair	1

Publication Type

Journal Articles	4
Reports - Research	3
Reports - Evaluative	1

Education Level

Higher Education

Location

Japan

Showing all 4 results

Working with Sparse Data in Rated Language Tests: Generalizability Theory Applications

Peer reviewed

Lin, Chih-Kai – Language Testing, 2017

Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…

Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy

A Comparison of Newly-Trained and Experienced Raters on a Standardized Writing Assessment

Peer reviewed

Attali, Yigal – Language Testing, 2016

A short training program for evaluating responses to an essay writing task consisted of scoring 20 training essays with immediate feedback about the correct score. The same scoring session also served as a certification test for trainees. Participants with little or no previous rating experience completed this session and 14 trainees who passed an…

Descriptors: Writing Evaluation, Writing Tests, Standardized Tests, Evaluators

Using GENOVA and FACETS to Set Multiple Standards on Performance Assessment for Certification in Medical Translation from Japanese into English

Peer reviewed

Kozaki, Y. – Language Testing, 2004

This article presents a standard-setting procedure for performance assessment in a foreign language, through which some of the major problems facing performance assessment in criterion-referenced testing can be addressed. The procedure, which was geared to revealing and accommodating inter-judge variability, employed the synergy of multiple…

Descriptors: Data Analysis, Testing, Performance Tests, Generalizability Theory

Validity Evidence in a University Group Oral Test

Peer reviewed

Van Moere, Alistair – Language Testing, 2006

This article investigates a group oral test as administered at a university in Japan to find if it is appropriate to use scores for higher stakes decision making. It is one component of an in-house English proficiency test used for placing students, evaluating their progress, and making informed decisions for the development of the English…

Descriptors: Foreign Countries, Generalizability Theory, Achievement Tests, English (Second Language)