Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 2 |
Descriptor
| Data Analysis | 4 |
| Interrater Reliability | 3 |
| Language Tests | 3 |
| Error of Measurement | 2 |
| Foreign Countries | 2 |
| Generalizability Theory | 2 |
| Performance Based Assessment | 2 |
| Scoring | 2 |
| Accuracy | 1 |
| Coding | 1 |
| College Students | 1 |
| More ▼ | |
Source
| Language Testing | 4 |
Publication Type
| Journal Articles | 4 |
| Reports - Research | 4 |
Education Level
Audience
Location
| Denmark | 1 |
| Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability
Kozaki ,Y. – Language Testing, 2004
This article presents a standard-setting procedure for performance assessment in a foreign language, through which some of the major problems facing performance assessment in criterion-referenced testing can be addressed. The procedure, which was geared to revealing and accommodating inter-judge variability, employed the synergy of multiple…
Descriptors: Data Analysis, Testing, Performance Tests, Generalizability Theory
Peer reviewedDollerup, Cay; And Others – Language Testing, 1994
Examines a Danish English-language reading proficiency test offered to freshman students to diagnose weaknesses which may impede their academic careers. To facilitate the assessment of what parts can be transferred and used in other language areas, the article discusses the test construction, development and improvement. (11 references) (Author/CK)
Descriptors: College Students, Comparative Analysis, Danish, Data Analysis

Direct link
