Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 4 |
Descriptor
Performance Based Assessment | 4 |
Scores | 4 |
Evaluators | 3 |
Language Tests | 3 |
Scoring | 3 |
Second Language Learning | 3 |
English (Second Language) | 2 |
Reliability | 2 |
Academic Discourse | 1 |
Accuracy | 1 |
Bias | 1 |
More ▼ |
Source
Language Testing | 4 |
Publication Type
Journal Articles | 4 |
Reports - Research | 3 |
Opinion Papers | 1 |
Reports - Evaluative | 1 |
Education Level
Audience
Location
Colombia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Trace, Jonathan; Janssen, Gerriet; Meier, Valerie – Language Testing, 2017
Previous research in second language writing has shown that when scoring performance assessments even trained raters can exhibit significant differences in severity. When raters disagree, using discussion to try to reach a consensus is one popular form of score resolution, particularly in contexts with limited resources, as it does not require…
Descriptors: Performance Based Assessment, Second Language Learning, Scoring, Evaluators
Hoekje, Barbara – Language Testing, 2016
This commentary argues that the OET research raises inescapable contradictions in trying to separate "language" from "communication" within a weak performance test and advocates for reconceptualizing the legitimate domain of "language" more widely, reclaiming the full potential of the communicative competence…
Descriptors: Language Tests, Languages for Special Purposes, Second Language Learning, Communicative Competence (Languages)
Xi, Xiaoming – Language Testing, 2007
This study explores the utility of analytic scoring for TAST in providing useful and reliable diagnostic information for operational use in three aspects of candidates' performance: delivery, language use and topic development. One hundred and forty examinees' responses to six TAST tasks were scored analytically on these three aspects of speech. G…
Descriptors: Scoring, Profiles, Performance Based Assessment, Academic Discourse