Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 4 |
Descriptor
Source
Language Testing | 5 |
Author
Deygers, Bart | 1 |
Dollerup, Cay | 1 |
Lin, Chih-Kai | 1 |
Pae, Tae-Il | 1 |
Staples, Shelley | 1 |
Van Gorp, Koen | 1 |
Yan, Xun | 1 |
Publication Type
Journal Articles | 5 |
Reports - Research | 4 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 1 |
Audience
Location
Denmark | 1 |
Netherlands | 1 |
South Korea | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yan, Xun; Staples, Shelley – Language Testing, 2020
The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…
Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability
Pae, Tae-Il – Language Testing, 2012
This study tracked gender differential item functioning (DIF) on the English subtest of the Korean College Scholastic Aptitude Test (KCSAT) over a nine-year period across three data points, using both the Mantel-Haenszel (MH) and item response theory likelihood ratio (IRT-LR) procedures. Further, the study identified two factors (i.e. reading…
Descriptors: Aptitude Tests, Academic Aptitude, Language Tests, Test Items

Dollerup, Cay; And Others – Language Testing, 1994
Examines a Danish English-language reading proficiency test offered to freshman students to diagnose weaknesses which may impede their academic careers. To facilitate the assessment of what parts can be transferred and used in other language areas, the article discusses the test construction, development and improvement. (11 references) (Author/CK)
Descriptors: College Students, Comparative Analysis, Danish, Data Analysis