Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 6 |
Descriptor
Source
Language Testing | 9 |
Author
Brown, James Dean | 2 |
Brunfaut, Tineke | 1 |
Guiberson, Mark | 1 |
Henning, Grant | 1 |
Janssen, Gerriet | 1 |
Jason Fan | 1 |
Kozhevnikova, Liudmila | 1 |
McCray, Gareth | 1 |
Pae, Tae-Il | 1 |
Park, Gi-Pyo | 1 |
Raatz, Ulrich | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 7 |
Information Analyses | 1 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 1 |
Audience
Location
Australia | 1 |
China (Guangzhou) | 1 |
Japan | 1 |
Russia | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Clinical Evaluation of… | 1 |
What Works Clearinghouse Rating
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Guiberson, Mark – Language Testing, 2019
This study will demonstrate that group differences on a morphosyntactic measure used for the identification of specific language impairment (SLI) do not guarantee validity for diagnosis and tracking, and will exemplify this with a case study of the Spanish version of the "Clinical Evaluation of Preschool Language-2 Estructura de…
Descriptors: Test Validity, Content Validity, Language Impairments, Morphology (Languages)
McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018
This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…
Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements
Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017
Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…
Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis

Raatz, Ulrich – Language Testing, 1985
Argues that classical test theory cannot be used at the item level on "authentic" language tests. However, if the total score is derived by adding the scores of a number of different and independent parts, test reliability can be estimated. Suggests using the Classical Latent Additives model to examine test-part homogeneity. (Author/SED)
Descriptors: Item Analysis, Latent Trait Theory, Models, Second Language Learning
Pae, Tae-Il; Park, Gi-Pyo – Language Testing, 2006
The present study utilized both the IRT-LR (item response theory likelihood ratio) and a series of CFA (confirmatory factor analysis) multi-sample analyses to systematically examine the relationships between DIF (differential item functioning) and DTF (differential test functioning) with a random sample of 15 000 Korean examinees. Specifically,…
Descriptors: Item Response Theory, Factor Analysis, Test Bias, Test Validity

Brown, James Dean – Language Testing, 1988
The reliability and validity of a cloze procedure used as an English-as-a-second-language (ESL) test in China were improved by applying traditional item analysis and selection techniques. The 'best' test items were chosen on the basis of item facility and discrimination indices, and were administered as a 'tailored cloze.' 29 references listed.…
Descriptors: Adaptive Testing, Cloze Procedure, English (Second Language), Foreign Countries

Henning, Grant; And Others – Language Testing, 1994
Examines the effectiveness of an automated language proficiency test assembly system at an air force base English Language Center. The study focuses on the equivalence of mean score difficulty, total score variance, and intercorrelation covariance across test norms and finds a high level of test-form equivalence and internal consistency. (nine…
Descriptors: Computer Assisted Testing, English (Second Language), Foreign Nationals, Item Analysis