Kane, Michael – Language Testing, 2012
The argument-based approach to validation involves two steps: specification of the proposed interpretations and uses of the test scores as an interpretive argument, and evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…
Descriptors: Testing, Language Tests, Inferences, Test Validity
Haug, Tobias – Language Testing, 2012
Despite the current need for reliable and valid test instruments in different countries in order to monitor the sign language acquisition of deaf children, very few tests are commercially available that offer strong evidence for their psychometric properties. This mirrors the current state of affairs for many sign languages, where very little…
Descriptors: Evidence, Sign Language, Language Tests, Construct Validity
Kane, Michael – Language Testing, 2010
This paper presents the author's critique of Xiaoming Xi's article "How do we go about investigating test fairness?", which lays out a broad framework for studying fairness as comparable validity across groups within the population of interest. Xi proposes to develop a fairness argument that would identify and evaluate potential fairness-based…
Descriptors: Test Bias, Test Validity, Language Tests, Testing
Bae, Jungok; Bachman, Lyle F. – Language Testing, 2010
This study investigated the validity of four theoretically motivated traits of writing ability across English and Korean, based on elementary school students' responses to letter- and story-writing tasks. Their responses were scored analytically and analyzed using confirmatory factor analysis. The findings include the following. A model of writing…
Descriptors: Elementary School Students, Validity, Korean, English (Second Language)

Boldt, Robert F. – Language Testing, 1992
The proportional item response curve (PIRC) assumption was tested by using it to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logistic model, and a modified Rasch model. (12 references) (Author/LB)
Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory

Raatz, Ulrich – Language Testing, 1985
Argues that classical test theory cannot be used at the item level on "authentic" language tests. However, if the total score is derived by adding the scores of a number of different and independent parts, test reliability can be estimated. Suggests using the Classical Latent Additives model to examine test-part homogeneity. (Author/SED)
Descriptors: Item Analysis, Latent Trait Theory, Models, Second Language Learning

Hamp-Lyons, Liz – Language Testing, 1997
Links the theory of washback to the broader concept of impact in educational measurement and to the recent debate on construct validity associated with Messick. Notes that for many years it was asserted that language tests negatively impacted teaching and learning, an impact known as washback. (25 references) (Author/CK)
Descriptors: Ethics, Higher Education, Language Tests, Measurement Techniques

Chalhoub-Deville, Micheline – Language Testing, 1997
Reviews the usefulness of proficiency models influencing second language testing. Findings indicate that several factors contribute to the lack of congruence between models and test construction and make a case for distinguishing between theoretical models. Underscores the significance of an empirical, contextualized and structured approach to the…
Descriptors: Communicative Competence (Languages), Language Proficiency, Language Tests, Linguistic Theory

Choi, Inn-Chull; Bachman, Lyle F. – Language Testing, 1992
This study is part of a larger one examining the comparability of the First Certificate in English and the Test of English as a Foreign Language. The general assumption of unidimensionality and goodness-of-fit were tested. Findings raise questions about the consequences of rejecting or retaining misfitting items. (60 references) (LB)
Descriptors: Comparative Analysis, English (Second Language), Goodness of Fit, Item Response Theory

Bailey, Kathleen M. – Language Testing, 1996
Presents a literature review seeking to answer four questions: (1) What is washback? (2) How does washback work? (3) How can we promote positive washback? and (4) How can we investigate washback? A model is proposed that identifies participants, processes, and products which may influence, or be influenced by, washback. Strategies for investigating…
Descriptors: Change Strategies, Construct Validity, Educational Philosophy, Language Proficiency

Bachman, Lyle F.; And Others – Language Testing, 1996
Discusses the value of content considerations in the design of language tests and the implications of the findings of various investigations of content analysis. The article argues that content analysis can be viewed as the application of a model of test design to a particular measurement instrument, using judgments of trained analysts. (26…
Descriptors: College Students, Content Analysis, English (Second Language), Item Analysis