NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael – Language Testing, 2012
The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…
Descriptors: Testing, Language Tests, Inferences, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Haug, Tobias – Language Testing, 2012
Despite the current need for reliable and valid test instruments in different countries in order to monitor the sign language acquisition of deaf children, very few tests are commercially available that offer strong evidence for their psychometric properties. This mirrors the current state of affairs for many sign languages, where very little…
Descriptors: Evidence, Sign Language, Language Tests, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael – Language Testing, 2010
This paper presents the author's critique on Xiaoming Xi's article, "How do we go about investigating test fairness?," which lays out a broad framework for studying fairness as comparable validity across groups within the population of interest. Xi proposes to develop a fairness argument that would identify and evaluate potential fairness-based…
Descriptors: Test Bias, Test Validity, Language Tests, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Bae, Jungok; Bachman, Lyle F. – Language Testing, 2010
This study investigated the validity of four theoretically motivated traits of writing ability across English and Korean, based on elementary school students' responses to letter- and story-writing tasks. Their responses were scored analytically and analyzed using confirmatory factor analysis. The findings include the following. A model of writing…
Descriptors: Elementary School Students, Validity, Korean, English (Second Language)
Peer reviewed Peer reviewed
Boldt, Robert F. – Language Testing, 1992
The assumption called PIRC (proportional item response curve) was tested in which PIRC was used to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logist model, and a modified Rasch model. (12 references) (Author/LB)
Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory
Peer reviewed Peer reviewed
Raatz, Ulrich – Language Testing, 1985
Argues that classical test theory cannot be used at the item level on "authentic" language tests. However, if the total score is derived by adding the scores of a number of different and independent parts, test reliability can be estimated. Suggests using the Classical Latent Additives model to examine test-part homogeneity. (Author/SED)
Descriptors: Item Analysis, Latent Trait Theory, Models, Second Language Learning
Peer reviewed Peer reviewed
Hamp-Lyons, Liz – Language Testing, 1997
Links the theory of washback with the broader concept of impact in educational measurement and to the recent debate on construct validity associated with Messick. Notes that for many years it was asserted that language tests negatively impacted teaching and learning, an impact known as washback. (25 references) (Author/CK)
Descriptors: Ethics, Higher Education, Language Tests, Measurement Techniques
Peer reviewed Peer reviewed
Chalhoub-Deville, Micheline – Language Testing, 1997
Reviews the usefulness of proficiency models influencing second language testing. Findings indicate that several factors contribute to the lack of congruence between models and test construction and make a case for distinguishing between theoretical models. Underscores the significance of an empirical, contextualized and structured approach to the…
Descriptors: Communicative Competence (Languages), Language Proficiency, Language Tests, Linguistic Theory
Peer reviewed Peer reviewed
Choi, Inn-Chull; Bachman, Lyle F. – Language Testing, 1992
This study is part of a larger one examining the comparability of the First Certificate in English and the Test of English as a Foreign Language. The general assumption of unidimensionality and goodness-of-fit were tested. Findings raise questions about the consequences of rejecting or retaining misfitting items. (60 references) (LB)
Descriptors: Comparative Analysis, English (Second Language), Goodness of Fit, Item Response Theory
Peer reviewed Peer reviewed
Bailey, Kathleen M. – Language Testing, 1996
Presents a literature review seeking to answer four questions: (1) What is washback? (2) How does washback work? (3) How can we promote positive washback? and (4) How can we investigate washback? A model is proposed that identifies participants, processes and products which may influence or be influenced by, washback. Strategies for investigating…
Descriptors: Change Strategies, Construct Validity, Educational Philosophy, Language Proficiency
Peer reviewed Peer reviewed
Bachman, Lyle F.; And Others – Language Testing, 1996
Discusses the value of content considerations in the design of language tests and the implications of the findings of various investigations of content analysis. The article argues that content analysis can be viewed as the application of a model of test design to a particular measurement instrument, using judgments of trained analysts. (26…
Descriptors: College Students, Content Analysis, English (Second Language), Item Analysis