NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018
The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Yi, Yeon-Sook – Language Testing, 2017
The present study examines the relative importance of attributes within and across items by applying four cognitive diagnostic assessment models. The current study utilizes the function of the models that can indicate inter-attribute relationships that reflect the response behaviors of examinees to analyze scored test-taker responses to four forms…
Descriptors: Second Language Learning, Reading Comprehension, Listening Comprehension, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Gu, Lin – Language Testing, 2015
In this study I examined the dimensionality of the latent ability underlying language use that is needed to fulfill the demands young learners face in English-medium instructional environments, where English is used as the means of instruction for teaching subject matters. Previous research on English language use by school-age children provided…
Descriptors: Language Aptitude, Language Proficiency, English (Second Language), English Language Learners
Peer reviewed Peer reviewed
Direct linkDirect link
Gu, Lin – Language Testing, 2014
This study investigated the relationship between latent components of academic English language ability and test takers' study-abroad and classroom learning experiences through a structural equation modeling approach in the context of TOEFL iBT® testing. Data from the TOEFL iBT public dataset were used. The results showed that test takers'…
Descriptors: Language Tests, Second Language Learning, Language Proficiency, English for Academic Purposes
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Ah-Young – Language Testing, 2015
Previous research in cognitive diagnostic assessment (CDA) of L2 reading ability has been frequently conducted using large-scale English proficiency exams (e.g., TOEFL, MELAB). Using CDA, it is possible to analyze individual learners' strengths and weaknesses in multiple attributes (i.e., knowledge, skill, strategy) measured at the item level.…
Descriptors: Language Tests, Diagnostic Tests, Cognitive Measurement, Reading Ability
Peer reviewed Peer reviewed
Direct linkDirect link
Kunnan, Antony John – Language Testing, 2010
This paper presents the author's response to Xiaoming Xi's article titled "How do we go about investigating test fairness?" In this response, the author focuses on test fairness and Toulmin's model of argument structure, Xi's proposal, and the challenges the proposal brings. Xi proposes an approach to investigating test fairness to guide…
Descriptors: Persuasive Discourse, Inferences, Test Bias, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Sang-Keun, Shin – Language Testing, 2005
This study investigated the relationship between examinee proficiency and the structure of the Test of English as a Foreign Language (TOEFL) and the Speaking Proficiency in English Assessment Kit (SPEAK). Specifically, using multi-group structural equation modeling, this study tested two competing hypotheses about the relationship: whether or not…
Descriptors: Models, Language Proficiency, Language Tests, Language Aptitude
Peer reviewed Peer reviewed
Choi, Inn-Chull; Bachman, Lyle F. – Language Testing, 1992
This study is part of a larger one examining the comparability of the First Certificate in English and the Test of English as a Foreign Language. The general assumption of unidimensionality and goodness-of-fit were tested. Findings raise questions about the consequences of rejecting or retaining misfitting items. (60 references) (LB)
Descriptors: Comparative Analysis, English (Second Language), Goodness of Fit, Item Response Theory
Peer reviewed Peer reviewed
Boldt, Robert F. – Language Testing, 1992
The assumption called PIRC (proportional item response curve) was tested in which PIRC was used to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logist model, and a modified Rasch model. (12 references) (Author/LB)
Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory