NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 8 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shin, Jinnie; Gierl, Mark J. – Language Testing, 2021
Automated essay scoring (AES) has emerged as a secondary or as a sole marker for many high-stakes educational assessments, in native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep-neural algorithms. The purpose of this study is to compare the effectiveness…
Descriptors: Scoring, Essays, Writing Evaluation, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
Li, Hongli; Hunter, C. Vincent; Lei, Pui-Wa – Language Testing, 2016
Cognitive diagnostic models (CDMs) have great promise for providing diagnostic information to aid learning and instruction, and a large number of CDMs have been proposed. However, the assumptions and performances of different CDMs and their applications in regard to reading comprehension tests are not fully understood. In the present study, we…
Descriptors: Reading Comprehension, Reading Tests, Models, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Tobia, Valentina; Ciancaleoni, Matteo; Bonifacci, Paola – Language Testing, 2017
In this study, two alternative theoretical models were compared, in order to analyze which of them best explains primary school children's text comprehension skills. The first one was based on the distinction between two types of answers requested by the comprehension test: local or global. The second model involved texts' input modality: written…
Descriptors: Elementary School Students, Language Tests, Foreign Countries, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Yi, Yeon-Sook – Language Testing, 2017
The present study examines the relative importance of attributes within and across items by applying four cognitive diagnostic assessment models. The current study utilizes the function of the models that can indicate inter-attribute relationships that reflect the response behaviors of examinees to analyze scored test-taker responses to four forms…
Descriptors: Second Language Learning, Reading Comprehension, Listening Comprehension, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Elder, Catherine; McNamara, Tim – Language Testing, 2016
Gaining insights from domain experts into how they view communication in real world settings is recognized as an important authenticity consideration in the development of criteria to assess language proficiency for specific academic or occupational purposes. These "indigenous" criteria represent an articulation of the test construct and…
Descriptors: Feedback (Response), Language Proficiency, Language Tests, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Granfeldt, Jonas; Ågren, Malin – Language Testing, 2014
One core area of research in Second Language Acquisition is the identification and definition of developmental stages in different L2s. For L2 French, Bartning and Schlyter (2004) presented a model of six morphosyntactic stages of development in the shape of grammatical profiles. The model formed the basis for the computer program Direkt Profil…
Descriptors: Second Language Learning, Language Tests, French, Language Teachers
Peer reviewed Peer reviewed
Choi, Inn-Chull; Bachman, Lyle F. – Language Testing, 1992
This study is part of a larger one examining the comparability of the First Certificate in English and the Test of English as a Foreign Language. The general assumption of unidimensionality and goodness-of-fit were tested. Findings raise questions about the consequences of rejecting or retaining misfitting items. (60 references) (LB)
Descriptors: Comparative Analysis, English (Second Language), Goodness of Fit, Item Response Theory
Peer reviewed Peer reviewed
Boldt, Robert F. – Language Testing, 1992
The assumption called PIRC (proportional item response curve) was tested in which PIRC was used to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logist model, and a modified Rasch model. (12 references) (Author/LB)
Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory