Aryadoust, Vahid; Ng, Li Ying; Sayama, Hiroki – Language Testing, 2021
Over the past decades, the application of Rasch measurement in language assessment has gradually increased. In the present study, we coded 215 papers using Rasch measurement published in 21 applied linguistics journals for multiple features. We found that seven Rasch models and 23 software packages were adopted in these papers, with many-facet…
Descriptors: Language Tests, Testing, Test Items, Network Analysis
Mingying Zheng – ProQuest LLC, 2024
The digital transformation in educational assessment has led to the proliferation of large-scale data, offering unprecedented opportunities to enhance language learning and testing through machine learning (ML) techniques. Drawing on the extensive data generated by online English language assessments, this dissertation investigates the efficacy…
Descriptors: Artificial Intelligence, Computational Linguistics, Language Tests, English (Second Language)
Akbari, Alireza; Shahnazari, Mohammadtaghi – Language Testing in Asia, 2019
The present research paper introduces a translation evaluation method called Calibrated Parsing Items Evaluation (CPIE hereafter). This evaluation method maximizes translators' performance through identifying the parsing items with an optimal p-docimology and d-index (item discrimination). This method checks all the possible parses (annotations)…
Descriptors: Test Items, Translation, Computer Software, Evaluators
Qiao Wang; Ralph L. Rose; Ayaka Sugawara; Naho Orita – Vocabulary Learning and Instruction, 2025
VocQGen is an automated tool designed to generate multiple-choice cloze (MCC) questions for vocabulary assessment in second language learning contexts. It leverages several natural language processing (NLP) tools and OpenAI's GPT-4 model to produce MCC items quickly from user-specified word lists. To evaluate its effectiveness, we used the first…
Descriptors: Vocabulary Skills, Artificial Intelligence, Computer Software, Multiple Choice Tests
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Liao, Linyu – English Language Teaching, 2020
As a high-stakes standardized test, IELTS is expected to have comparable forms of test papers so that test takers from different test administrations on different dates receive comparable test scores. Therefore, this study examined the text difficulty and task characteristics of four parallel academic IELTS reading tests to reveal to what extent…
Descriptors: Second Language Learning, English (Second Language), Language Tests, High Stakes Tests
van Rijn, Peter W.; Ali, Usama S. – ETS Research Report Series, 2018
A computer program was developed to estimate speed-accuracy response models for dichotomous items. This report describes how the models are estimated and how to specify data and input files. An example using data from a listening section of an international language test is described to illustrate the modeling approach and features of the computer…
Descriptors: Computer Software, Computation, Reaction Time, Timed Tests
Nicklin, Christopher; Vitta, Joseph P. – Language Testing, 2022
Instrument measurement conducted with Rasch analysis is a common process in language assessment research. A recent systematic review of 215 studies involving Rasch analysis in language testing and applied linguistics research reported that 23 different software packages had been utilized. However, none of the analyses were conducted with one of…
Descriptors: Programming Languages, Vocabulary Development, Language Tests, Computer Software
Awadh, Awadh Nasser Munassar; Khan, Ansarullah Shafiull – Journal of Language and Linguistic Studies, 2020
This study aims at investigating the challenges that Yemeni translation students encounter when translating neologisms from English into Arabic. It also aims at comparing students' translation with outcomes of machine translation (MT). The authors follow the descriptive and comparative methods in conducting this study. To achieve the objective of…
Descriptors: Barriers, Translation, English (Second Language), Semitic Languages
Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018
This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…
Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy
Lesnov, Roman Olegovich – International Journal of Computer-Assisted Language Learning and Teaching, 2018
This article compares second language test-takers' performance on an academic listening test in an audio-only mode versus an audio-video mode. A new method of classifying video-based visuals was developed and piloted, which used L2 expert opinions to place the video on a continuum from being content-deficient (not helpful for answering…
Descriptors: Second Language Learning, Second Language Instruction, Video Technology, Classification
Sieh, Yu-cheng – Taiwan Journal of TESOL, 2016
In an attempt to compare how orthography and phonology interact in EFL learners with different reading abilities, online measures were administered in this study to two groups of university learners, indexed by their reading scores on the Test of English for International Communication (TOEIC). In terms of "accuracy," the less-skilled…
Descriptors: Comparative Analysis, Word Recognition, Phonology, English (Second Language)
Ashwell, Tim; Elam, Jesse R. – JALT CALL Journal, 2017
The ultimate aim of our research project was to use the Google Web Speech API to automate scoring of elicited imitation (EI) tests. However, in order to achieve this goal, we had to take a number of preparatory steps. We needed to assess how accurate this speech recognition tool is in recognizing native speakers' production of the test items; we…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Papageorgiou, Spiros; Stevens, Robin; Goodwin, Sarah – Language Assessment Quarterly, 2012
Listening comprehension tests typically include both monologic and dialogic input to measure listening ability. However, research as to which type of input is more challenging for examinees remains limited and has provided inconclusive results (Brindley & Slatyer, 2002; Read, 2002; Shohamy & Inbar, 1991). A better understanding of the…
Descriptors: Listening Comprehension Tests, Test Items, Content Analysis, Listening Comprehension