Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 18 |
| Since 2017 (last 10 years) | 29 |
| Since 2007 (last 20 years) | 39 |
Descriptor
| Foreign Countries | 40 |
| Test Items | 40 |
| Language Processing | 27 |
| Second Language Learning | 18 |
| English (Second Language) | 17 |
| Language Tests | 17 |
| Item Analysis | 15 |
| Natural Language Processing | 14 |
| Comparative Analysis | 12 |
| Computer Assisted Testing | 12 |
| Multiple Choice Tests | 10 |
Author
| Goldhammer, Frank | 2 |
| Reima Al-Jarf | 2 |
| Sälzer, Christine | 2 |
| Zehner, Fabian | 2 |
| Afsar Rouhi | 1 |
| Al-Jarf, Reima | 1 |
| Aldabe, Itziar | 1 |
| Altinbas, Mehmet Emre | 1 |
| Anna Lucia Paoletti | 1 |
| Awadh, Awadh Nasser Munassar | 1 |
| Ayaka Sugawara | 1 |
Publication Type
| Journal Articles | 37 |
| Reports - Research | 37 |
| Tests/Questionnaires | 4 |
| Reports - Descriptive | 2 |
| Speeches/Meeting Papers | 2 |
| Reports - Evaluative | 1 |
Owen Henkel; Libby Hills; Bill Roberts; Joshua McGrane – International Journal of Artificial Intelligence in Education, 2025
Formative assessment plays a critical role in improving learning outcomes by providing feedback on student mastery. Open-ended questions, which require students to produce multi-word, nontrivial responses, are a popular tool for formative assessment as they provide more specific insights into what students do and do not know. However, grading…
Descriptors: Artificial Intelligence, Grading, Reading Comprehension, Natural Language Processing
Valentina Albano; Donatella Firmani; Luigi Laura; Jerin George Mathew; Anna Lucia Paoletti; Irene Torrente – Journal of Learning Analytics, 2023
Multiple-choice questions (MCQs) are widely used in educational assessments and professional certification exams. Managing large repositories of MCQs, however, poses several challenges due to the high volume of questions and the need to maintain their quality and relevance over time. One of these challenges is the presence of questions that…
Descriptors: Natural Language Processing, Multiple Choice Tests, Test Items, Item Analysis
C. H., Dhawaleswar Rao; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2023
Multiple-choice questions (MCQs) play a significant role in educational assessment. Automatic MCQ generation has been an active research area for years, and many systems have been developed for MCQ generation. Still, we could not find any system that generates accurate MCQs from school-level textbook contents that are useful in real examinations.…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Automation, Test Items
Reima Al-Jarf – Online Submission, 2024
Expressions of impossibility refer to events that can never or rarely happen, tasks that are difficult or impossible to perform, people or things that are of no use and things that are impossible to find. This study explores the similarities and differences between English and Arabic expressions of impossibility, and the difficulties that…
Descriptors: English (Second Language), Second Language Learning, Arabic, Translation
Okumura, Yuko; Oshima-Takane, Yuriko; Kobayashi, Tessei; Ma, Michelle; Kayama, Yuhko – Language Learning and Development, 2023
In successful communication, it is critical to have the ability to identify what a speaker is referring to from previously mentioned information. This ability requires the identification of the topic initially introduced by lexical forms and its continuity in discourse expressed by anaphora such as null and pronominal forms in the subsequent…
Descriptors: Form Classes (Languages), Sentence Structure, Japanese, Language Acquisition
Gombert, Sebastian; Di Mitri, Daniele; Karademir, Onur; Kubsch, Marcus; Kolbe, Hannah; Tautz, Simon; Grimm, Adrian; Bohm, Isabell; Neumann, Knut; Drachsler, Hendrik – Journal of Computer Assisted Learning, 2023
Background: Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task…
Descriptors: Coding, Energy, Scientific Concepts, Formative Evaluation
Seyedeh Azadeh Ghiasian; Fatemeh Hemmati; Seyyed Mohammad Alavi; Afsar Rouhi – International Journal of Language Testing, 2025
A critical component of cognitive diagnostic models (CDMs) is a Q-matrix that stipulates associations between items of a test and their required attributes. The present study aims to develop and empirically validate a Q-matrix for the listening comprehension section of the International English Language Testing System (IELTS). To this end, a…
Descriptors: Test Items, Listening Comprehension Tests, English (Second Language), Language Tests
Rosemary Erlam; Lan Wei – Language Teaching Research, 2024
This study is a conceptual replication of Ellis' 'Measuring implicit and explicit knowledge of a second language: A psychometric study', published in "Studies in Second Language Acquisition" (2005), aiming to establish the importance of including belief statements (hypothesized to increase processing demands) in the design of Elicited…
Descriptors: Language Processing, Language Tests, Second Language Learning, Psychometrics
Shin, Jinnie; Gierl, Mark J. – International Journal of Testing, 2022
Over the last five years, tremendous strides have been made in advancing the AIG methodology required to produce items in diverse content areas. However, the one content area where enormous problems remain unsolved is language arts, generally, and reading comprehension, more specifically. While reading comprehension test items can be created using…
Descriptors: Reading Comprehension, Test Construction, Test Items, Natural Language Processing
Tim Stoeckel; Tomoko Ishii – Vocabulary Learning and Instruction, 2024
In an upcoming coverage-comprehension study, we plan to assess learners' meaning-recall knowledge of words as they occur in the study's reading passage. As several meaning-recall test formats exist, the purpose of this small-scale study (N = 10) was to determine which of three formats was most similar to a criterion interview regarding mean score…
Descriptors: Vocabulary Development, Language Tests, Second Language Learning, Classification
Reima Al-Jarf – Online Submission, 2023
Time metaphorical expressions are common in all languages and in general as well as specialized contexts. This study explores the similarities and differences between English and Arabic time metaphorical expressions containing , and the difficulties that student-translators have in translating them; the translation strategies they use and the…
Descriptors: Time, Figurative Language, Arabic, English (Second Language)
Al-Jarf, Reima – Online Submission, 2023
This study explores the similarities and differences between English and Arabic numeral-based formulaic expressions, and difficulties that student-translators have with them. A corpus of English and Arabic numeral-based formulaic expressions containing zero, two, three, twenty, sixty, hundred, thousand…etc., and another corpus of specialized…
Descriptors: Translation, Arabic, Contrastive Linguistics, Phrase Structure
Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023
Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses
Holzknecht, Franz; McCray, Gareth; Eberharter, Kathrin; Kremmel, Benjamin; Zehentner, Matthias; Spiby, Richard; Dunlea, Jamie – Language Testing, 2021
Studies from various disciplines have reported that spatial location of options in relation to processing order impacts the ultimate choice of the option. A large number of studies have found a primacy effect, that is, the tendency to prefer the first option. In this paper we report on evidence that position of the key in four-option…
Descriptors: Language Tests, Test Items, Multiple Choice Tests, Listening Comprehension Tests
Qiao Wang; Ralph L. Rose; Ayaka Sugawara; Naho Orita – Vocabulary Learning and Instruction, 2025
VocQGen is an automated tool designed to generate multiple-choice cloze (MCC) questions for vocabulary assessment in second language learning contexts. It leverages several natural language processing (NLP) tools and OpenAI's GPT-4 model to produce MCC items quickly from user-specified word lists. To evaluate its effectiveness, we used the first…
Descriptors: Vocabulary Skills, Artificial Intelligence, Computer Software, Multiple Choice Tests

