Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 17 |
Since 2006 (last 20 years) | 33 |
Descriptor
Language Tests | 78 |
Multiple Choice Tests | 78 |
Test Validity | 78 |
English (Second Language) | 48 |
Second Language Learning | 41 |
Test Reliability | 32 |
Foreign Countries | 31 |
Test Construction | 29 |
Language Proficiency | 27 |
Test Items | 23 |
Cloze Procedure | 22 |
Author
Coniam, David | 3 |
Bensoussan, Marsha | 2 |
Lee, Tony | 2 |
Stansfield, Charles | 2 |
Ai, Haiyang | 1 |
Akhtar, Samina | 1 |
Ali Zahabi | 1 |
Allan, Alistair | 1 |
Alonzo, Julie | 1 |
Anderson, Daniel | 1 |
Appenzellar, Anne B. | 1 |
Education Level
Higher Education | 16 |
Postsecondary Education | 16 |
Secondary Education | 8 |
Elementary Education | 4 |
Grade 8 | 4 |
Grade 7 | 3 |
Junior High Schools | 3 |
Middle Schools | 3 |
Grade 6 | 2 |
High Schools | 2 |
Intermediate Grades | 2 |
Audience
Practitioners | 2 |
Teachers | 2 |
Researchers | 1 |
Location
China | 5 |
Netherlands | 5 |
Iran | 4 |
Canada | 2 |
Japan | 2 |
Alabama | 1 |
Algeria | 1 |
Arizona | 1 |
Arkansas | 1 |
Armenia | 1 |
California | 1 |
Stefan O'Grady – International Journal of Listening, 2025
Language assessment is increasingly computer-mediated. This development presents opportunities with new task formats and equally a need for renewed scrutiny of established conventions. Recent recommendations to increase integrated skills assessment in lecture comprehension tests are premised on empirical research that demonstrates enhanced construct…
Descriptors: Language Tests, Lecture Method, Listening Comprehension Tests, Multiple Choice Tests
Coniam, David; Lee, Tony; Lampropoulou, Leda – English Language Teaching, 2021
This article explores the issue of identifying guessers -- with a specific focus on multiple-choice tests. Guessing has long been considered a problem due to the fact that it compromises validity. A test taker scoring higher than they should through guessing does not provide a picture of their actual ability. After an initial description of issues…
Descriptors: Language Tests, Guessing (Tests), English (Second Language), Second Language Learning
Budi Waluyo; Ali Zahabi; Luksika Ruangsung – rEFLections, 2024
The increasing popularity of the Common European Framework of Reference (CEFR) in non-native English-speaking countries has generated a demand for concrete examples in the creation of CEFR-based tests that assess the four main English skills. In response, this research endeavors to provide insight into the development and validation of a…
Descriptors: Language Tests, Language Proficiency, Undergraduate Students, Language Skills
New York State Education Department, 2024
The New York State Education Department (NYSED) has a partnership with NWEA for the development of the 2024 Grades 3-8 English Language Arts Tests. Teachers from across the State work with NYSED in a variety of activities to ensure the validity and reliability of the New York State Testing Program (NYSTP). The 2024 Grades 6 and 7 English Language…
Descriptors: Language Tests, Test Format, Language Arts, English Instruction
Madyarov, Irshat; Movsisyan, Vahe; Madoyan, Habet; Galikyan, Irena; Gasparyan, Rubina – ETS Research Report Series, 2021
The "TOEFL Junior"® Standard test is a tool for measuring the English language skills of students ages 11+ who learn English as an additional language. It is a paper-based multiple-choice test and measures proficiency in three sections: listening, form and meaning, and reading. To date, empirical evidence provides some support for the…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Standardized Tests
Asquith, Steven – TESL-EJ, 2022
Although an accurate measure of vocabulary size is integral to understanding the proficiency of language learners, the validity of multiple-choice (M/C) vocabulary tests to determine this has been questioned due to users guessing correct answers which inflates scores. In this paper the nature of guessing and partial knowledge used when taking the…
Descriptors: Guessing (Tests), English (Second Language), Second Language Learning, Language Tests
Zhang, Xian; Liu, Jianda; Ai, Haiyang – Language Testing, 2020
The main purpose of this study is to investigate guessing in the Yes/No (YN) format vocabulary test. One-hundred-and-five university students took a YN test, a translation task and a multiple-choice vocabulary size test (MC VST). With matched lexical properties between the real words and the pseudowords, pseudowords could index guessing in the YN…
Descriptors: Vocabulary Development, Language Tests, Test Format, College Students
Jamalzadeh, Mehri; Lotfi, Ahmad Reza; Rostami, Masoud – Language Testing in Asia, 2021
The current study sought to examine the validity of a General English Achievement Test (GEAT), administered to university students in the fall semester of the 2018-2019 academic year, by hybridizing differential item functioning (DIF) and differential distractor functioning (DDF) analytical models. Using a purposive sampling method, from the target population…
Descriptors: Language Tests, Achievement Tests, Undergraduate Students, Islam
Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022
The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…
Descriptors: Specialists, Language Tests, Test Validity, College Faculty
Yazdinejad, Anoushe; Zeraatpishe, Mitra – International Journal of Language Testing, 2019
In this study the validity of partial dictation as a measure of overall language proficiency was examined. Two partial dictation tests along with a C-Test, a cloze test, and a reading comprehension test, as criterion measures, were administered to a group of Iranian EFL learners. The coefficients of correlation between partial dictation and…
Descriptors: Test Validity, Verbal Communication, Language Proficiency, Language Tests
Bardovi-Harlig, Kathleen; Su, Yunwen – TESL-EJ, 2021
This exploratory study examines the role of foreign and second language contexts in the acquisition of conventional expressions. A group of 21 ESL learners was compared to 25 EFL learners randomly selected from a larger pool. Both groups completed an aural multiple-choice discourse completion task (MC-DCT), which was developed from a previously…
Descriptors: Multiple Choice Tests, Second Language Learning, Second Language Instruction, English (Second Language)
Toker, Deniz – TESL-EJ, 2019
The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Lim, Hyojung – Language Testing in Asia, 2019
Background: This study aims to empirically answer the question of whether the role of sub-reading skills changes depending on the test format (e.g., multiple-choice vs. open-ended reading questions). The test format effect also addresses the issue of test validity--whether the reading test properly elicits construct-relevant reading skills or…
Descriptors: Foreign Countries, Test Format, Language Tests, English (Second Language)
Zare, Samaneh; Boori, Ali Akbar – International Journal of Language Testing, 2018
In this study, the cloze-elide test was developed and administered under time constraints. This research aims to examine the validity and reliability of the speeded cloze-elide test and investigate its relationship with reading comprehension, C-Test, and multiple-choice cloze test. Processing speed is a vital indicator to distinguish high to…
Descriptors: Cloze Procedure, Timed Tests, Language Tests, English (Second Language)
Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018
In response to a request for additional analyses, in particular reporting confidence intervals around the results, we re-analyzed the data from prior studies. This supplementary report presents the results of the additional analyses addressing classification accuracy, reliability, and criterion-related validity evidence. For ease of reference, we…
Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Classification