Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Mardiana – Eurasian Journal of Applied Linguistics, 2023
Written inquiries, which are more frequent and have less of a focus on complex thinking, are issues at school. Students are not taught how to respond to questions found in High-Level Thinking Skills (HOTS) tests, hence, their thinking abilities are generally weak. The issue for teachers is that neither they nor anyone else has been able to create…
Descriptors: Skill Development, Thinking Skills, Check Lists, Models
Al-Jarf, Reima – Online Submission, 2023
This study explores the similarities and differences between English and Arabic numeral-based formulaic expressions, and difficulties that student-translators have with them. A corpus of English and Arabic numeral-based formulaic expressions containing zero, two, three, twenty, sixty, hundred, thousand…etc., and another corpus of specialized…
Descriptors: Translation, Arabic, Contrastive Linguistics, Phrase Structure
Cheewasukthaworn, Kanchana – PASAA: Journal of Language Teaching and Learning in Thailand, 2022
In 2016, the Office of the Higher Education Commission issued a directive requiring all higher education institutions in Thailand to have their students take a standardized English proficiency test. According to the directive, the test's results had to align with the Common European Framework of Reference for Languages (CEFR). In response to this…
Descriptors: Test Construction, Standardized Tests, Language Tests, English (Second Language)
Omarov, Nazarbek Bakytbekovich; Mohammed, Aisha; Alghurabi, Ammar Muhi Khleel; Alallo, Hajir Mahmood Ibrahim; Ali, Yusra Mohammed; Hassan, Aalaa Yaseen; Demeuova, Lyazat; Viktorovna, Shvedova Irina; Nazym, Bekenova; Al Khateeb, Nashaat Sultan Afif – International Journal of Language Testing, 2023
The Multiple-choice (MC) item format is commonly used in educational assessments due to its economy and effectiveness across a variety of content domains. However, numerous studies have examined the quality of MC items in high-stakes and higher-education assessments and found many flawed items, especially in terms of distractors. These faulty…
Descriptors: Test Items, Multiple Choice Tests, Item Response Theory, English (Second Language)
Kim, Peter – Language Teaching Research Quarterly, 2021
Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…
Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction
Baghaei, Purya; Dourakhshan, Alireza – International Journal of Language Testing, 2016
The purpose of the present study is to compare the psychometric qualities of canonical single-response multiple-choice items with their double-response counterparts. Thirty two-response, four-option grammar items for undergraduate students of English were constructed. A second version of the test was constructed by replacing one of the correct…
Descriptors: Language Tests, Multiple Choice Tests, Test Items, Factor Analysis
Xu, Lan; Wannaruk, Anchalee – LEARN Journal: Language Education and Acquisition Research Network, 2016
Performing routines in interlanguage is vitally important for EFL learners since it can cause embarrassment between speakers from different cultures. The present study aims to 1) investigate the reliability and validity of an interlanguage pragmatic competence test on routines in a Chinese EFL context with multiple choice discourse completion task…
Descriptors: Language Tests, Test Construction, Pragmatics, Interlanguage
Brown, N. Anthony; Dewey, Dan P.; Cox, Troy L. – Foreign Language Annals, 2014
In this study, the authors evaluated the strengths and limitations of a self-assessment based on ACTFL Can-Do statements ("ACTFL," 2013) as a tool for measuring linguistic gains over an internship abroad in Russia. They assessed its reliability, determined how its items mapped with the ACTFL scale, and measured the degree to which…
Descriptors: Self Evaluation (Individuals), Pretests Posttests, Interviews, Language Proficiency
Zandi, Hamed; Kaivanpanah, Shiva; Alavi, Seyed Mohammad – Iranian Journal of Language Teaching Research, 2014
Reviewing the test specifications to improve the quality of language tests may be a routine process in professional testing systems. However, there is a paucity of research about the effect of specifications review on improving the quality of small-scale tests. The purpose of the present study was twofold: how specifications review could help…
Descriptors: Test Reliability, Test Validity, Language Tests, Test Items
Gu, Lin; Turkan, Sultan; Gomez, Pablo Garcia – ETS Research Report Series, 2015
ELTeach is an online professional development program developed by Educational Testing Service (ETS) in collaboration with National Geographic Learning. The ELTeach program consists of two courses: English-for-Teaching and Professional Knowledge for English Language Teaching (ELT). Each course includes a coordinated assessment leading to a score…
Descriptors: Item Analysis, Test Items, English (Second Language), Second Language Instruction
Haider, Zubair; Latif, Farah; Akhtar, Samina; Mushtaq, Maria – Educational Research and Reviews, 2012
Validity, reliability and item analysis are critical to the process of evaluating the quality of an educational measurement. The present study evaluates the quality of an assessment constructed to measure elementary school student's achievement in English. In this study, the survey model of descriptive research was used as a research method.…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Language Tests
Chi, Youngshin – ProQuest LLC, 2011
This study investigated the breakdown effect of a listening comprehension test, whether test takers are affected in comprehending lectures by impediments, and collected test takers' cognitive awareness on test tasks which contain listening breakdown factors how they perceived these impediments. In this context of the study, a "Breakdown" is a test…
Descriptors: Generalizability Theory, Listening Comprehension, Intervals, Second Languages