ERIC - Search Results

Publication Date

In 2026	0
Since 2025	7
Since 2022 (last 5 years)	34
Since 2017 (last 10 years)	66
Since 2007 (last 20 years)	103

Descriptor

Difficulty Level	137
Language Tests	137
Test Items	137
English (Second Language)	80
Second Language Learning	65
Foreign Countries	62
Item Response Theory	36
Language Proficiency	36
Item Analysis	35
Test Construction	34
Second Language Instruction	32
Scores	28
Test Format	28
Comparative Analysis	27
Reading Tests	23
Test Validity	22
Multiple Choice Tests	21
Test Reliability	21
Statistical Analysis	19
Reading Comprehension	16
Computer Assisted Testing	15
Listening Comprehension Tests	15
Mathematics Tests	15
College Students	14
Cloze Procedure	13
More ▼

Publication Type

Reports - Research	113
Journal Articles	99
Speeches/Meeting Papers	16
Reports - Evaluative	11
Tests/Questionnaires	10
Dissertations/Theses -…	6
Information Analyses	4
Reports - Descriptive	4
Numerical/Quantitative Data	2
Books	1
Collected Works - General	1
Guides - Classroom - Teacher	1
Multilingual/Bilingual…	1
More ▼

Education Level

Higher Education	32
Postsecondary Education	27
Secondary Education	18
Elementary Education	14
High Schools	7
Middle Schools	6
Elementary Secondary Education	5
Junior High Schools	5
Grade 7	3
Grade 8	3
Intermediate Grades	3
Early Childhood Education	2
Grade 3	2
Grade 9	2
Primary Education	2
Grade 10	1
Grade 4	1
Grade 5	1
Grade 6	1
More ▼

Audience

Researchers	3
Practitioners	2
Teachers	2
Students	1

Location

Japan	8
Iran	7
Germany	6
China	4
South Korea	4
Europe	3
Russia	3
Turkey	3
United Kingdom	3
Alabama	2
Belgium	2
California	2
Canada	2
Indonesia	2
Thailand	2
Ukraine	2
Vietnam	2
Arizona	1
Arkansas	1
Australia	1
Austria	1
Bulgaria	1
China (Beijing)	1
Connecticut	1
European Union	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	18
International English…	4
Test of English for…	3
Michigan Test of English…	2
ACT Assessment	1
Alabama High School…	1
English Proficiency Test	1
Expressive One Word Picture…	1
Measures of Academic Progress	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 137 results Save | Export

Revisiting the Lexical Differences between Academic and General Training IELTS Reading Tests

Peer reviewed

Direct link

Linh Thi Thao Le; Nam Thi Phuong Ho; Nguyen Huynh Trang; Hung Tan Ha – SAGE Open, 2025

The International English Language Testing System (IELTS) has served as one of the most reliable proofs of people's English language proficiency. There have been rumors about the discrepancy in difficulty between the two modules of IELTS, namely Academic (AC) and General Training (GT); however, there is little empirical evidence to confirm such a…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Reading Tests

Evaluating Methodological Enhancements to the Yes/No Angoff Standard-Setting Method in Language Proficiency Assessment

Peer reviewed

Direct link

Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024

This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…

Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed
PDF on ERIC

Download full text

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

What Makes a "Killer Question" Killer? A Text Mining Analysis of High-Difficulty Questions in the Korean CSAT English Section

Peer reviewed
PDF on ERIC

Download full text

Jeong-eun Kim – English Teaching, 2025

This study investigated the thematic and lexical characteristics of high-difficulty English reading items--commonly referred to as "killer questions"--in the Korean College Scholastic Ability Test (CSAT) between 2018 and 2025. Using text mining methods, including Latent Dirichlet Allocation (LDA) and CEFR-based lexical profiling, the…

Descriptors: English (Second Language), Difficulty Level, Test Items, Questioning Techniques

Argument-Based Validation of Chulalongkorn University Language Institute (CULI) Test: A Rasch-Based Evidence Investigation

Peer reviewed

Direct link

Apichat Khamboonruang – Language Testing in Asia, 2025

Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…

Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests

The Features of Plausible but Incorrect Options: Distractor Plausibility in Synonym-Based Vocabulary Tests

Peer reviewed

Direct link

Ludewig, Ulrich; Schwerter, Jakob; McElvany, Nele – Journal of Psychoeducational Assessment, 2023

A better understanding of how distractor features influence the plausibility of distractors is essential for an efficient multiple-choice (MC) item construction in educational assessment. The plausibility of distractors has a major influence on the psychometric characteristics of MC items. Our analysis utilizes the nominal categories model to…

Descriptors: Vocabulary, Language Tests, German, Grade 4

Grammatical Complexity as a Predictor of Difficulty of Grammar Items in an English Test

Peer reviewed
PDF on ERIC

Download full text

Thirakunkovit, Suthathip; Rhee, Seongha – THAITESOL Journal, 2021

This study explores the extent to which the difficulty levels of grammar items in an English test can be predicted by the complexity of grammatical structures. The researchers carried out two sets of analyses. In the first analysis, the item facility and item discrimination indices of 175 multiple-choice items were examined. In the second…

Descriptors: Grammar, Test Items, Difficulty Level, English (Second Language)

Idea-Sharing Crafting Item Difficulty in TOEFL iBT Listening Tests

Peer reviewed
PDF on ERIC

Download full text

Alan Shaw – PASAA: Journal of Language Teaching and Learning in Thailand, 2023

Although the TOEFL iBT Listening test is sometimes used for other purposes, it was designed primarily for use as a college entrance examination. Item difficulty in TOEFL iBT Listening tests is the product of interactions between two sets of complex relationships: 1) relationships among numerous item characteristics themselves, and 2) relationships…

Descriptors: English (Second Language), Second Language Instruction, Listening Skills, Language Tests

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Testing Academic Language Proficiency: Comparing the TOEFL iBT® Test and the Duolingo English Test. TOEFL® Research Series. RR-104. ETS Research Report. RR-25-01

Peer reviewed
PDF on ERIC

Download full text

Sara T. Cushing – ETS Research Report Series, 2025

This report provides an in-depth comparison of TOEFL iBT® and the Duolingo English Test (DET) in terms of the degree to which both tests assess academic language proficiency in listening, reading, writing, and speaking. The analysis is based on publicly available documentation on both tests, including sample test questions available on the test…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Academic Language

The Experimental Study of the Effect of Functional-Variational Factors on the Results of Linguistic Testing

Peer reviewed
PDF on ERIC

Download full text

Hryvko, Antonina V.; Zhuk, Yurii O. – Journal of Curriculum and Teaching, 2022

A feature of the presented study is a comprehensive approach to studying the reliability problem of linguistic testing results due to the several functional and variable factors impact. Contradictions and ambiguous views of scientists on the researched issues determine the relevance of this study. The article highlights the problem of equivalence…

Descriptors: Student Evaluation, Language Tests, Test Format, Test Items

The Effect of Phonological Overlap on English and Spanish Expressive Vocabulary

Peer reviewed

Direct link

Tibbits, Nicole; Lancaster, Hope Sparks; de Diego-Lázaroc, Beatriz – Language, Speech, and Hearing Services in Schools, 2023

Purpose: This study examined the effect of phonological overlap on English and Spanish expressive vocabulary accuracy as measured by the bilingual Expressive One-Word Picture Vocabulary Test--Fourth Edition (EOWPVT-IV). We hypothesized that if languages interact during an expressive vocabulary task, then higher phonological overlap will predict…

Descriptors: Phonology, English, Spanish, Bilingual Students

A Comparison of Polytomous Rasch Models for the Analysis of C-Tests

Peer reviewed
PDF on ERIC

Download full text

Dhyaaldian, Safa Mohammed Abdulridah; Kadhim, Qasim Khlaif; Mutlak, Dhameer A.; Neamah, Nour Raheem; Kareem, Zaidoon Hussein; Hamad, Doaa A.; Tuama, Jassim Hassan; Qasim, Mohammed Saad – International Journal of Language Testing, 2022

A C-Test is a gap-filling test for measuring language competence in the first and second language. C-Tests are usually analyzed with polytomous Rasch models by considering each passage as a super-item or testlet. This strategy helps overcome the local dependence inherent in C-Test gaps. However, there is little research on the best polytomous…

Descriptors: Item Response Theory, Cloze Procedure, Reading Tests, Language Tests

Psychometric Evaluation of Dictations with the Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Hussein, Rasha Abed; Sabit, Shaker Holh; Alwan, Merriam Ghadhanfar; Wafqan, Hussam Mohammed; Baqer, Abeer Ameen; Ali, Muneam Hussein; Hachim, Safa K.; Sahi, Zahraa Tariq; AlSalami, Huda Takleef; Sulaiman, Bahaa Aldin Fawzi – International Journal of Language Testing, 2022

Dictation is a traditional technique for both teaching and testing overall language ability and listening comprehension. In a dictation, a passage is read aloud by the teacher and examinees write down what they hear. Due to the peculiar form of dictations, psychometric analysis of dictations is challenging. In a dictation, there is no clear…

Descriptors: Psychometrics, Verbal Communication, Teaching Methods, Language Skills

The Impact of Using Synthetically Generated Listening Stimuli on Test-Taker Performance: A Case Study with Multiple-Choice, Single-Selection Items. TOEFL® Research Reports. RR-98. ETS?RR-22-05

Peer reviewed
PDF on ERIC

Download full text

Choi, Ikkyu; Zu, Jiyun – ETS Research Report Series, 2022

Synthetically generated speech (SGS) has become an integral part of our oral communication in a wide variety of contexts. It can be generated instantly at a low cost and allows precise control over multiple aspects of output, all of which can be highly appealing to second language (L2) assessment developers who have traditionally relied upon human…

Descriptors: Test Wiseness, Multiple Choice Tests, Test Items, Difficulty Level

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Language Testing	15
Language Assessment Quarterly	11
ETS Research Report Series	8
International Journal of…	7
Online Submission	5
ProQuest LLC	5
SAGE Open	5
Language Testing in Asia	4
Educational and Psychological…	3
Partnership for Assessment of…	3
International Journal of…	2
International Journal of…	2
Applied Measurement in…	1
College Entrance Examination…	1
Early Education and…	1
Education and Information…	1
Educational Assessment	1
Educational Assessment,…	1
Educational Measurement:…	1
Educational Sciences: Theory…	1
Educational Testing Service	1
English Language Teaching	1
English Language Teaching…	1
English Teaching	1
Foreign Language Annals	1
More ▼

Perkins, Kyle	4
Huntley, Renee M.	3
Baghaei, Purya	2
Carlson, James E.	2
Henning, Grant	2
Khoshdel, Fahimeh	2
Papageorgiou, Spiros	2
Abdel-fattah, Abdel-fattah A.	1
Abdellah, Antar Solhy	1
Abraham, Roberta G.	1
Aesaert, Koen	1
Akase, Masaki	1
Al Khateeb, Nashaat Sultan…	1
Al-Jarf, Reima	1
AlSalami, Huda Takleef	1
Alallo, Hajir Mahmood Ibrahim	1
Alan Shaw	1
Alderson, J. Charles	1
Alghurabi, Ammar Muhi Khleel	1
Ali Zahabi	1
Ali, Muneam Hussein	1
Ali, Usama	1
Ali, Yusra Mohammed	1
Alwan, Merriam Ghadhanfar	1
More ▼