Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025
This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…
Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Dhyaaldian, Safa Mohammed Abdulridah; Kadhim, Qasim Khlaif; Mutlak, Dhameer A.; Neamah, Nour Raheem; Kareem, Zaidoon Hussein; Hamad, Doaa A.; Tuama, Jassim Hassan; Qasim, Mohammed Saad – International Journal of Language Testing, 2022
A C-Test is a gap-filling test for measuring language competence in the first and second language. C-Tests are usually analyzed with polytomous Rasch models by considering each passage as a super-item or testlet. This strategy helps overcome the local dependence inherent in C-Test gaps. However, there is little research on the best polytomous…
Descriptors: Item Response Theory, Cloze Procedure, Reading Tests, Language Tests
Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022
Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to better interact with the test environment compared to traditional multiple-choice items (MCIs). The interactive nature of TEIs offers improved construct coverage compared with MCIs, but little research exists regarding students' performance on…
Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)
Lin, Chih-Kai – Language Assessment Quarterly, 2018
With multiple options to choose from, there is always a chance of lucky guessing by examinees on multiple-choice (MC) items, thereby potentially introducing bias in item difficulty estimates. Correct responses by random guessing thus pose threats to the validity of claims made from test performance on an MC test. Under the Rasch framework, the…
Descriptors: Guessing (Tests), Item Response Theory, Multiple Choice Tests, Language Tests
Pandarova, Irina; Schmidt, Torben; Hartig, Johannes; Boubekki, Ahcène; Jones, Roger Dale; Brefeld, Ulf – International Journal of Artificial Intelligence in Education, 2019
Advances in computer technology and artificial intelligence create opportunities for developing adaptive language learning technologies which are sensitive to individual learner characteristics. This paper focuses on one form of adaptivity in which the difficulty of learning content is dynamically adjusted to the learner's evolving language…
Descriptors: Intelligent Tutoring Systems, Difficulty Level, Cues, Second Language Learning
Lee, Senyung; Shin, Sun-Young – Language Assessment Quarterly, 2021
Multiple test tasks are available for assessing L2 collocation knowledge. However, few studies have investigated the characteristics of a variety of recognition and recall tasks of collocation simultaneously, and most research on L2 collocations has focused on verb-noun and adjective-noun collocations. This study investigates (1) the relative…
Descriptors: Phrase Structure, Second Language Learning, Language Tests, Recall (Psychology)
Khabbazbashi, Nahal – Language Testing, 2017
This study explores the extent to which topic and background knowledge of topic affect spoken performance in a high-stakes speaking test. It is argued that evidence of a substantial influence may introduce construct-irrelevant variance and undermine test fairness. Data were collected from 81 non-native speakers of English who performed on 10…
Descriptors: Speech Tests, High Stakes Tests, English (Second Language), Language Proficiency
van der Slik, Frans; Hout, Roeland van; Schepens, Job – Second Language Research, 2019
Applied linguistics may benefit from a morphological complexity measure to get a better grip on language learning problems and to better understand what kind of typological differences between languages are more important than others in facilitating or impeding adult learning of an additional language. Using speaking proficiency scores of 9,000…
Descriptors: Indo European Languages, Morphology (Languages), Applied Linguistics, Language Classification
Inoue, Chihiro – Language Learning Journal, 2016
The constructs of complexity, accuracy and fluency (CAF) have been used extensively to investigate learner performance on second language tasks. However, a serious concern is that the variables used to measure these constructs are sometimes used conventionally without any empirical justification. It is crucial for researchers to understand how…
Descriptors: Comparative Analysis, Syntax, Accuracy, Task Analysis
Shea, Christine A. – ProQuest LLC, 2013
The purpose of this study was to determine whether an eighth grade state-level math assessment contained items that function differentially (DIF) for English Learner students (EL) as compared to English Only students (EO) and if so, what factors might have caused DIF. To determine this, Differential Item Functioning (DIF) analysis was employed.…
Descriptors: Item Response Theory, English Language Learners, Grade 8, Mathematics Tests
Thompson, Carrie A. – ProQuest LLC, 2013
The Missionary Training Center (MTC), affiliated with the Church of Jesus Christ of Latter-day Saints, needs a reliable and cost-effective way to measure the oral language proficiency of missionaries learning Spanish. The MTC needed to measure incoming missionaries' Spanish language proficiency for training and classroom assignment as well as to…
Descriptors: Religious Cultural Groups, Second Language Learning, Second Language Instruction, Interviews
Young, John W.; Morgan, Rick; Rybinski, Paul; Steinberg, Jonathan; Wang, Yuan – ETS Research Report Series, 2013
The "TOEFL Junior"® Standard Test is an assessment that measures the degree to which middle school-aged students learning English as a second language have attained proficiency in the academic and social English skills representative of English-medium instructional environments. The assessment measures skills in three areas: listening…
Descriptors: Item Response Theory, Test Items, Language Tests, Second Language Learning
Paek, Insu; Lee, Jihyun; Stankov, Lazar; Wilson, Mark – ETS Research Report Series, 2008
This study investigated the relationship between students' actual performance (accuracy) and their subjective judgments of accuracy (confidence) on selected English language proficiency tests. The unidimensional and multidimensional IRT Rasch approaches were used to model the discrepancy between confidence and accuracy at the item and test level…
Descriptors: Self Esteem, Accuracy, Item Response Theory, English
Hicks, Marilyn M. – 1988
Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…
Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis