ERIC - Search Results

Showing all 15 results

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

A Comparison of Polytomous Rasch Models for the Analysis of C-Tests

Peer reviewed

Dhyaaldian, Safa Mohammed Abdulridah; Kadhim, Qasim Khlaif; Mutlak, Dhameer A.; Neamah, Nour Raheem; Kareem, Zaidoon Hussein; Hamad, Doaa A.; Tuama, Jassim Hassan; Qasim, Mohammed Saad – International Journal of Language Testing, 2022

A C-Test is a gap-filling test for measuring language competence in the first and second language. C-Tests are usually analyzed with polytomous Rasch models by considering each passage as a super-item or testlet. This strategy helps overcome the local dependence inherent in C-Test gaps. However, there is little research on the best polytomous…

Descriptors: Item Response Theory, Cloze Procedure, Reading Tests, Language Tests

Technology-Enhanced Items in Grades 1-12 English Language Proficiency Assessments

Peer reviewed

Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022

Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to better interact with the test environment compared to traditional multiple-choice items (MCIs). The interactive nature of TEIs offers improved construct coverage compared with MCIs, but little research exists regarding students' performance on…

Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)

Effects of Removing Responses with Likely Random Guessing under Rasch Measurement on a Multiple-Choice Language Proficiency Test

Peer reviewed

Lin, Chih-Kai – Language Assessment Quarterly, 2018

With multiple options to choose from, there is always a chance of lucky guessing by examinees on multiple-choice (MC) items, thereby potentially introducing bias in item difficulty estimates. Correct responses by random guessing thus pose threats to the validity of claims made from test performance on an MC test. Under the Rasch framework, the…

Descriptors: Guessing (Tests), Item Response Theory, Multiple Choice Tests, Language Tests

Predicting the Difficulty of Exercise Items for Dynamic Difficulty Adaptation in Adaptive Language Tutoring

Peer reviewed

Pandarova, Irina; Schmidt, Torben; Hartig, Johannes; Boubekki, Ahcène; Jones, Roger Dale; Brefeld, Ulf – International Journal of Artificial Intelligence in Education, 2019

Advances in computer technology and artificial intelligence create opportunities for developing adaptive language learning technologies which are sensitive to individual learner characteristics. This paper focuses on one form of adaptivity in which the difficulty of learning content is dynamically adjusted to the learner's evolving language…

Descriptors: Intelligent Tutoring Systems, Difficulty Level, Cues, Second Language Learning

Towards Improved Assessment of L2 Collocation Knowledge

Peer reviewed

Lee, Senyung; Shin, Sun-Young – Language Assessment Quarterly, 2021

Multiple test tasks are available for assessing L2 collocation knowledge. However, few studies have investigated the characteristics of a variety of recognition and recall tasks of collocation simultaneously, and most research on L2 collocations has focused on verb-noun and adjective-noun collocations. This study investigates (1) the relative…

Descriptors: Phrase Structure, Second Language Learning, Language Tests, Recall (Psychology)

Topic and Background Knowledge Effects on Performance in Speaking Assessment

Peer reviewed

Khabbazbashi, Nahal – Language Testing, 2017

This study explores the extent to which topic and background knowledge of topic affect spoken performance in a high-stakes speaking test. It is argued that evidence of a substantial influence may introduce construct-irrelevant variance and undermine test fairness. Data were collected from 81 non-native speakers of English who performed on 10…

Descriptors: Speech Tests, High Stakes Tests, English (Second Language), Language Proficiency

The Role of Morphological Complexity in Predicting the Learnability of an Additional Language: The Case of La (Additional Language) Dutch

Peer reviewed

van der Slik, Frans; Hout, Roeland van; Schepens, Job – Second Language Research, 2019

Applied linguistics may benefit from a morphological complexity measure to get a better grip on language learning problems and to better understand what kind of typological differences between languages are more important than others in facilitating or impeding adult learning of an additional language. Using speaking proficiency scores of 9,000…

Descriptors: Indo European Languages, Morphology (Languages), Applied Linguistics, Language Classification

A Comparative Study of the Variables Used to Measure Syntactic Complexity and Accuracy in Task-Based Research

Peer reviewed

Inoue, Chihiro – Language Learning Journal, 2016

The constructs of complexity, accuracy and fluency (CAF) have been used extensively to investigate learner performance on second language tasks. However, a serious concern is that the variables used to measure these constructs are sometimes used conventionally without any empirical justification. It is crucial for researchers to understand how…

Descriptors: Comparative Analysis, Syntax, Accuracy, Task Analysis

Using a Mixture IRT Model to Understand English Learner Performance on Large-Scale Assessments

Shea, Christine A. – ProQuest LLC, 2013

The purpose of this study was to determine whether an eighth grade state-level math assessment contained items that function differentially (DIF) for English Learner students (EL) as compared to English Only students (EO) and if so, what factors might have caused DIF. To determine this, Differential Item Functioning (DIF) analysis was employed.…

Descriptors: Item Response Theory, English Language Learners, Grade 8, Mathematics Tests

The Development and Validation of a Spanish Elicited Imitation Test of Oral Language Proficiency for the Missionary Training Center

Thompson, Carrie A. – ProQuest LLC, 2013

The Missionary Training Center (MTC), affiliated with the Church of Jesus Christ of Latter-day Saints, needs a reliable and cost effective way to measure the oral language proficiency of missionaries learning Spanish. The MTC needed to measure incoming missionaries' Spanish language proficiency for training and classroom assignment as well as to…

Descriptors: Religious Cultural Groups, Second Language Learning, Second Language Instruction, Interviews

Assessing the Test Information Function and Differential Item Functioning for the "TOEFL Junior"® Standard Test. Research Report. ETS RR-13-17. "TOEFL Junior"® Research Report. TOEFL JR-01

Peer reviewed

Young, John W.; Morgan, Rick; Rybinski, Paul; Steinberg, Jonathan; Wang, Yuan – ETS Research Report Series, 2013

The "TOEFL Junior"® Standard Test is an assessment that measures the degree to which middle school-aged students learning English as a second language have attained proficiency in the academic and social English skills representative of English-medium instructional environments. The assessment measures skills in three areas: listening…

Descriptors: Item Response Theory, Test Items, Language Tests, Second Language Learning

A Study of Confidence and Accuracy Using the Rasch Modeling Procedures. Research Report. ETS RR-08-42

Peer reviewed

Paek, Insu; Lee, Jihyun; Stankov, Lazar; Wilson, Mark – ETS Research Report Series, 2008

This study investigated the relationship between students' actual performance (accuracy) and their subjective judgments of accuracy (confidence) on selected English language proficiency tests. The unidimensional and multidimensional IRT Rasch approaches were used to model the discrepancy between confidence and accuracy at the item and test level…

Descriptors: Self Esteem, Accuracy, Item Response Theory, English

Analyzing the Option Effects of Difficult TOEFL Items with Low Biserials: Methods Developed for Use by Test Assemblers.

Hicks, Marilyn M. – 1988

Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…

Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis
