Showing 1 to 15 of 45 results
Peer reviewed
PDF on ERIC Download full text
Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025
This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…
Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests
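The classical coefficient at the center of this comparison, Cronbach's alpha, can be computed directly from a persons-by-items score matrix. A minimal Python sketch (illustrative only, not the authors' code; the toy data are invented):

```python
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """Cronbach's alpha for an (n_persons, n_items) score matrix."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)      # per-item variances
    total_var = items.sum(axis=1).var(ddof=1)  # variance of total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Toy data: 6 persons x 4 items scored 0/1
scores = np.array([
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 1, 1, 1],
    [1, 0, 1, 1],
    [0, 0, 0, 1],
    [1, 1, 1, 1],
])
print(round(cronbach_alpha(scores), 3))
```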
Peer reviewed
Direct link
Lestari, Santi B.; Brunfaut, Tineke – Language Testing, 2023
Assessing integrated reading-into-writing task performances is known to be challenging, and analytic rating scales have been found to better facilitate the scoring of these performances than other common types of rating scales. However, little is known about how specific operationalizations of the reading-into-writing construct in analytic rating…
Descriptors: Reading Writing Relationship, Writing Tests, Rating Scales, Writing Processes
Peer reviewed
PDF on ERIC Download full text
Dhyaaldian, Safa Mohammed Abdulridah; Kadhim, Qasim Khlaif; Mutlak, Dhameer A.; Neamah, Nour Raheem; Kareem, Zaidoon Hussein; Hamad, Doaa A.; Tuama, Jassim Hassan; Qasim, Mohammed Saad – International Journal of Language Testing, 2022
A C-Test is a gap-filling test for measuring language competence in a first or second language. C-Tests are usually analyzed with polytomous Rasch models by considering each passage as a super-item or testlet. This strategy helps overcome the local dependence inherent in C-Test gaps. However, there is little research on the best polytomous…
Descriptors: Item Response Theory, Cloze Procedure, Reading Tests, Language Tests
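For context, the partial credit model commonly used for such super-items gives each passage score x = 0…m a probability built from cumulative step difficulties. A minimal sketch (the step values are illustrative assumptions, not estimates from the study):

```python
import numpy as np

def pcm_probs(theta: float, deltas: np.ndarray) -> np.ndarray:
    """Partial credit model category probabilities for one testlet/super-item.

    deltas: step difficulties delta_1..delta_m; categories are 0..m.
    P(X = x) is proportional to exp(sum_{j<=x} (theta - delta_j)),
    with the empty sum for x = 0 defined as 0.
    """
    cum = np.concatenate(([0.0], np.cumsum(theta - deltas)))
    expnum = np.exp(cum - cum.max())  # subtract max for numerical stability
    return expnum / expnum.sum()

# A 5-gap C-Test passage treated as one polytomous super-item (scores 0..5):
steps = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])  # illustrative step difficulties
print(pcm_probs(theta=0.3, deltas=steps).round(3))
```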
Peer reviewed
PDF on ERIC Download full text
Hussein, Rasha Abed; Sabit, Shaker Holh; Alwan, Merriam Ghadhanfar; Wafqan, Hussam Mohammed; Baqer, Abeer Ameen; Ali, Muneam Hussein; Hachim, Safa K.; Sahi, Zahraa Tariq; AlSalami, Huda Takleef; Sulaiman, Bahaa Aldin Fawzi – International Journal of Language Testing, 2022
Dictation is a traditional technique for both teaching and testing overall language ability and listening comprehension. In a dictation, a passage is read aloud by the teacher and examinees write down what they hear. Because of their peculiar form, dictations are challenging to analyze psychometrically. In a dictation, there is no clear…
Descriptors: Psychometrics, Verbal Communication, Teaching Methods, Language Skills
Peer reviewed
PDF on ERIC Download full text
Choi, Ikkyu; Zu, Jiyun – ETS Research Report Series, 2022
Synthetically generated speech (SGS) has become an integral part of our oral communication in a wide variety of contexts. It can be generated instantly at a low cost and allows precise control over multiple aspects of output, all of which can be highly appealing to second language (L2) assessment developers who have traditionally relied upon human…
Descriptors: Test Wiseness, Multiple Choice Tests, Test Items, Difficulty Level
Peer reviewed
Direct link
Christiansen, Andrés; Janssen, Rianne – Educational Assessment, Evaluation and Accountability, 2021
In contrast with the assumptions made in standard measurement models used in large-scale assessments, students' performance may change during the test administration. This change can be modeled as a function of item position in case of a test booklet design with item-order manipulations. The present study used an explanatory item response theory…
Descriptors: Foreign Countries, Surveys, Measures (Individuals), Language Skills
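One simple way to formalize an item-position effect of the kind this study models is to let effective difficulty drift linearly with booklet position inside a Rasch model. A hedged sketch (the linear form and all numbers are illustrative assumptions, not the study's estimates):

```python
import numpy as np

def p_correct(theta, b, gamma, position):
    """Rasch probability with a linear item-position effect on difficulty.

    Effective difficulty grows by gamma per booklet position, so a positive
    gamma models declining performance over the course of the test.
    """
    return 1.0 / (1.0 + np.exp(-(theta - (b + gamma * position))))

# The same item placed early vs. late in a booklet:
for pos in (1, 20, 40):
    print(pos, round(p_correct(theta=0.0, b=0.0, gamma=0.01, position=pos), 3))
```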
Peer reviewed
PDF on ERIC Download full text
Huu Thanh Minh Nguyen; Nguyen Van Anh Le – TESL-EJ, 2024
Comparing language tests and test preparation materials holds important implications for the latter's validity and reliability. However, few studies have compared such materials across a wide range of indices. Therefore, this study investigated the text complexity of IELTS academic reading tests (IRT) and IELTS reading practice tests (IRPrT).…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Readability
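Text-complexity comparisons of this kind typically rest on readability indices. A small sketch of how a few standard indices can be computed, assuming the third-party `textstat` package (the passage text is invented, and the study's actual index set is not shown in this snippet):

```python
# pip install textstat
import textstat

passage = ("The committee reviewed the proposal at length. "
           "Its recommendations were adopted without amendment.")

# A few of the many text-complexity indices such a comparison might draw on:
print("Flesch Reading Ease: ", textstat.flesch_reading_ease(passage))
print("Flesch-Kincaid Grade:", textstat.flesch_kincaid_grade(passage))
print("SMOG Index:          ", textstat.smog_index(passage))
```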
He, Wei – NWEA, 2022
To ensure that student academic growth in a subject area is accurately captured, it is imperative that the underlying scale remains stable over time. As item parameter stability constitutes one of the factors that affects scale stability, NWEA® periodically conducts studies to check for the stability of the item parameter estimates for MAP®…
Descriptors: Achievement Tests, Test Items, Test Reliability, Academic Achievement
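A common way to check item parameter stability between calibrations is to place the two sets of difficulty estimates on a common scale and flag large displacements. A minimal sketch (the 0.3-logit threshold and all values are illustrative assumptions, not NWEA's actual procedure):

```python
import numpy as np

def flag_drift(b_old: np.ndarray, b_new: np.ndarray, threshold: float = 0.3):
    """Flag items whose Rasch difficulty shifted between calibrations.

    Estimates are first placed on a common scale by removing the mean
    difference; items displaced more than `threshold` logits are flagged.
    """
    shift = (b_new - b_old) - (b_new - b_old).mean()  # center out scale drift
    return np.where(np.abs(shift) > threshold)[0]

b_2019 = np.array([-1.2, 0.0, 0.4, 1.1, 2.0])
b_2022 = np.array([-1.1, 0.1, 0.9, 1.2, 2.0])
print(flag_drift(b_2019, b_2022))  # -> [2]
```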
Peer reviewed
Direct link
Akase, Masaki – Language Testing in Asia, 2022
The purpose of this study is to equate and further validate three forms of the vocabulary size test (VST) created by Aizawa and Mochizuki (2010). These three forms, VST 1, 2, and 3, were administered to a cohort of 189 high school students ranging in age from 16 to 18 in April of their 1st, 2nd, and 3rd year of high school. Although these…
Descriptors: Vocabulary Development, Vocabulary Skills, Language Tests, Longitudinal Studies
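Equating multiple forms, as done here for VST 1, 2, and 3, can be illustrated with mean-sigma linear equating, one standard approach (the summary statistics below are invented; the study's actual equating design is not shown in this snippet):

```python
def linear_equate(x, mu_x, sd_x, mu_y, sd_y):
    """Mean-sigma linear equating: map a Form X score onto the Form Y scale."""
    return mu_y + (sd_y / sd_x) * (x - mu_x)

# Illustrative summary statistics for two vocabulary-size test forms:
print(linear_equate(x=62, mu_x=60.0, sd_x=8.0, mu_y=58.0, sd_y=9.0))  # 60.25
```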
Peer reviewed
PDF on ERIC Download full text
Omarov, Nazarbek Bakytbekovich; Mohammed, Aisha; Alghurabi, Ammar Muhi Khleel; Alallo, Hajir Mahmood Ibrahim; Ali, Yusra Mohammed; Hassan, Aalaa Yaseen; Demeuova, Lyazat; Viktorovna, Shvedova Irina; Nazym, Bekenova; Al Khateeb, Nashaat Sultan Afif – International Journal of Language Testing, 2023
The Multiple-choice (MC) item format is commonly used in educational assessments due to its economy and effectiveness across a variety of content domains. However, numerous studies have examined the quality of MC items in high-stakes and higher-education assessments and found many flawed items, especially in terms of distractors. These faulty…
Descriptors: Test Items, Multiple Choice Tests, Item Response Theory, English (Second Language)
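Distractor quality of the kind this study examines is often screened with per-option endorsement rates and option-total correlations. A minimal sketch on simulated data (illustrative only, not the authors' procedure):

```python
import numpy as np

def distractor_analysis(choices: np.ndarray, key: int, total: np.ndarray):
    """Per-option endorsement rates and option-total correlations for one item.

    A functioning distractor is chosen by some examinees and correlates
    negatively with total score; near-zero endorsement or a positive
    correlation flags a faulty option.
    """
    for opt in np.unique(choices):
        picked = (choices == opt).astype(float)
        r = np.corrcoef(picked, total)[0, 1]
        tag = "key" if opt == key else "distractor"
        print(f"option {opt} ({tag}): p = {picked.mean():.2f}, r = {r:+.2f}")

rng = np.random.default_rng(0)
total = rng.normal(50, 10, 200)
# Higher-ability examinees pick the key (option 0) more often:
choices = np.where(total + rng.normal(0, 10, 200) > 50, 0, rng.integers(1, 4, 200))
distractor_analysis(choices, key=0, total=total)
```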
Peer reviewed
Direct link
Geramipour, Masoud – Language Testing in Asia, 2021
Rasch testlet and bifactor models are two measurement models that could deal with local item dependency (LID) in assessing the dimensionality of reading comprehension testlets. This study aimed to apply the measurement models to real item response data of the Iranian EFL reading comprehension tests and compare the validity of the bifactor models…
Descriptors: Foreign Countries, Second Language Learning, English (Second Language), Reading Tests
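A standard diagnostic for the local item dependency (LID) these models address is Yen's Q3, the correlation matrix of Rasch residuals. A minimal sketch on simulated data (not the study's analysis; the ability and difficulty values are invented):

```python
import numpy as np

def q3_matrix(responses, theta, b):
    """Yen's Q3: item-by-item correlations of Rasch residuals; pairs with
    Q3 well above the average off-diagonal value suggest local dependence."""
    p = 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))  # model P(correct)
    resid = responses - p                                     # person-item residuals
    return np.corrcoef(resid, rowvar=False)

# Simulated locally independent data; real testlet data would show elevated
# Q3 within passages:
rng = np.random.default_rng(1)
theta = rng.normal(size=500)
b = np.array([-0.5, 0.0, 0.5, 1.0])
p = 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))
responses = (rng.random((500, 4)) < p).astype(float)
print(q3_matrix(responses, theta, b).round(2))
```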
Peer reviewed
Direct link
Yunjiu, Luo; Wei, Wei; Zheng, Ying – SAGE Open, 2022
Artificial intelligence (AI) technologies have the potential to reduce the workload of second language (L2) teachers and test developers. We propose two AI distractor-generating methods for creating Chinese vocabulary items: semantic similarity and visual similarity. Semantic similarity refers to antonyms and synonyms, while visual similarity…
Descriptors: Chinese, Vocabulary Development, Artificial Intelligence, Undergraduate Students
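The semantic-similarity idea can be illustrated by ranking candidate distractors by cosine similarity to the target word in an embedding space. A toy sketch with invented vectors (a real system would load pretrained Chinese embeddings instead; nothing here reproduces the authors' method):

```python
import numpy as np

# Hypothetical word vectors; in distributional embedding spaces, synonyms and
# antonyms tend to sit near the target, while unrelated words sit far away.
vecs = {
    "快乐": np.array([0.90, 0.10, 0.00]),  # "happy" (target)
    "高兴": np.array([0.85, 0.15, 0.10]),  # "glad"  (near-synonym)
    "悲伤": np.array([0.70, 0.20, 0.05]),  # "sad"   (antonym, still nearby)
    "桌子": np.array([0.00, 0.90, 0.40]),  # "table" (unrelated)
}

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

target = "快乐"
ranked = sorted((w for w in vecs if w != target),
                key=lambda w: cosine(vecs[target], vecs[w]), reverse=True)
print(ranked)  # synonym and antonym rank above the unrelated word
```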
Peer reviewed
Direct link
Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022
Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to better interact with the test environment compared to traditional multiple-choice items (MCIs). The interactive nature of TEIs offers improved construct coverage compared with MCIs, but little research exists regarding students' performance on…
Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)
Peer reviewed
Direct link
Lin, Chih-Kai – Language Assessment Quarterly, 2018
With multiple options to choose from, there is always a chance of lucky guessing by examinees on multiple-choice (MC) items, thereby potentially introducing bias in item difficulty estimates. Correct responses by random guessing thus pose threats to the validity of claims made from test performance on an MC test. Under the Rasch framework, the…
Descriptors: Guessing (Tests), Item Response Theory, Multiple Choice Tests, Language Tests
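The guessing bias described here is easy to see by contrasting the Rasch model, which has no lower asymptote, with a 3PL-style curve whose asymptote c reflects chance success on a four-option item (an illustrative contrast, not the correction the article proposes):

```python
import numpy as np

def p_3pl(theta, b, a=1.0, c=0.25):
    """3PL probability: a lower asymptote c captures lucky guessing on a
    4-option MC item, which the plain Rasch model (a = 1, c = 0) ignores."""
    return c + (1 - c) / (1 + np.exp(-a * (theta - b)))

# A low-ability examinee on a hard item: Rasch vs. guessing-adjusted
theta, b = -2.0, 1.0
print(round(p_3pl(theta, b, c=0.0), 3))   # Rasch: 0.047
print(round(p_3pl(theta, b, c=0.25), 3))  # 3PL:   0.286
```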
Peer reviewed
PDF on ERIC Download full text
Kalkan, Ömür Kaya; Kara, Yusuf; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2018
Missing data are a common problem in datasets obtained from the administration of educational and psychological tests. It is widely known that the existence of missing observations can lead to serious problems such as biased parameter estimates and inflated standard errors. Most missing data imputation methods focus on…
Descriptors: Item Response Theory, Statistical Analysis, Data, Test Items
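Two simple baseline treatments of missing responses, scoring them as incorrect versus item-mean imputation, already behave differently and can bias downstream IRT estimates in different ways. A minimal sketch (illustrative baselines only, not the methods the study evaluates):

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.integers(0, 2, size=(8, 5)).astype(float)
X[rng.random(X.shape) < 0.2] = np.nan  # ~20% missing responses

# Two simple treatments whose consequences for calibration differ:
as_incorrect = np.nan_to_num(X, nan=0.0)            # missing scored as wrong
col_means = np.nanmean(X, axis=0)                   # per-item p-values
mean_imputed = np.where(np.isnan(X), col_means, X)  # item-mean imputation
print(as_incorrect.sum(), mean_imputed.sum().round(2))
```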