ERIC - Search Results

Showing 1 to 15 of 17 results

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

Technology-Enhanced Items in Grades 1-12 English Language Proficiency Assessments

Peer reviewed

Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022

Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to better interact with the test environment compared to traditional multiple-choice items (MCIs). The interactive nature of TEIs offers improved construct coverage compared with MCIs but little research exists regarding students' performance on…

Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)

Examining the Effects of Different English Speech Varieties on an L2 Academic Listening Comprehension Test at the Item Level

Peer reviewed

Shin, Sun-Young; Lee, Senyung; Lidster, Ryan – Language Testing, 2021

In this study we investigated the potential for a shared-first-language (shared-L1) effect on second language (L2) listening test scores using differential item functioning (DIF) analyses. We did this in order to understand how accented speech may influence performance at the item level, while controlling for key variables including listening…

Descriptors: Listening Comprehension Tests, Language Tests, Native Language, Scores

IRT-Based Classification Analysis of an English Language Reading Proficiency Subtest

Peer reviewed

Kaya, Elif; O'Grady, Stefan; Kalender, Ilker – Language Testing, 2022

Language proficiency testing serves an important function of classifying examinees into different categories of ability. However, misclassification is to some extent inevitable and may have important consequences for stakeholders. Recent research suggests that classification efficacy may be enhanced substantially using computerized adaptive…

Descriptors: Item Response Theory, Test Items, Language Tests, Classification

Scenario-Based Language Assessment: Developing a Language Assessment Literacy Test for Indonesian Teachers of English as a Foreign Language

Agustinus Hardi Prasetyo – ProQuest LLC, 2023

Studies have shown that language assessment literacy (LAL) is important for language teachers since they make important classroom decisions to improve student learning based on their assessment. However, some studies have shown that teachers need more knowledge and skills in assessment. Teachers also seem unconfident in assessing their students…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Second Language Instruction

Towards Improved Assessment of L2 Collocation Knowledge

Peer reviewed

Lee, Senyung; Shin, Sun-Young – Language Assessment Quarterly, 2021

Multiple test tasks are available for assessing L2 collocation knowledge. However, few studies have investigated the characteristics of a variety of recognition and recall tasks of collocation simultaneously, and most research on L2 collocations has focused on verb-noun and adjective-noun collocations. This study investigates (1) the relative…

Descriptors: Phrase Structure, Second Language Learning, Language Tests, Recall (Psychology)

A Comparison of Three Test Formats to Assess Word Difficulty

Peer reviewed

Culligan, Brent – Language Testing, 2015

This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…

Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary

A Comparison of Video- and Audio-Mediated Listening Tests with Many-Facet Rasch Modeling and Differential Distractor Functioning

Peer reviewed

Batty, Aaron Olaf – Language Testing, 2015

The rise in the affordability of quality video production equipment has resulted in increased interest in video-mediated tests of foreign language listening comprehension. Although research on such tests has continued fairly steadily since the early 1980s, studies have relied on analyses of raw scores, despite the growing prevalence of item…

Descriptors: Listening Comprehension Tests, Comparative Analysis, Video Technology, Audio Equipment

Applying Item Response Theory Methods to Examine the Impact of Different Response Formats

Peer reviewed

Hohensinn, Christine; Kubinger, Klaus D. – Educational and Psychological Measurement, 2011

In aptitude and achievement tests, different response formats are usually used. A fundamental distinction must be made between the class of multiple-choice formats and the constructed response formats. Previous studies have examined the impact of different response formats applying traditional statistical approaches, but these influences can also…

Descriptors: Item Response Theory, Multiple Choice Tests, Responses, Test Format

Data Collection Design for Equivalent Groups Equating: Using a Matrix Stratification Framework for Mixed-Format Assessment

Mbella, Kinge Keka – ProQuest LLC, 2012

Mixed-format assessments are increasingly being used in large scale standardized assessments to measure a continuum of skills ranging from basic recall to higher order thinking skills. These assessments are usually comprised of a combination of (a) multiple-choice items which can be efficiently scored, have stable psychometric properties, and…

Descriptors: Educational Assessment, Test Format, Evaluation Methods, Multiple Choice Tests

Causes of Gender DIF on an EFL Language Test: A Multiple-Data Analysis over Nine Years

Peer reviewed

Pae, Tae-Il – Language Testing, 2012

This study tracked gender differential item functioning (DIF) on the English subtest of the Korean College Scholastic Aptitude Test (KCSAT) over a nine-year period across three data points, using both the Mantel-Haenszel (MH) and item response theory likelihood ratio (IRT-LR) procedures. Further, the study identified two factors (i.e. reading…

Descriptors: Aptitude Tests, Academic Aptitude, Language Tests, Test Items

Do Questions Written in the Target Language Make Foreign Language Listening Comprehension Tests More Difficult?

Peer reviewed

Filipi, Anna – Language Testing, 2012

The Assessment of Language Competence (ALC) certificates is an annual, international testing program developed by the Australian Council for Educational Research to test the listening and reading comprehension skills of lower to middle year levels of secondary school. The tests are developed for three levels in French, German, Italian and…

Descriptors: Listening Comprehension Tests, Item Response Theory, Statistical Analysis, Foreign Countries

The Impact of Equating Method and Format Representation of Common Items on the Adequacy of Mixed-Format Test Equating Using Nonequivalent Groups

Hagge, Sarah Lynn – ProQuest LLC, 2010

Mixed-format tests containing both multiple-choice and constructed-response items are widely used on educational tests. Such tests combine the broad content coverage and efficient scoring of multiple-choice items with the assessment of higher-order thinking skills thought to be provided by constructed-response items. However, the combination of…

Descriptors: Test Format, True Scores, Equated Scores, Psychometrics

Standard Error Estimation of 3PL IRT True Score Equating with an MCMC Method

Peer reviewed

Liu, Yuming; Schulz, E. Matthew; Yu, Lei – Journal of Educational and Behavioral Statistics, 2008

A Markov chain Monte Carlo (MCMC) method and a bootstrap method were compared in the estimation of standard errors of item response theory (IRT) true score equating. Three test form relationships were examined: parallel, tau-equivalent, and congeneric. Data were simulated based on Reading Comprehension and Vocabulary tests of the Iowa Tests of…

Descriptors: Reading Comprehension, Test Format, Markov Processes, Educational Testing

An Exploratory Study of Characteristics Related to IRT Item Parameter Invariance with the Test of English as a Foreign Language. TOEFL Technical Report.

Way, Walter D.; And Others – 1992

This study provided an exploratory investigation of item features that might contribute to a lack of invariance of item parameters for the Test of English as a Foreign Language (TOEFL). Data came from seven forms of the TOEFL administered in 1989. Subjective and quantitative measures developed for the study provided consistent information related…

Descriptors: Ability, English (Second Language), Goodness of Fit, Item Response Theory
