ERIC - Search Results

Showing 1 to 15 of 17 results

The Effect of Response Order on Candidate Viewing Behaviour and Item Difficulty in a Multiple-Choice Listening Test

Peer reviewed

Holzknecht, Franz; McCray, Gareth; Eberharter, Kathrin; Kremmel, Benjamin; Zehentner, Matthias; Spiby, Richard; Dunlea, Jamie – Language Testing, 2021

Studies from various disciplines have reported that spatial location of options in relation to processing order impacts the ultimate choice of the option. A large number of studies have found a primacy effect, that is, the tendency to prefer the first option. In this paper we report on evidence that position of the key in four-option…

Descriptors: Language Tests, Test Items, Multiple Choice Tests, Listening Comprehension Tests

What Is the Best Predictor of Word Difficulty? A Case of Data Mining Using Random Forest

Peer reviewed

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Testing, 2024

Word frequency has a long history of being considered the most important predictor of word difficulty and has served as a guideline for several aspects of second language vocabulary teaching, learning, and assessment. However, recent empirical research has challenged the supremacy of frequency as a predictor of word difficulty. Accordingly,…

Descriptors: Word Frequency, Vocabulary Skills, Second Language Learning, Second Language Instruction

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

Is Anybody Listening? The Nature of Second Language Listening in Integrated Listening-to-Summarize Tasks

Peer reviewed

Rukthong, Anchana; Brunfaut, Tineke – Language Testing, 2020

Integrated test tasks, such as listening-to-speak or reading-to-write, are increasingly used in second language assessment despite relatively limited empirical insights into what they assess. Most research on integrated tasks has primarily focused on the productive skills involved; studies exploring the receptive skills mostly investigated tasks…

Descriptors: Listening Comprehension Tests, Recall (Psychology), Oral Language, Linguistic Input

An Analysis of "TOEFL® Primary™" Repeaters: How Much Score Change Occurs?

Peer reviewed

Cho, Yeonsuk; Blood, Ian A. – Language Testing, 2020

In this study, we examined how much change in "TOEFL® Primary™" listening and reading scores can be expected in relation to the time interval between test administrations. The test records of 5213 young learners of English (aged 8-13 years) in Japan and Turkey who repeated the tests were analyzed to examine test scores as a function of…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

The Effect of Read-Aloud Assistance on the Text Comprehension of Dyslexic and Non-Dyslexic English Language Learners

Peer reviewed

Košak-Babuder, Milena; Kormos, Judit; Ratajczak, Michael; Pižorn, Karmen – Language Testing, 2019

One of the special arrangements in testing contexts is to allow dyslexic students to listen to the text while they read. In our study, we investigated the effect of read-aloud assistance on young English learners' language comprehension scores. We also examined whether students with dyslexia identification benefit from this assistance differently…

Descriptors: Dyslexia, Identification, Scores, English (Second Language)

Topic and Background Knowledge Effects on Performance in Speaking Assessment

Peer reviewed

Khabbazbashi, Nahal – Language Testing, 2017

This study explores the extent to which topic and background knowledge of topic affect spoken performance in a high-stakes speaking test. It is argued that evidence of a substantial influence may introduce construct-irrelevant variance and undermine test fairness. Data were collected from 81 non-native speakers of English who performed on 10…

Descriptors: Speech Tests, High Stakes Tests, English (Second Language), Language Proficiency

Determining Cloze Item Difficulty from Item and Passage Characteristics across Different Learner Backgrounds

Peer reviewed

Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017

Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…

Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis

Lexical Difficulty--Using Elicited Imitation to Study Child L2

Peer reviewed

Campfield, Dorota E. – Language Testing, 2017

This paper reports a post-hoc analysis of the influence of lexical difficulty of cue sentences on performance in an elicited imitation (EI) task to assess oral production skills for 645 child L2 English learners in instructional settings. This formed part of a large-scale investigation into effectiveness of foreign language teaching in Polish…

Descriptors: Difficulty Level, Second Language Learning, Second Language Instruction, Elementary School Students

Self-Assessment of Japanese as a Second Language: The Role of Experiences in the Naturalistic Acquisition

Peer reviewed

Suzuki, Yuichi – Language Testing, 2015

Self-assessment has been used to assess second language proficiency; however, as sources of measurement errors vary, they may threaten the validity and reliability of the tools. The present paper investigated the role of experiences in using Japanese as a second language in the naturalistic acquisition context on the accuracy of the…

Descriptors: Self Evaluation (Individuals), Error of Measurement, Japanese, Second Language Learning

Rating Written Performance: What Do Raters Do and Why?

Peer reviewed

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2014

This study investigates the relationship in L2 writing between raters' judgments of communicative adequacy and linguistic complexity by means of six-point Likert scales, and general measures of linguistic performance. The participants were 39 learners of Italian and 32 of Dutch, who wrote two short argumentative essays. The same writing tasks…

Descriptors: Writing Evaluation, Second Language Learning, Evaluators, Native Language

Use of Tree-Based Regression in the Analyses of L2 Reading Test Items

Peer reviewed

Gao, Lingyun; Rogers, W. Todd – Language Testing, 2011

The purpose of this study was to explore whether the results of Tree Based Regression (TBR) analyses, informed by a validated cognitive model, would enhance the interpretation of item difficulties in terms of the cognitive processes involved in answering the reading items included in two forms of the Michigan English Language Assessment Battery…

Descriptors: Test Items, Reading Tests, Item Analysis, Reading Processes

Do Questions Written in the Target Language Make Foreign Language Listening Comprehension Tests More Difficult?

Peer reviewed

Filipi, Anna – Language Testing, 2012

The Assessment of Language Competence (ALC) certificates is an annual, international testing program developed by the Australian Council for Educational Research to test the listening and reading comprehension skills of lower to middle year levels of secondary school. The tests are developed for three levels in French, German, Italian and…

Descriptors: Listening Comprehension Tests, Item Response Theory, Statistical Analysis, Foreign Countries

Investigating the Performance of Alternative Types of Grammar Items

Peer reviewed

David, Gergely – Language Testing, 2007

Some educational contexts almost mandate the application of multiple-choice (MC) testing techniques, even if they are deplored by many practitioners in the field. In such contexts especially, research into how well these types of item perform and how their performance may be characterised is both appropriate and desirable. The focus of this paper…

Descriptors: Student Evaluation, Grammar, Language Tests, Test Items

Predicting Item Difficulty in a Reading Comprehension Test with an Artificial Neural Network

Peer reviewed

Perkins, Kyle; And Others – Language Testing, 1995

This article reports the results of using a three-layer back propagation artificial neural network to predict item difficulty in a reading comprehension test. Three classes of variables were examined: text structure, propositional analysis, and cognitive demand. Results demonstrate that the networks can consistently predict item difficulty. (JL)

Descriptors: Artificial Intelligence, Difficulty Level, English (Second Language), Language Tests
