ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	14
Since 2007 (last 20 years)	25

Descriptor

Difficulty Level	34
Language Tests	27
Foreign Countries	19
Second Language Learning	17
Test Items	17
English (Second Language)	16
Language Proficiency	9
Item Response Theory	8
Scores	8
College Students	7
Item Analysis	7
Reading Comprehension	7
Statistical Analysis	7
Comparative Analysis	6
Listening Comprehension Tests	6
Secondary School Students	5
Student Attitudes	5
Task Analysis	5
Test Reliability	5
Test Validity	5
Testing	5
Accuracy	4
Elementary School Students	4
Multiple Choice Tests	4
Native Language	4

Source

Language Testing

Publication Type

Journal Articles	34
Reports - Research	27
Tests/Questionnaires	6
Reports - Evaluative	4
Reports - Descriptive	2
Opinion Papers	1

Education Level

Higher Education	10
Postsecondary Education	7
Secondary Education	5
Elementary Education	4

Location

Japan	5
Australia	2
Europe	2
Russia	2
Turkey	2
Bulgaria	1
China	1
Germany	1
Hong Kong	1
Hungary	1
Iran (Tehran)	1
Kuwait	1
New York (Rochester)	1
Poland	1
Slovenia	1
South Korea	1
Ukraine	1
United Kingdom	1
Vietnam	1

Assessments and Surveys

Test of English as a Foreign…	5
International English…	1

Showing 1 to 15 of 34 results

Evaluating Methodological Enhancements to the Yes/No Angoff Standard-Setting Method in Language Proficiency Assessment

Peer reviewed

Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024

This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…

Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods

Operationalizing the Reading-into-Writing Construct in Analytic Rating Scales: Effects of Different Approaches on Rating

Peer reviewed

Lestari, Santi B.; Brunfaut, Tineke – Language Testing, 2023

Assessing integrated reading-into-writing task performances is known to be challenging, and analytic rating scales have been found to better facilitate the scoring of these performances than other common types of rating scales. However, little is known about how specific operationalizations of the reading-into-writing construct in analytic rating…

Descriptors: Reading Writing Relationship, Writing Tests, Rating Scales, Writing Processes

Hong Kong Secondary Students' Perspectives on Selecting Test Difficulty Level and Learner Washback: Effects of a Graded Approach to Assessment

Peer reviewed

Tsang, Chi Lai; Isaacs, Talia – Language Testing, 2022

This sequential mixed-methods study investigates washback on learning in a high-stakes school exit examination by examining learner perceptions and reported behaviours in relation to learners' beliefs and language learning experience, the role of other stakeholders in the washback mechanism, and socio-educational forces. The focus is the graded…

Descriptors: Foreign Countries, Secondary School Students, Student Attitudes, High Stakes Tests

Application of an Automated Essay Scoring Engine to English Writing Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…

Descriptors: Computer Assisted Testing, Essays, Scoring, Scores

The Effect of Response Order on Candidate Viewing Behaviour and Item Difficulty in a Multiple-Choice Listening Test

Peer reviewed

Holzknecht, Franz; McCray, Gareth; Eberharter, Kathrin; Kremmel, Benjamin; Zehentner, Matthias; Spiby, Richard; Dunlea, Jamie – Language Testing, 2021

Studies from various disciplines have reported that spatial location of options in relation to processing order impacts the ultimate choice of the option. A large number of studies have found a primacy effect, that is, the tendency to prefer the first option. In this paper we report on evidence that position of the key in four-option…

Descriptors: Language Tests, Test Items, Multiple Choice Tests, Listening Comprehension Tests

What Is the Best Predictor of Word Difficulty? A Case of Data Mining Using Random Forest

Peer reviewed

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Testing, 2024

Word frequency has a long history of being considered the most important predictor of word difficulty and has served as a guideline for several aspects of second language vocabulary teaching, learning, and assessment. However, recent empirical research has challenged the supremacy of frequency as a predictor of word difficulty. Accordingly,…

Descriptors: Word Frequency, Vocabulary Skills, Second Language Learning, Second Language Instruction

Understanding Test-Takers' Perceptions of Difficulty in EAP Vocabulary Tests: The Role of Experiential Factors

Peer reviewed

Oruç Ertürk, Nesrin; Mumford, Simon E. – Language Testing, 2017

This study, conducted by two researchers who were also multiple-choice question (MCQ) test item writers at a private English-medium university in an English as a foreign language (EFL) context, was designed to shed light on the factors that influence test-takers' perceptions of difficulty in English for academic purposes (EAP) vocabulary, with the…

Descriptors: English for Academic Purposes, Vocabulary, Language Tests, Difficulty Level

Is Anybody Listening? The Nature of Second Language Listening in Integrated Listening-to-Summarize Tasks

Peer reviewed

Rukthong, Anchana; Brunfaut, Tineke – Language Testing, 2020

Integrated test tasks, such as listening-to-speak or reading-to-write, are increasingly used in second language assessment despite relatively limited empirical insights into what they assess. Most research on integrated tasks has primarily focused on the productive skills involved; studies exploring the receptive skills mostly investigated tasks…

Descriptors: Listening Comprehension Tests, Recall (Psychology), Oral Language, Linguistic Input

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

An Analysis of "TOEFL® Primary™" Repeaters: How Much Score Change Occurs?

Peer reviewed

Cho, Yeonsuk; Blood, Ian A. – Language Testing, 2020

In this study, we examined how much change in "TOEFL® Primary™" listening and reading scores can be expected in relation to the time interval between test administrations. The test records of 5213 young learners of English (aged 8-13 years) in Japan and Turkey who repeated the tests were analyzed to examine test scores as a function of…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Validity of the American Sign Language Discrimination Test

Peer reviewed

Bochner, Joseph H.; Samar, Vincent J.; Hauser, Peter C.; Garrison, Wayne M.; Searls, J. Matt; Sanders, Cynthia A. – Language Testing, 2016

American Sign Language (ASL) is one of the most commonly taught languages in North America. Yet, few assessment instruments for ASL proficiency have been developed, none of which have adequately demonstrated validity. We propose that the American Sign Language Discrimination Test (ASL-DT), a recently developed measure of learners' ability to…

Descriptors: American Sign Language, Test Validity, Language Proficiency, Phonological Awareness

The Effect of Read-Aloud Assistance on the Text Comprehension of Dyslexic and Non-Dyslexic English Language Learners

Peer reviewed

Košak-Babuder, Milena; Kormos, Judit; Ratajczak, Michael; Pižorn, Karmen – Language Testing, 2019

One of the special arrangements in testing contexts is to allow dyslexic students to listen to the text while they read. In our study, we investigated the effect of read-aloud assistance on young English learners' language comprehension scores. We also examined whether students with dyslexia identification benefit from this assistance differently…

Descriptors: Dyslexia, Identification, Scores, English (Second Language)

Topic and Background Knowledge Effects on Performance in Speaking Assessment

Peer reviewed

Khabbazbashi, Nahal – Language Testing, 2017

This study explores the extent to which topic and background knowledge of topic affect spoken performance in a high-stakes speaking test. It is argued that evidence of a substantial influence may introduce construct-irrelevant variance and undermine test fairness. Data were collected from 81 non-native speakers of English who performed on 10…

Descriptors: Speech Tests, High Stakes Tests, English (Second Language), Language Proficiency

A Comparison of Three Test Formats to Assess Word Difficulty

Peer reviewed

Culligan, Brent – Language Testing, 2015

This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…

Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary

Determining Cloze Item Difficulty from Item and Passage Characteristics across Different Learner Backgrounds

Peer reviewed

Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017

Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…

Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis

Author

Brunfaut, Tineke	2
Cho, Yeonsuk	2
Perkins, Kyle	2
Al-Hamly, Mashael	1
Alderson, J. Charles	1
Batty, Aaron Olaf	1
Blood, Ian A.	1
Bochner, Joseph H.	1
Bond, Trevor	1
Brown, James Dean	1
Campfield, Dorota E.	1
Chan, Kinnie Kin Yee	1
Coombe, Christine	1
Culligan, Brent	1
David, Gergely	1
Dunlea, Jamie	1
Duyen Thi Bich Nguyen	1
Eberharter, Kathrin	1
Eckes, Thomas	1
Filipi, Anna	1
Fulcher, Glenn	1
Gao, Lingyun	1
Garrison, Wayne M.	1
Hauser, Peter C.	1