ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	11

Source

ETS Research Report Series	3
Language Testing	3
Language Assessment Quarterly	2
Educational Testing Service	1
InSight: A Journal of…	1
International Journal of…	1
Online Submission	1
TESL Canada Journal	1
Thought Currents in English…	1

Publication Type

Reports - Research	14
Journal Articles	12
Tests/Questionnaires	3
Reports - Descriptive	2
Reports - Evaluative	2
Dissertations/Theses -…	1
Information Analyses	1

Education Level

Higher Education	3
Elementary Education	2
Postsecondary Education	2
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Canada	1
Europe	1
France	1
Greece	1
Hungary (Budapest)	1
Japan	1
Japan (Tokyo)	1
Minnesota	1
South Korea	1
Vietnam	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	19
Test of English for…	2

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

The Impact of Using Synthetically Generated Listening Stimuli on Test-Taker Performance: A Case Study with Multiple-Choice, Single-Selection Items. TOEFL® Research Reports. RR-98. ETS?RR-22-05

Peer reviewed
PDF on ERIC

Download full text

Choi, Ikkyu; Zu, Jiyun – ETS Research Report Series, 2022

Synthetically generated speech (SGS) has become an integral part of our oral communication in a wide variety of contexts. It can be generated instantly at a low cost and allows precise control over multiple aspects of output, all of which can be highly appealing to second language (L2) assessment developers who have traditionally relied upon human…

Descriptors: Test Wiseness, Multiple Choice Tests, Test Items, Difficulty Level

Sustaining an Occupation-Specific Language Assessment for the Canadian Healthcare Field

Peer reviewed
PDF on ERIC

Download full text

Stewart, Gail; Strachan, Andrea – TESL Canada Journal, 2022

Since its implementation in 2004, the Canadian English Language Benchmark Assessment for Nurses (CELBAN) has been accepted as evidence of language ability for licensure of internationally educated nurses (IENs) in Canada. This article focuses on the complexities of sustaining an occupation-specific assessment over time. The authors reference the…

Descriptors: Language Tests, English for Special Purposes, Benchmarking, Nurses

Motivational Factors in Computer-Administered Integrated Skills Tasks: A Study of Young Learners

Peer reviewed

Direct link

Kormos, Judit; Brunfaut, Tineke; Michel, Marije – Language Assessment Quarterly, 2020

Previous studies examined the association between motivational characteristics and language learning achievement, but considerably less is known about young language learners' task-specific motivation in assessment contexts. Our study investigated the task motivation of young learners of English when completing computer-administered integrated…

Descriptors: Computer Assisted Testing, English (Second Language), Second Language Learning, Student Motivation

Question Preview in English for Academic Purposes Listening Assessment: The Effect of Stem Preview on Difficulty, Item Type, and Discrimination

Peer reviewed

Direct link

Yeager, Rebecca; Meyer, Zachary – International Journal of Listening, 2022

This study investigates the effects of adding stem preview to an English for Academic Purposes (EAP) multiple-choice listening assessment. In stem preview, listeners may view the item stems, but not response options, before listening. Previous research indicates that adding preview to an exam typically decreases difficulty, but raises concerns…

Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Teaching Methods

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Direct link

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

Making Better Tests with the Rasch Measurement Model

Peer reviewed
PDF on ERIC

Download full text

Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018

This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…

Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy

Age, Task Characteristics, and Acoustic Indicators of Engagement: Investigations into the Validity of a Technology-Enhanced Speaking Test for Young Language Learners

Download full text

Edward Paul Getman – Online Submission, 2020

Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement

Investigating the Effects of Prompt Characteristics on the Comparability of TOEFL iBT™ Integrated Writing Tasks

Peer reviewed

Direct link

Cho, Yeonsuk; Rijmen, Frank; Novák, Jakub – Language Testing, 2013

This study examined the influence of prompt characteristics on the averages of all scores given to test taker responses on the TOEFL iBT[TM] integrated Read-Listen-Write (RLW) writing tasks for multiple administrations from 2005 to 2009. In the context of TOEFL iBT RLW tasks, the prompt consists of a reading passage and a lecture. To understand…

Descriptors: English (Second Language), Language Tests, Writing Tests, Cues

The Role of Lexical Properties and Cohesive Devices in Text Integration and Their Effect on Human Ratings of Speaking Proficiency

Peer reviewed

Direct link

Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014

There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…

Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)

Assessing the Test Information Function and Differential Item Functioning for the "TOEFL Junior"® Standard Test. Research Report. ETS RR-13-17. "TOEFL Junior"® Research Report. TOEFL JR-01

Peer reviewed
PDF on ERIC

Download full text

Young, John W.; Morgan, Rick; Rybinski, Paul; Steinberg, Jonathan; Wang, Yuan – ETS Research Report Series, 2013

The "TOEFL Junior"® Standard Test is an assessment that measures the degree to which middle school-aged students learning English as a second language have attained proficiency in the academic and social English skills representative of English-medium instructional environments. The assessment measures skills in three areas: listening…

Descriptors: Item Response Theory, Test Items, Language Tests, Second Language Learning

Exploring Item Characteristics That Are Related to the Difficulty of TOEFL Dialogue Items. Research Reports. RR-79. RR-04-11

Download full text

Kostin, Irene – Educational Testing Service, 2004

The purpose of this study is to explore the relationship between a set of item characteristics and the difficulty of TOEFL[R] dialogue items. Identifying characteristics that are related to item difficulty has the potential to improve the efficiency of the item-writing process The study employed 365 TOEFL dialogue items, which were coded on 49…

Descriptors: Statistical Analysis, Difficulty Level, Language Tests, English (Second Language)

Analyzing the Option Effects of Difficult TOEFL Items with Low Biserials: Methods Developed for Use by Test Assemblers.

Download full text

Hicks, Marilyn M. – 1988

Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…

Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis

The Prediction of TOEFL Reading Comprehension Item Difficulty for Expository Prose Passages for Three Item Types: Main Idea, Inference, and Supporting Idea Items.

Download full text

Freedle, Roy; Kostin, Irene – 1993

Prediction of the difficulty (equated delta) of a large sample (n=213) of reading comprehension items from the Test of English as a Foreign Language (TOEFL) was studied using main idea, inference, and supporting statement items. A related purpose was to examine whether text and text-related variables play a significant role in predicting item…

Descriptors: Construct Validity, Difficulty Level, Multiple Choice Tests, Prediction

Predicting Item Difficulty in a Reading Comprehension Test with an Artificial Neural Network.

Peer reviewed

Perkins, Kyle; And Others – Language Testing, 1995

This article reports the results of using a three-layer back propagation artificial neural network to predict item difficulty in a reading comprehension test. Three classes of variables were examined: text structure, propositional analysis, and cognitive demand. Results demonstrate that the networks can consistently predict item difficulty. (JL)

Descriptors: Artificial Intelligence, Difficulty Level, English (Second Language), Language Tests

An Analysis of Factors Affecting the Difficulty of Dialogue Items in TOEFL Listening Comprehension. TOEFL Research Reports, 51.

Download full text

Nissan, Susan; And Others – 1996

One of the item types in the Listening Comprehension section of the Test of English as a Foreign Language (TOEFL) test is the dialogue. Because the dialogue item pool needs to have an appropriate balance of items at a range of difficulty levels, test developers have examined items at various difficulty levels in an attempt to identify their…

Descriptors: Classification, Dialogs (Language), Difficulty Level, English (Second Language)

Previous Page | Next Page »

Pages: 1 | 2

Difficulty Level	19
Test Items	19
English (Second Language)	17
Language Tests	17
Second Language Learning	14
Scores	8
Foreign Countries	7
Statistical Analysis	7
Item Analysis	6
Language Proficiency	6
Multiple Choice Tests	6
Test Format	6
Item Response Theory	4
Reading Tests	4
Test Construction	4
Computer Assisted Testing	3
Interrater Reliability	3
Reading Comprehension	3
Second Language Instruction	3
Student Attitudes	3
Test Reliability	3
Test Validity	3
Testing	3
Academic Discourse	2
College Students	2
More ▼

Henning, Grant	2
Kostin, Irene	2
Brunfaut, Tineke	1
Cho, Yeonsuk	1
Choi, Ikkyu	1
Clevinger, Amanda	1
Cohen, Andrew D.	1
Crossley, Scott	1
Edward Paul Getman	1
Freedle, Roy	1
Hicks, Marilyn M.	1
Karlin, Omar	1
Karlin, Sayaka	1
Kim, YouJin	1
Kormos, Judit	1
Meyer, Zachary	1
Michel, Marije	1
Morgan, Rick	1
Nissan, Susan	1
Novák, Jakub	1
Papageorgiou, Spiros	1
Perkins, Kyle	1
Powers, Donald	1
Rijmen, Frank	1
More ▼