NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 42 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Ghaemi, Hamed – Language Testing in Asia, 2022
Listening comprehension in English, as one of the most fundamental skills, has an essential role in the process of learning English. Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) which determines the one-dimensionality and scalability of test. Mokken scaling techniques are a useful tool for…
Descriptors: Second Language Learning, English (Second Language), Nonparametric Statistics, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lee, Shinhye – ETS Research Report Series, 2022
In response to the calls for making key stakeholders' perspectives relevant in the test validation process, the study discussed in this report sought test-taker feedback as part of collecting validity evidence and supporting the ongoing field testing efforts of the new "TOEFL ITP"® Speaking section. Specifically, I aimed to investigate…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Test Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Madyarov, Irshat; Movsisyan, Vahe; Madoyan, Habet; Galikyan, Irena; Gasparyan, Rubina – ETS Research Report Series, 2021
The "TOEFL Junior"® Standard test is a tool for measuring the English language skills of students ages 11+ who learn English as an additional language. It is a paper-based multiple-choice test and measures proficiency in three sections: listening, form and meaning, and reading. To date, empirical evidence provides some support for the…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Standardized Tests
Tavarez Da Costa, Pedro; Reyes Arias, Fransheska – Online Submission, 2021
The present work seeks to establish a comparison between two different and distant evaluation tools applied to the Dominican student population in order to measure the efficiency of our educational system in the recent years, one of them measured the quality of Dominican education in three areas (the PISA Test), whereas the other tested the…
Descriptors: Foreign Countries, Standardized Tests, Student Evaluation, International Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Khaled Barkaoui – Canadian Modern Language Review, 2024
Many English language proficiency (ELP) tests used for university admissions and placement now include integrated writing tasks that require examinees to use external sources when writing. Integrated writing tasks improve test authenticity and impact, but they raise several validity questions, such as what academic language skills they engage and…
Descriptors: Language Proficiency, Language Tests, English for Academic Purposes, Second Language Learning
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ariamanesh, Ali A.; Barati, Hossein; Youhanaee, Manijeh – International TESOL Journal, 2022
The present study investigated the speaking module of TOEFL iBT with an emphasis on the dichotomy of independent and integrated tasks. The potential differences between the two speaking conditions were intended to be explored based on the oral performance elicited from a group of Iranian test takers. To collect the required data, a simulated…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Liu, Yanmei; Zheng, Binghan – Interpreter and Translator Trainer, 2022
This study investigates the comparability of three parallel translation tasks selected from a College English Test Band-6 (CET-6) and explores the major linguistic features contributing to translation difficulty. Data obtained from the participants' subjective rating, eye-tracking, and performance evaluation were triangulated to measure the…
Descriptors: Language Tests, Translation, Difficulty Level, Language Processing
Peer reviewed Peer reviewed
Direct linkDirect link
Lim, Hyojung – Language Testing in Asia, 2020
The current study aims to explore the cognitive validity of the iBT TOEFL reading test by investigating test takers' eye movements on individual items. It is assumed that successful test takers would adopt the intended reading processes, the same types and levels of cognitive processes that they would use for real-world reading tasks. Forty-seven…
Descriptors: Test Validity, High Stakes Tests, Second Language Learning, Language Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Davis, Larry; Norris, John – ETS Research Report Series, 2021
The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…
Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Huang, Becky H.; Bailey, Alison L.; Sass, Daniel A.; Shawn Chang, Yung-hsiang – Language Testing, 2021
Given the increasing emphasis of communicative competence in English as a foreign language (EFL) contexts and the lack of validation research on speaking assessments for adolescent EFL learners, in the current study we examined the validity of the TOEFL Junior® speaking test, a relatively new speaking assessment developed by Educational Testing…
Descriptors: Test Validity, Language Tests, English (Second Language), Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018
In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…
Descriptors: Test Validity, Test Use, Test Construction, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Lim, Hyojung – Language Testing in Asia, 2019
Background: This study aims to empirically answer the question of whether the role of sub-reading skills changes depending on the test format (e.g., multiple-choice vs. open-ended reading questions). The test format effect also addresses the issue of test validity--whether the reading test properly elicits construct-relevant reading skills or…
Descriptors: Foreign Countries, Test Format, Language Tests, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Ling, Guangming – Language Assessment Quarterly, 2017
To investigate whether the type of keyboard used in exams introduces any construct-irrelevant variance to the TOEFL iBT Writing scores, we surveyed 17,040 TOEFL iBT examinees from 24 countries on their keyboard-related perceptions and preferences and analyzed the survey responses together with their test scores. Results suggest that controlling…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Writing Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Papageorgiou, Spiros; Wu, Sha; Hsieh, Ching-Ni; Tannenbaum, Richard J.; Cheng, Mengmeng – ETS Research Report Series, 2019
The past decade has seen an emerging interest in mapping (aligning or linking) test scores to language proficiency levels of external performance scales or frameworks, such as the Common European Framework of Reference (CEFR), as well as locally developed frameworks, such as China's Standards of English Language Ability (CSE). Such alignment is…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018
This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…
Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy
Previous Page | Next Page »
Pages: 1  |  2  |  3