Ching-Ni Hsieh – ETS Research Report Series, 2024
The TOEFL Junior® tests are designed to evaluate young language students' English reading, listening, speaking, and writing skills in an English-medium secondary instructional context. This paper articulates a validity argument constructed to support the use and interpretation of the TOEFL Junior test scores for the purpose of placement, progress…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Peiyu Wang; Liying Cheng – Critical Inquiry in Language Studies, 2025
This study employed a multi-methods design to investigate the impact of preparation on Chinese test-takers' perceptions of the integrated TOEFL iBT speaking and writing design. Combining results from over 1700 surveys and 10 interviews, it was found that these Chinese test-takers, who are the most vulnerable group in the multimillion testing…
Descriptors: Foreign Countries, Second Language Learning, English (Second Language), Language Tests
Kermad, Alyssa; Bogorevich, Valeria – Language Teaching Research Quarterly, 2022
The practice of second language (L2) speech perception has traditionally relied on equal-interval perceptual scales and novice listeners' (NLs) impressionistic judgments of constructs such as accentedness and comprehensibility (Munro & Derwing, 2011). However, issues have surfaced with respect to how well NLs can use these scales, whether they…
Descriptors: Speech Communication, Second Language Learning, Intelligibility, Rating Scales
Huang, Becky H.; Bailey, Alison L.; Sass, Daniel A.; Shawn Chang, Yung-hsiang – Language Testing, 2021
Given the increasing emphasis of communicative competence in English as a foreign language (EFL) contexts and the lack of validation research on speaking assessments for adolescent EFL learners, in the current study we examined the validity of the TOEFL Junior® speaking test, a relatively new speaking assessment developed by Educational Testing…
Descriptors: Test Validity, Language Tests, English (Second Language), Second Language Learning
Ahmadi Shirazi, Masoumeh – SAGE Open, 2019
Threats to construct validity should be reduced to a minimum. If true, sources of bias, namely raters, items, tests as well as gender, age, race, language background, culture, and socio-economic status need to be spotted and removed. This study investigates raters' experience, language background, and the choice of essay prompt as potential…
Descriptors: Foreign Countries, Language Tests, Test Bias, Essay Tests
Toker, Deniz – TESL-EJ, 2019
The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Galikyan, Irena; Madyarov, Irshat; Gasparyan, Rubina – ETS Research Report Series, 2019
The broad range of English language teaching and learning contexts present in the world today necessitates high quality assessment instruments that can provide reliable and meaningful information about learners' English proficiency levels to relevant stakeholders. The "TOEFL Junior"® tests were recently introduced by Educational Testing…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Student Attitudes
Hanifehzadeh, Sepeedeh; Farahzad, Farzaneh – International Journal of Language Testing, 2016
The present study was designed basically to develop a psycho-motor mechanism scale based on the theory of translation competence proposed by PACTE (2003), and then to assess the validity and reliability of the constructed scale. In this quantitative research, after designing the scale, two translation tasks were given to 90 M.A. students majoring…
Descriptors: Translation, Language Tests, Test Construction, Test Reliability
In'nami, Yo; Koizumi, Rie; Nakamura, Keita – Language Testing in Asia, 2016
Background: This study examined the factor structure of the Test of English for Academic Purposes (TEAP®) test--a recently developed academic English test measuring four skills among Japanese university applicants--and compared the structure to that of the Test of English as a Foreign Language Internet-based test (TOEFL iBT®), to investigate the…
Descriptors: English (Second Language), Language Tests, Second Language Learning, English for Academic Purposes
Kyle, Kristopher; Crossley, Scott A.; McNamara, Danielle S. – Language Testing, 2016
This study explores the construct validity of speaking tasks included in the TOEFL iBT (e.g., integrated and independent speaking tasks). Specifically, advanced natural language processing (NLP) tools, MANOVA difference statistics, and discriminant function analyses (DFA) are used to assess the degree to which and in what ways responses to these…
Descriptors: Construct Validity, Natural Language Processing, Speech Skills, Speech Acts
Zahedi, Keivan; Shamsaee, Saeedeh – Educational Assessment, Evaluation and Accountability, 2012
The aim of the present research is to examine the viability of the construct validity of the speaking modules of two internationally recognized language proficiency examinations, namely IELTS and TOEFL iBT. High-stakes standardized tests play a crucial and decisive role in determining the future academic life of many people. Overall obtained scores…
Descriptors: Foreign Countries, Construct Validity, Language Tests, English (Second Language)
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score. The four trait scores are word choice, grammatical…
Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)
Farnsworth, Timothy L. – Language Assessment Quarterly, 2013
This study examined the construct validity of the TOEFL iBT Speaking subsection for the purposes of international teaching assistant (ITA) certification, a purpose for which it was not specifically designed. The factor structure of the new TOEFL was compared with that of another language performance test in use at a major American research…
Descriptors: Test Validity, Language Tests, English (Second Language), Second Language Learning
Jang, Eunice Eunhee; Roussos, Louis – International Journal of Testing, 2009
In this article we present results of a Differential Item Functioning (DIF) study using Shealy and Stout's (1993) multidimensionality-based DIF analysis framework. In this framework, differences in test score distributions across different groups of examinees may be a result of multidimensionality if secondary dimensions (not the primary dimension…
Descriptors: Test Bias, Vocabulary, English (Second Language), Scores
Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009
This report evaluates the construct coverage of the e-rater® scoring engine. The matter of construct coverage depends on whether one defines writing skill in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…
Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests