Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 23 |
Since 2016 (last 10 years) | 58 |
Since 2006 (last 20 years) | 109 |
Author
Powers, Donald E. | 5 |
Xi, Xiaoming | 4 |
Attali, Yigal | 3 |
Bridgeman, Brent | 3 |
Cho, Yeonsuk | 3 |
Kyle, Kristopher | 3 |
Ling, Guangming | 3 |
Papageorgiou, Spiros | 3 |
Stricker, Lawrence J. | 3 |
Alderman, Donald L. | 2 |
Ayers, Jerry B. | 2 |
Education Level
Higher Education | 59 |
Postsecondary Education | 44 |
Secondary Education | 11 |
Elementary Education | 4 |
High Schools | 4 |
Junior High Schools | 4 |
Middle Schools | 4 |
Elementary Secondary Education | 2 |
Grade 12 | 2 |
Grade 7 | 2 |
Grade 8 | 2 |
Audience
Researchers | 2 |
Location
China | 11 |
Iran | 10 |
Canada | 9 |
Japan | 7 |
Taiwan | 4 |
United States | 4 |
South Korea | 3 |
Turkey | 3 |
Armenia | 2 |
Brazil | 2 |
California | 2 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018
This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…
Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy
Lee, Shinhye; Winke, Paula – Language Testing, 2018
We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…
Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Brown, Annie; Ducasse, Ana Maria – Language Assessment Quarterly, 2019
This study investigates the relationship between performances on the speaking component of the TOEFL iBT™ with performances on academic oral assessment tasks. For the academic tasks, we recorded and transcribed the performances of five local and five high-achieving international undergraduate students on oral assessment tasks in core first-year…
Descriptors: Comparative Analysis, Oral Language, Language Proficiency, High Achievement
Beigman Klebanov, Beata; Ramineni, Chaitanya; Kaufer, David; Yeoh, Paul; Ishizaki, Suguru – Language Testing, 2019
Essay writing is a common type of constructed-response task used frequently in standardized writing assessments. However, the impromptu timed nature of the essay writing tests has drawn increasing criticism for the lack of authenticity for real-world writing in classroom and workplace settings. The goal of this paper is to contribute evidence to a…
Descriptors: Test Validity, Writing Tests, Writing Skills, Persuasive Discourse
Baker, Beverly A.; Tsushima, Rika; Wang, Shujiao – Language Learning in Higher Education, 2014
There are increasing numbers of non-native English speaking applicants to Canadian universities (AUCC 2008a, 2010), which are committed to promoting linguistic and cultural diversity (AUCC 2008b). One result of this trend is that university admissions officers, as gatekeepers, are faced with a growing and potentially confusing array of language…
Descriptors: Admissions Officers, College Admission, Foreign Countries, English (Second Language)
O'Dwyer, John; Kantarcioglu, Elif; Thomas, Carole – ETS Research Report Series, 2018
This study reports on an investigation of the predictive validity of the TOEFL iBT® test in an English-medium institution (EMI) in a non-target-language context, namely, Turkey. The relationship between TOEFL iBT scores and academic performance was explored in a cohort of 286 undergraduate students, as was the TOEFL iBT's relationship with an…
Descriptors: Predictive Validity, Computer Assisted Testing, Grade Point Average, Language of Instruction
Huang, Heng-Tsung Danny; Hung, Shao-Ting Alan; Hong, He-Ting Vivian – Language Assessment Quarterly, 2016
This study explored the relationships among language proficiency, two selected test-taker characteristics (i.e., topical knowledge and anxiety), and integrated speaking test performance. Data collection capitalized on three sets of instruments: three integrated tasks derived from TOEFL-iBT preparation materials, the state anxiety inventory created…
Descriptors: Oral Language, Language Tests, Path Analysis, Test Anxiety
In'nami, Yo; Koizumi, Rie; Nakamura, Keita – Language Testing in Asia, 2016
Background: This study examined the factor structure of the Test of English for Academic Purposes (TEAP®) test--a recently developed academic English test measuring four skills among Japanese university applicants--and compared the structure to that of the Test of English as a Foreign Language Internet-based test (TOEFL iBT®), to investigate the…
Descriptors: English (Second Language), Language Tests, Second Language Learning, English for Academic Purposes
Farnsworth, Timothy L. – Language Assessment Quarterly, 2013
This study examined the construct validity of the TOEFL iBT Speaking subsection for the purposes of international teaching assistant (ITA) certification, a purpose for which it was not specifically designed. The factor structure of the new TOEFL was compared with that of another language performance test in use at a major American research…
Descriptors: Test Validity, Language Tests, English (Second Language), Second Language Learning
Iberri-Shea, Gina – Cogent Education, 2017
Prominent spoken language assessments such as the Oral Proficiency Interview and the Test of Spoken English have been primarily concerned with speaking ability as it relates to conversation. This paper looks at an additional aspect of spoken language ability, namely public speaking. This study used an adapted form of a public speaking rating scale…
Descriptors: Public Speaking, Rating Scales, Adoption (Ideas), English Instruction
Staples, Shelley; Biber, Douglas; Reppen, Randi – Modern Language Journal, 2018
One of the central considerations in the validity argument for the TOEFL iBT is the relationship between the language on the exam and the language required for university courses. Corpus linguistics has recently been shown to be an effective way to explore this relationship, which can also be considered as an aspect of authenticity. Applying…
Descriptors: Computational Linguistics, Computer Assisted Testing, English (Second Language), Language Tests
Kyle, Kristopher; Crossley, Scott A.; McNamara, Danielle S. – Language Testing, 2016
This study explores the construct validity of speaking tasks included in the TOEFL iBT (e.g., integrated and independent speaking tasks). Specifically, advanced natural language processing (NLP) tools, MANOVA difference statistics, and discriminant function analyses (DFA) are used to assess the degree to which and in what ways responses to these…
Descriptors: Construct Validity, Natural Language Processing, Speech Skills, Speech Acts
Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014
The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…
Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance
Tanabe, Masayuki – Reading in a Foreign Language, 2016
The present study addressed the role of speed as a factor in tests of second language (L2) vocabulary knowledge, presupposing that speed of performance is important in actual language use. Research questions were: (a) Do learners with a larger vocabulary size answer faster on an L2 vocabulary breadth test than smaller vocabulary sized learners?;…
Descriptors: Second Language Learning, Vocabulary Development, Vocabulary Skills, Alternative Assessment