Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 19 |
| Since 2017 (last 10 years) | 36 |
| Since 2007 (last 20 years) | 65 |
Descriptor
Source
| Language Testing | 93 |
Author
| Batty, Aaron Olaf | 3 |
| Brown, James Dean | 3 |
| Shin, Sun-Young | 3 |
| Boldt, Robert F. | 2 |
| Chapelle, Carol A. | 2 |
| Chung, Yoo-Ree | 2 |
| Eberharter, Kathrin | 2 |
| Emma Marsden | 2 |
| Janssen, Gerriet | 2 |
| Kremmel, Benjamin | 2 |
| Lee, Senyung | 2 |
| More ▼ | |
Publication Type
| Journal Articles | 93 |
| Reports - Research | 74 |
| Reports - Evaluative | 12 |
| Tests/Questionnaires | 5 |
| Reports - Descriptive | 4 |
| Information Analyses | 2 |
| Opinion Papers | 2 |
| Numerical/Quantitative Data | 1 |
Education Level
Audience
| Researchers | 1 |
| Teachers | 1 |
Location
| Japan | 11 |
| China | 4 |
| Russia | 3 |
| Australia | 2 |
| Canada | 2 |
| Europe | 2 |
| Germany | 2 |
| South Korea | 2 |
| Austria | 1 |
| Bulgaria | 1 |
| China (Guangzhou) | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| Lau v Nichols | 1 |
| No Child Left Behind Act 2001 | 1 |
| Race to the Top | 1 |
Assessments and Surveys
| Test of English as a Foreign… | 17 |
| Test of English for… | 3 |
| Florida Comprehensive… | 1 |
| International English… | 1 |
| Test of Written English | 1 |
What Works Clearinghouse Rating
Peer reviewedOltman, Phillip K.; Stricker, Lawrence J. – Language Testing, 1990
A recent multidimensional scaling analysis of the Test of English-as-a-Foreign-Language (TOEFL) item response data identified clusters of items in the test sections that, being more homogeneous than their parent sections, might be better for diagnostic use. The analysis was repeated using different scoring techniques. Results diverged only for…
Descriptors: English (Second Language), Item Analysis, Language Tests, Scaling
Peer reviewedZumbo, Bruno D. – Language Testing, 2003
Based on the observation that scale-level methods are sometimes exclusively used to investigate measurement invariance for test translation, describes results of a simulation study investigating whether item-level differential item functioning (DIF) manifests itself in scale-level analyses such as single and multigroup factor analyses and per…
Descriptors: Factor Analysis, Item Analysis, Language Tests, Second Language Learning
Peer reviewedKim, Mikyung – Language Testing, 2001
Investigates differential item functioning (DIF) across two different broad language groupings, Asian and European, in a speaking test in which the test takers' responses were rated polytomously. Data were collected from 1038 nonnative speakers of English from France, Hong Kong, Japan, Spain, Switzerland, and Thailand who took the SPEAK test in…
Descriptors: English (Second Language), Foreign Countries, Item Analysis, Language Tests
Peer reviewedAlderson, J. Charles; Percsich, Richard; Szabo, Gabor – Language Testing, 2000
Reports on the potential problems in scoring responses to sequencing tests, the development of a computer program to overcome these difficulties, and an exploration of the value of scoring procedures. (Author/VWL)
Descriptors: Computer Software, Foreign Countries, Item Analysis, Language Tests
Song, Min-Young – Language Testing, 2008
This paper concerns the divisibility of comprehension subskills measured in L2 listening and reading tests. Motivated by the administration of the new Web-based English as a Second Language Placement Exam (WB-ESLPE) at UCLA, this study addresses the following research questions: first, to what extent do the WB-ESLPE listening and reading items…
Descriptors: Structural Equation Models, Second Language Learning, Reading Tests, Inferences
Peer reviewedBrown, James Dean – Language Testing, 1999
Explored the relative contributions to Test of English as a Foreign Language (TOEFL) score dependability of various numbers of persons, items, subtests, languages, and their various interactions. Sampled 15,000 test takers, 1000 each from 15 different language backgrounds. (Author/VWL)
Descriptors: English (Second Language), Language Tests, Second Language Learning, Student Characteristics
David, Gergely – Language Testing, 2007
Some educational contexts almost mandate the application of multiple-choice (MC) testing techniques, even if they are deplored by many practitioners in the field. In such contexts especially, research into how well these types of item perform and how their performance may be characterised is both appropriate and desirable. The focus of this paper…
Descriptors: Student Evaluation, Grammar, Language Tests, Test Items
Cohen, Andrew D.; Upton, Thomas A. – Language Testing, 2007
This study describes the reading and test-taking strategies that test takers used on the "Reading" section of the "LanguEdge Courseware" (2002) materials developed to familiarize prospective respondents with the new TOEFL. The investigation focused on strategies used to respond to more traditional "single selection"…
Descriptors: Courseware, Language Tests, Test Wiseness, Language Teachers
Peer reviewedSireci, Stephen G.; Allalouf, Avi – Language Testing, 2003
Describes a statistical method for evaluating the translation equivalence of language test items that are scored dichotomously. Provides an illustration of the method to a portion of the verbal subtest of the Psychometric Entrance Test, which is a large-scale postsecondary admissions test used in Israel. (VWL)
Descriptors: College Entrance Examinations, Foreign Countries, Language Tests, Second Language Learning
Peer reviewedTakala, Sauli; Kaftandjieva, Felianka – Language Testing, 2000
Analyzes gender-uniform differential item functioning (DIF) in a second language vocabulary test with the tools of item response theory to study potential gender impact on the test performance measured by different item composites. Results show that while there are test items with indications of DIF in favor of either females or males, the test as…
Descriptors: English (Second Language), Foreign Countries, Item Analysis, Language Tests
Peer reviewedBoldt, Robert F. – Language Testing, 1989
Attempts to identify latent variables affecting the item responses of the diverse language groups taking the Test of English As a Foreign Language indicated that latent group effects were small. Results support equating with item response theory and suggest the use of a restrictive assumption of proportionality of item response curves. (Author/CB)
Descriptors: English (Second Language), Item Response Theory, Language Proficiency, Language Tests
Peer reviewedGinther, April – Language Testing, 2002
A nested cross-over design was used to examine the effects of visual condition, type of stimuli, and language proficiency on listening comprehension items of the Test of English as a Foreign Language. Three two-way interactions were significant: proficiency by type of stimuli, type of stimuli by visual condition, and type of stimuli by time.…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Listening Comprehension
Peer reviewedHenning, Grant – Language Testing, 1988
Violations of item unidimensionality on language tests produced distorted estimates of person ability, and violations of person unidimensionality produced distorted estimates of item difficulty. The Bejar Method was sensitive to such distortions. (Author)
Descriptors: Construct Validity, Content Validity, Difficulty Level, Item Analysis
Abbott, Marilyn L. – Language Testing, 2007
In this article, I describe a practical application of the Roussos and Stout (1996) multidimensional analysis framework for interpreting group performance differences on an ESL reading proficiency test. Although a variety of statistical methods have been developed for flagging test items that function differentially for equal ability examinees…
Descriptors: Test Bias, Test Items, English (Second Language), Second Language Learning
Peer reviewedGriffin, Patrick E.; And Others – Language Testing, 1988
Discusses the development of an interview test of English proficiency in the 0 to 1+ range on the Australian Second Language Proficiency Rating Scale. Items were written toward 29 specified objectives using specially developed algorithms. A sample set of algorithms used in one of the tests and 23 references are appended. (Author/LMO)
Descriptors: English (Second Language), Interviews, Language Proficiency, Language Tests

Direct link
