Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publicly available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020
This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…
Descriptors: Holistic Approach, Classification, Grading, Language Tests
Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023
Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies
Aryadoust, Vahid; Foo, Stacy; Ng, Li Ying – Language Testing, 2022
The aim of this study was to investigate how test methods affect listening test takers' performance and cognitive load. Test methods were defined and operationalized as while-listening performance (WLP) and post-listening performance (PLP) formats. To achieve the goal of the study, we examined test takers' (N = 80) brain activity patterns…
Descriptors: Listening Comprehension Tests, Language Tests, Eye Movements, Brain Hemisphere Functions
Longabach, Tanya; Peyton, Vicki – Language Testing, 2018
K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…
Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency
Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis
Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016
Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…
Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size
Culligan, Brent – Language Testing, 2015
This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…
Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary
LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017
Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…
Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences
Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017
Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…
Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis
Goodwin, Amanda P.; Huggins, A. Corinne; Carlo, Maria; Malabonga, Valerie; Kenyon, Dorry; Louguit, Mohammed; August, Diane – Language Testing, 2012
This study describes the development and validation of the Extract the Base test (ETB), which assesses derivational morphological awareness. Scores on this test were validated for 580 monolingual students and 373 Spanish-speaking English language learners (ELLs) in third through fifth grade. As part of the validation of the internal structure,…
Descriptors: Reading Comprehension, Speech Communication, Second Language Learning, Scoring

Shohamy, Elana; Reves, Thea – Language Testing, 1985
Surveys the development of language tests toward authenticity and discusses the advantages and disadvantages of indirect and direct (authentic) language tests. Discusses the difficulty of applying appropriate psychometric measures to tests using real-life language, and the large number of test variables which interfere with the authenticity of…
Descriptors: Comparative Analysis, Interviews, Language Tests, Language Usage

Choi, Inn-Chull; Bachman, Lyle F. – Language Testing, 1992
This study is part of a larger one examining the comparability of the First Certificate in English and the Test of English as a Foreign Language. The general assumption of unidimensionality and goodness-of-fit were tested. Findings raise questions about the consequences of rejecting or retaining misfitting items. (60 references) (LB)
Descriptors: Comparative Analysis, English (Second Language), Goodness of Fit, Item Response Theory

Boldt, Robert F. – Language Testing, 1992
The assumption called PIRC (proportional item response curve) was tested in which PIRC was used to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logistic model, and a modified Rasch model. (12 references) (Author/LB)
Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory

Clark, John L. D. – Language Testing, 1988
A validation study of the "semi-direct" Chinese Speaking Test (CST) directly compared college students' performance on the test with their performance on the "live" language proficiency interview. CST provided scoring results largely equivalent to those of the live interview, although examinees perceived CST to be more…
Descriptors: Chinese, College Students, Comparative Analysis, Higher Education

Kunnan, Antony John – Language Testing, 1992
Three analysis procedures were used to study the dependability and validity of ESLPE, a criterion-referenced English-as-a-Second-Language placement test developed at the University of California at Los Angeles in 1989. Findings led to the suggestion that some students might have been differently placed if subtest scores were used for placement. (38…
Descriptors: Cluster Analysis, Comparative Analysis, Criterion Referenced Tests, English (Second Language)