Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 9 |
Descriptor
Source
Language Testing | 15 |
Author
Adams, Raymond J. | 1 |
Bernstein, Jared | 1 |
Butler, Frances A. | 1 |
Cheng, Jian | 1 |
Chukharev-Hudilainen, Evgeny | 1 |
Clark, John L. D. | 1 |
Eckes, Thomas | 1 |
Fulcher, Glenn | 1 |
Ginther, April | 1 |
Gokturk, Nazlinur | 1 |
Grotjahn, Rudiger | 1 |
More ▼ |
Publication Type
Journal Articles | 15 |
Reports - Research | 13 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 2 |
Elementary Education | 1 |
Postsecondary Education | 1 |
Audience
Location
Brazil | 1 |
Cyprus | 1 |
Indiana | 1 |
South Africa | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 2 |
Michigan Test of English… | 1 |
What Works Clearinghouse Rating
Haerim Hwang; Hyunwoo Kim – Language Testing, 2024
Given the lack of computational tools available for assessing second language (L2) production in Korean, this study introduces a novel automated tool called the Korean Syntactic Complexity Analyzer (KOSCA) for measuring syntactic complexity in L2 Korean production. As an open-source graphic user interface (GUI) developed in Python, KOSCA provides…
Descriptors: Korean, Natural Language Processing, Syntax, Computer Graphics
Gokturk, Nazlinur; Chukharev-Hudilainen, Evgeny – Language Testing, 2023
With recent technological advances, researchers have begun to explore the potential use of spoken dialog systems (SDSs) for L2 oral communication assessment. While several studies support the feasibility of building these systems for various types of oral tasks, research on the construct validity of SDS-delivered tasks is still limited. Thus, this…
Descriptors: Oral Language, Dialogs (Language), Second Language Learning, Second Language Instruction
Roever, Carsten; Kasper, Gabriele – Language Testing, 2018
In the assessment of speaking, a psycholinguistically based speaking construct has predominated. In this paper, we argue for the integration of the construct of interactional competence (IC) in speaking assessments to broaden the range of defensible inferences from speaking tests. IC emphasizes the co-constructed nature of interaction and enables…
Descriptors: Language Tests, Testing, Second Language Learning, Language Proficiency
Wilsenach, Carien; Schaefer, Maxine – Language Testing, 2022
Multilingualism in education is encouraged in South Africa, and children are expected to become bilingual and biliterate during the early primary grades. Much focus has been placed on measuring literacy in children's first language, often the medium of instruction (MOI), and English, the language typically used as MOI from fourth grade. However,…
Descriptors: Test Construction, Test Validity, Reading Tests, Reading Comprehension
LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017
Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…
Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences
Ginther, April; Yan, Xun – Language Testing, 2018
This study examines the predictive validity of the TOEFL iBT with respect to academic achievement as measured by the first-year grade point average (GPA) of Chinese students at Purdue University, a large, public, Research I institution in Indiana, USA. Correlations between GPA, TOEFL iBT total and subsection scores were examined on 1990 mainland…
Descriptors: Correlation, Computer Assisted Testing, Profiles, English (Second Language)
Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014
The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…
Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance
Bernstein, Jared; Van Moere, Alistair; Cheng, Jian – Language Testing, 2010
This paper presents evidence that supports the valid use of scores from fully automatic tests of spoken language ability to indicate a person's effectiveness in spoken communication. The paper reviews the constructs, scoring, and the concurrent validity evidence of "facility-in-L2" tests, a family of automated spoken language tests in Spanish,…
Descriptors: Speech, Oral Language, Language Tests, Test Validity

Powers, Donald E.; Schedl, Mary A.; Leung, Susan Wilson; Butler, Frances A. – Language Testing, 1999
A communicative-competence orientation was undertaken to study the validity of test-score inferences derived from the revised Test of Spoken English (TSE). To implement the approach, a sample of undergraduate students, primarily native-English speakers, provided reactions to the test responses of a sample of TSE examinees. (Author/VWL)
Descriptors: College Students, Communicative Competence (Languages), English (Second Language), Inferences

Swain, Merrill – Language Testing, 2001
Examines one aspect of the many interfaces between second language (L2) learning and L2 testing. The aspect is the oral interaction--the dialogue--that occurs within small groups. Discusses from within a sociocultural theory of mind, that in a group, performance is jointly constructed and distributed across the participants. (Author/VWL)
Descriptors: Dialogs (Language), Inferences, Interaction, Language Tests
Eckes, Thomas; Grotjahn, Rudiger – Language Testing, 2006
What C-tests actually measure has been an issue of debate for many years. In the present research, the authors examined the hypothesis that C-tests measure general language proficiency. A total of 843 participants from four independent samples took a German C-test along with the TestDaF (Test of German as a Foreign Language). Rasch measurement…
Descriptors: Test Validity, Language Proficiency, German, Factor Analysis

Adams, Raymond J.; And Others – Language Testing, 1987
Classical test theory and correlational techniques such as factor analysis have been unable to deal with many language tests' measurement problems. The Partial Credit Model, a latent trait model for the analysis of data scored in ordered categories, is used to construct and analyze an oral interview test of English as a Second Language. (28…
Descriptors: English (Second Language), Interviews, Language Proficiency, Language Tests

Clark, John L. D. – Language Testing, 1988
A validation study of the "semi-direct" Chinese Speaking Test (CST) directly compared college students' performance on the test with their performance on the "live" language proficiency interview. CST provided scoring results largely equivalent to those of the live interview, although examinees perceived CST to be more…
Descriptors: Chinese, College Students, Comparative Analysis, Higher Education

Scott, Mary Lee – Language Testing, 1986
Assessed native Brazilian students' affective reactions to different oral English-as-a-Foreign-Language achievement test formats. A multivariate analysis of variance based on the results of a factor analysis showed no significant difference among student reactions to the different test formats. (Author/CB)
Descriptors: English (Second Language), Factor Analysis, Foreign Countries, Higher Education

Fulcher, Glenn – Language Testing, 1996
Investigates issues surrounding the use of tasks in oral tests with particular reference to group discussion. The article focuses on language testing, second-language acquisition, and discourse analysis to shed light on the selection of tasks for use in oral tests. (65 references) (Author/CK)
Descriptors: English (Second Language), Foreign Countries, Group Discussion, Interviews