Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 4 |
Source
| Language Testing | 6 |
Author
| Knoch, Ute | 2 |
| Aryadoust, Vahid | 1 |
| Brown, James Dean | 1 |
| Chapelle, Carol A. | 1 |
| Deygers, Bart | 1 |
| Khamboonruang, Apichat | 1 |
| Luo, Lan | 1 |
| Peterson, Meghan E. | 1 |
| Ross, Jacqueline | 1 |
| Stansfield, Charles W. | 1 |
| Wind, Stefanie A. | 1 |
Publication Type
| Information Analyses | 6 |
| Journal Articles | 6 |
| Reports - Evaluative | 2 |
| Opinion Papers | 1 |
| Reports - Descriptive | 1 |
| Reports - Research | 1 |
Assessments and Surveys
| Test of English as a Foreign… | 1 |
| Test of Written English | 1 |
Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021
Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…
Descriptors: Rating Scales, Test Construction, Language Tests, Test Use
Aryadoust, Vahid; Luo, Lan – Language Testing, 2023
This study reviewed conceptualizations and operationalizations of second language (L2) listening constructs. A total of 157 peer-reviewed papers published in 19 journals in applied linguistics were coded for (1) publication year, author, source title, location, language, and reliability and (2) listening subskills, cognitive processes, attributes,…
Descriptors: Test Format, Listening Comprehension Tests, Second Language Learning, Second Language Instruction
Knoch, Ute; Chapelle, Carol A. – Language Testing, 2018
Argument-based validation requires test developers and researchers to specify what is entailed in test interpretation and use. Doing so has been shown to yield advantages (Chapelle, Enright, & Jamieson, 2010), but it also requires an analysis of how the concerns of language testers can be conceptualized in the terms used to construct a…
Descriptors: Test Validity, Language Tests, Evaluation Research, Rating Scales
Wind, Stefanie A.; Peterson, Meghan E. – Language Testing, 2018
The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on…
Descriptors: Language Tests, Evaluators, Evaluation Methods, Interrater Reliability
Peer reviewed: Brown, James Dean – Language Testing, 1990
Presents simplified methods for deriving estimates of the consistency of criterion-referenced, English-as-a-Second-Language tests, including (1) the threshold loss agreement approach using agreement or kappa coefficients, (2) the squared-error loss agreement approach using the phi(lambda) dependability approach, and (3) the domain score…
Descriptors: Criterion Referenced Tests, English (Second Language), Language Tests, Second Language Learning
Peer reviewed: Stansfield, Charles W.; Ross, Jacqueline – Language Testing, 1988
Outlines research necessary for determining the validity and reliability of Test of Written English, an essay test that directly measures writing ability and complements Test of English-as-a-Foreign-Language's (TOEFL) indirect assessment of writing skills. Research should cover such aspects as construct, criterion-related, concurrent, content, and…
Descriptors: English (Second Language), Essay Tests, Language Research, Language Tests

