ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	5

Descriptor

Rating Scales	7
Test Construction	7
Language Tests	5
English (Second Language)	4
Language Proficiency	4
High Stakes Tests	3
Test Items	3
Test Reliability	3
Test Validity	3
Interviews	2
Scores	2
Second Language Learning	2
Accuracy	1
Applied Linguistics	1
Aviation Education	1
Check Lists	1
College Students	1
Comparative Analysis	1
Cutting Scores	1
Diagnostic Tests	1
English for Special Purposes	1
Essays	1
Evaluation Research	1
Flight Training	1
Foreign Countries	1
More ▼

Source

Language Testing

Author

Deygers, Bart	1
Esmat Babaii	1
Farshad Effatpanah	1
Fulcher, Glenn	1
Griffin, Patrick E.	1
John Read	1
Khamboonruang, Apichat	1
Knoch, Ute	1
Lukácsi, Zoltán	1
Maria Treadaway	1
Mona Tabatabaee-Yazdi	1
Purya Baghaei	1
Sun-Young Shin	1
Yunwen Su	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	6
Information Analyses	1
Reports - Evaluative	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Iran

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Comparing Two Formats of Data-Driven Rating Scales for Classroom Assessment of Pragmatic Performance with Roleplays

Peer reviewed

Direct link

Yunwen Su; Sun-Young Shin – Language Testing, 2024

Rating scales that language testers design should be tailored to the specific test purpose and score use as well as reflect the target construct. Researchers have long argued for the value of data-driven scales for classroom performance assessment, because they are specific to pedagogical tasks and objectives, have rich descriptors to offer useful…

Descriptors: Rating Scales, Language Tests, Test Construction, Performance Based Assessment

Revisiting Rating Scale Development for Rater-Mediated Language Performance Assessments: Modelling Construct and Contextual Choices Made by Scale Developers

Peer reviewed

Direct link

Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021

Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…

Descriptors: Rating Scales, Test Construction, Language Tests, Test Use

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Setting Standards for a Diagnostic Test of Aviation English for Student Pilots

Peer reviewed

Direct link

Maria Treadaway; John Read – Language Testing, 2024

Standard-setting is an essential component of test development, supporting the meaningfulness and appropriate interpretation of test scores. However, in the high-stakes testing environment of aviation, standard-setting studies are underexplored. To address this gap, we document two stages in the standard-setting procedures for the Overseas Flight…

Descriptors: Standard Setting, Diagnostic Tests, High Stakes Tests, English for Special Purposes

Developing a Level-Specific Checklist for Assessing EFL Writing

Peer reviewed

Direct link

Lukácsi, Zoltán – Language Testing, 2021

In second language writing assessment, rating scales and scores from human-mediated assessment have been criticized for a number of shortcomings including problems with adequacy, relevance, and reliability (Hamp-Lyons, 1990; McNamara, 1996; Weigle, 2002). In its testing practice, Euroexam International also detected that the rating scales for…

Descriptors: Test Construction, Test Validity, Test Items, Check Lists

An Algorithmic Approach to Prescriptive Assessment in English as a Second Language.

Peer reviewed

Griffin, Patrick E.; And Others – Language Testing, 1988

Discusses the development of an interview test of English proficiency in the 0 to 1+ range on the Australian Second Language Proficiency Rating Scale. Items were written toward 29 specified objectives using specially developed algorithms. A sample set of algorithms used in one of the tests and 23 references are appended. (Author/LMO)

Descriptors: English (Second Language), Interviews, Language Proficiency, Language Tests

Does Thick Description Lead to Smart Tests? A Data-Based Approach to Rating Scale Construction.

Peer reviewed

Fulcher, Glenn – Language Testing, 1996

Examines the definition of fluency in the literature, and proposes a qualitative and quantitative approach that may be used to produce a "thick" description of language use for use in rating scale construction. The article suggests that validity considerations must be addressed in the construction phase of developing scales. (69 references)…

Descriptors: College Students, English (Second Language), Individual Differences, Interviews