NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 10 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Yuichiro Yokouchi – Language Testing in Asia, 2025
The performance decision tree (PDT; Fulcher et al., 2011) is a rubric style that is applicable to performance assessment, with origins in Upshur and Turner's (1995) empirically derived binary-choice, boundary-definition (EBB) scale. It is easier for raters to assess performance by evaluating multiple binary-choice descriptors. Additionally,…
Descriptors: Scoring Rubrics, Second Language Learning, Second Language Instruction, Language Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Berger, Cynthia M.; Crossley, Scott A.; Kyle, Kristopher – Applied Linguistics, 2019
A large data set of L1 psycholinguistic norms (Balota "et al." 2007) was used to assess spoken L2 English lexical proficiency in cross-sectional and longitudinal learner corpora. Behavioral norms included lexical decision and word naming latencies (i.e. reaction times) and accuracies for 40,481 English words. A frequency measure was…
Descriptors: Psycholinguistics, Native Language, Second Language Learning, Case Studies
Peer reviewed Peer reviewed
Direct linkDirect link
Tan, Chin Pei; Howes, Dora; Tan, Rendell K. W.; Dancza, Karina M. – Assessment & Evaluation in Higher Education, 2022
Interactive oral assessments demonstrate potential to develop graduate attributes such as critical thinking, professional communication and collaborative skills in students through authentic simulation of workplace scenarios. This study captured the design, delivery and evaluation of interactive oral assessments across three programmes --…
Descriptors: Oral Language, Interaction, Critical Thinking, Communication Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Saito, Kazuya; Liu, Yuwei – Second Language Research, 2022
There is emerging evidence that collocation use plays a primary role in determining various dimensions of L2 oral proficiency assessment and development. The current study presents the results of three experiments which examined the relationship between the degree of association in collocation use (operationalized as t scores and mutual…
Descriptors: Phrase Structure, Case Studies, Second Language Learning, Second Language Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Burton, John Dylan – Language Assessment Quarterly, 2020
An assumption underlying speaking tests is that scores reflect the ability to produce online, non-rehearsed speech. Speech produced in testing situations may, however, be less spontaneous if extensive test preparation takes place, resulting in memorized or rehearsed responses. If raters detect these patterns, they may conceptualize speech as…
Descriptors: Language Tests, Oral Language, Scores, Speech Communication
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yalçin-Çolakoglu, Özlem; Selçuk, Merve – Advances in Language and Literary Studies, 2019
Criterion referenced tests of second language speaking performance are administered in different institutions using different procedures. The present study reports raters' practices of second language speaking tests, in particular the correspondence between test-takers' grades when assessed individually and in groups. Data derived from…
Descriptors: Oral Language, Language Tests, Test Validity, Inferences
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Li, Hui – English Language Teaching, 2016
The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
O'Hagan, Sally; Pill, John; Zhang, Ying – Language Testing, 2016
Criticism of specific-purpose language (LSP) tests is often directed at their limited ability to represent fully the demands of the target language use situation. Such criticisms extend to the criteria used to assess test performance, which may fail to capture what matters to participants in the domain of interest. This paper reports on the…
Descriptors: Health Personnel, Language Tests, English for Special Purposes, Criticism
Davis, Lawrence Edward – ProQuest LLC, 2012
Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…
Descriptors: Evaluators, Expertise, Scores, Second Language Learning
Hsieh, Ching-Ni – ProQuest LLC, 2011
Second language (L2) oral performance assessment always involves raters' subjective judgments and is thus subject to rater variability. The variability due to rater characteristics has important consequential impacts on decision-making processes, particularly in high-stakes testing situations (Bachman, Lynch, & Mason, 1995; A. Brown, 1995;…
Descriptors: Undergraduate Students, Phonology, Teaching Assistants, Foreign Students