White, Lisa – ProQuest LLC, 2017
Although multi-rater tools have been used in the corporate world for decades, their use in evaluating school leaders began relatively recently. With states seeking flexibility from the "Elementary and Secondary Education Act of 1965" (reauthorized as the "No Child Left Behind Act of 2001"), the requirement to develop and implement principal…
Descriptors: Principals, Administrator Evaluation, Surveys, Self Evaluation (Individuals)
Azer, Haniyeh Sadeghi; Aghayi, Mohammad Bagher – Advances in Language and Literary Studies, 2015
This study aims to evaluate the translation quality of two machine translation systems in translating six different text-types, from English to Persian. The evaluation was based on criteria proposed by Van Slype (1979). The proposed model for evaluation is a black-box type, comparative and adequacy-oriented evaluation. To conduct the evaluation, a…
Descriptors: Computational Linguistics, Computer Software, Translation, Users (Information)
Hart, John T., Jr. – Contributions to Music Education, 2016
The purpose of this study was to examine the effects of Laban Effort Action (slash) instruction in an undergraduate conducting class on college wind ensemble members' ratings of conductors' gestural clarity. Participants--undergraduate and graduate wind ensemble members (N = 28)--rated 32 videos of eight undergraduate conducting students who had…
Descriptors: Undergraduate Students, Music Education, Music Activities, Administrators
Davis, Larry – Language Testing, 2016
Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Descriptors: Evaluators, Oral Language, Scores, Language Tests
Teker, Gulsen Tasdelen; Guler, Nese; Uyanik, Gulden Kaya – Educational Sciences: Theory and Practice, 2015
Generalizability theory (G theory) provides a broad conceptual framework for social sciences such as psychology and education, and a comprehensive construct for numerous measurement events by using analysis of variance, a strong statistical method. G theory, as an extension of both classical test theory and analysis of variance, is a model which…
Descriptors: Guidelines, Generalizability Theory, Computer Software, Statistical Analysis
Rex, Camille C.; Metzler, Jonathan N. – Measurement in Physical Education and Exercise Science, 2016
The purpose of this research was to develop a measure of sport injury anxiety (SIA), defined as the tendency to make threat appraisals in sport situations where injury is seen as possible and/or likely. The Sport Injury Anxiety Scale (SIAS) was developed in three stages. In Stage 1, expert raters evaluated items to determine their adequacy. In…
Descriptors: Anxiety, Injuries, Measures (Individuals), Self Concept
Moshinsky, Avital; Ziegler, David; Gafni, Naomi – International Journal of Testing, 2017
Many medical schools have adopted multiple mini-interviews (MMI) as an advanced selection tool. MMIs are expensive and used to test only a few dozen candidates per day, making it infeasible to develop a different test version for each test administration. Therefore, some items are reused both within and across years. This study investigated the…
Descriptors: Interviews, Medical Schools, Test Validity, Test Reliability
Li, Hui – English Language Teaching, 2016
The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making
Skalicky, Stephen; Berger, Cynthia M.; Crossley, Scott A.; McNamara, Danielle S. – Advances in Language and Literary Studies, 2016
A corpus of 313 freshman college essays was analyzed in order to better understand the forms and functions of humor in academic writing. Human ratings of humor and wordplay were statistically aggregated using Factor Analysis to provide an overall "Humor" component score for each essay in the corpus. In addition, the essays were also…
Descriptors: Discourse Analysis, Academic Discourse, Humor, Writing (Composition)
Ahmadi, Alireza; Sadeghi, Elham – Language Assessment Quarterly, 2016
In the present study we investigated the effect of test format on oral performance in terms of test scores and discourse features (accuracy, fluency, and complexity). Moreover, we explored how the scores obtained on different test formats relate to such features. To this end, 23 Iranian EFL learners participated in three test formats of monologue,…
Descriptors: Oral Language, Comparative Analysis, Language Fluency, Accuracy
Préfontaine, Yvonne; Kormos, Judit; Johnson, Daniel Ezra – Language Testing, 2016
While the research literature on second language (L2) fluency is replete with descriptions of fluency and its influence with regard to English as an additional language, little is known about what fluency features influence judgments of fluency in L2 French. This study reports the results of an investigation that analyzed the relationship between…
Descriptors: Prediction, French, Second Language Learning, Evaluators
Aryadoust, Vahid – Educational Psychology, 2016
This study sought to examine the development of paragraph writing skills of 116 English as a second language university students over the course of 12 weeks and the relationship between the linguistic features of students' written texts as measured by Coh-Metrix--a computational system for estimating textual features such as cohesion and…
Descriptors: English (Second Language), Second Language Learning, Writing Skills, College Students
Alemi, Minoo; Khanlarzadeh, Neda – Iranian Journal of Language Teaching Research, 2016
The analysis of raters' comments on pragmatic assessment of L2 learners is among new and understudied concepts in second language studies. To shed light on this issue, the present investigation targeted important variables such as raters' criteria and rating patterns by analyzing the interlanguage pragmatic assessment process of the Iranian…
Descriptors: Pragmatics, English (Second Language), Second Language Learning, Video Technology
Kang, Okim; Vo, Son Ca Thanh; Moran, Meghan Kerry – TESL-EJ, 2016
Research in second language speech has often focused on listeners' accent judgment and factors that affect their perception. However, the topic of listeners' application of specific sound categories in their own perceptual judgments has not been widely investigated. The current study explored how listeners from diverse language backgrounds weighed…
Descriptors: Pronunciation, Phonology, English (Second Language), Second Language Learning
Schmid, Monika S.; Hopp, Holger – Language Testing, 2014
This study examines the methodology of global foreign accent ratings in studies on L2 speech production. In three experiments, we test how variation in raters, range within speech samples, as well as instructions and procedures affects ratings of accent in predominantly monolingual speakers of German, non-native speakers of German, as well as…
Descriptors: Comparative Analysis, Second Language Learning, Pronunciation, Native Speakers