Yun, Jiyeo – ProQuest LLC, 2017
Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…
Descriptors: Interrater Reliability, Essays, Scoring, Evaluators
Morris, Darrell; Pennell, Ashley M.; Perney, Jan; Trathen, Woodrow – Reading Psychology, 2018
This study compared reading rate to reading fluency (as measured by a rating scale). After listening to first graders read short passages, we assigned an overall fluency rating (low, average, or high) to each reading. We then used predictive discriminant analyses to determine which of five measures--accuracy, rate (objective); accuracy, phrasing,…
Descriptors: Reading Fluency, Prediction, Grade 1, Elementary School Students
Ebuoh, Casmir N. – World Journal of Education, 2018
Literature revealed that the patterns/methods of scoring essay tests had been criticized for not being reliable, and this unreliability is likely to be greater in internal examinations than in external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…
Descriptors: Holistic Approach, Scoring, Essay Tests, Biology
Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016
As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…
Descriptors: Essays, Scoring, Comparative Analysis, Evaluators
Pines, Harvey A.; Larkin, Judith E.; Murray, Molly P. – Teaching of Psychology, 2016
Two studies explored properties of psychology assignments from an atypical perspective: students' own perceptions of what they learned and their emotional reactions to the assignments, specifically feelings of pride in their work. Study 1 showed that assignments vary in their likelihood of generating prideful accomplishment and identified three…
Descriptors: Assignments, Psychology, Student Attitudes, Correlation
Davis, Larry – Language Testing, 2016
Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Descriptors: Evaluators, Oral Language, Scores, Language Tests
Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017
The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…
Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse
In'nami, Yo; Koizumi, Rie – Language Testing, 2016
We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…
Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language
Li, Hui – English Language Teaching, 2016
The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making
Skalicky, Stephen; Berger, Cynthia M.; Crossley, Scott A.; McNamara, Danielle S. – Advances in Language and Literary Studies, 2016
A corpus of 313 freshman college essays was analyzed in order to better understand the forms and functions of humor in academic writing. Human ratings of humor and wordplay were statistically aggregated using Factor Analysis to provide an overall "Humor" component score for each essay in the corpus. In addition, the essays were also…
Descriptors: Discourse Analysis, Academic Discourse, Humor, Writing (Composition)
Ghalib, Thikra K.; Al-Hattami, Abdulghani A. – English Language Teaching, 2015
This paper investigates the performance of holistic and analytic scoring rubrics in the context of EFL writing. Specifically, the paper compares EFL students' scores on a writing task using holistic and analytic scoring rubrics. The data for the study was collected from 30 participants attending an English undergraduate program in a Yemeni…
Descriptors: Writing Evaluation, Student Evaluation, English (Second Language), Second Language Learning
Nye, Benjamin D.; Morrison, Donald M.; Samei, Borhan – International Educational Data Mining Society, 2015
Archived transcripts from tens of millions of online human tutoring sessions potentially contain important knowledge about how online tutors help, or fail to help, students learn. However, without ways of automatically analyzing these large corpora, any knowledge in this data will remain buried. One way to approach this issue is to train an…
Descriptors: Tutoring, Instructional Effectiveness, Tutors, Models
Préfontaine, Yvonne; Kormos, Judit; Johnson, Daniel Ezra – Language Testing, 2016
While the research literature on second language (L2) fluency is replete with descriptions of fluency and its influence with regard to English as an additional language, little is known about what fluency features influence judgments of fluency in L2 French. This study reports the results of an investigation that analyzed the relationship between…
Descriptors: Prediction, French, Second Language Learning, Evaluators
Jarvis, Scott – Language Testing, 2017
The present study discusses the relevance of measures of lexical diversity (LD) to the assessment of learner corpora. It also argues that existing measures of LD, many of which have become specialized for use with language corpora, are fundamentally measures of lexical repetition, are based on an etic perspective of language, and lack construct…
Descriptors: Computational Linguistics, English (Second Language), Second Language Learning, Native Speakers
Aryadoust, Vahid – Educational Psychology, 2016
This study sought to examine the development of paragraph writing skills of 116 English as a second language university students over the course of 12 weeks and the relationship between the linguistic features of students' written texts as measured by Coh-Metrix--a computational system for estimating textual features such as cohesion and…
Descriptors: English (Second Language), Second Language Learning, Writing Skills, College Students