ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	33

Descriptor

Correlation	33
Evaluators	33
Statistical Analysis	33
Second Language Learning	18
Foreign Countries	16
English (Second Language)	13
Writing Evaluation	13
Interrater Reliability	11
Essays	10
Language Tests	10
College Students	8
Comparative Analysis	8
Scores	8
Computational Linguistics	7
Computer Assisted Testing	7
Language Proficiency	7
Scoring	7
Second Language Instruction	7
Oral Language	6
Rating Scales	5
Scoring Rubrics	5
Undergraduate Students	5
Writing Tests	5
Evaluation Criteria	4
Evaluation Methods	4
More ▼

Publication Type

Journal Articles	30
Reports - Research	27
Tests/Questionnaires	4
Reports - Evaluative	3
Dissertations/Theses -…	2
Information Analyses	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Higher Education	13
Postsecondary Education	11
Secondary Education	3
Early Childhood Education	1
Elementary Education	1
Grade 1	1
Grade 11	1
Grade 6	1
Junior High Schools	1
Middle Schools	1
Preschool Education	1
More ▼

Audience

Location

Hong Kong	3
Iran	2
Argentina	1
Australia	1
Belgium	1
California	1
Canada	1
Europe	1
Finland	1
Mexico	1
Netherlands	1
Nigeria	1
Ohio	1
Philippines	1
Sweden	1
Taiwan	1
Texas	1
United Kingdom	1
Vietnam	1
Yemen	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Flesch Kincaid Grade Level…	1
Rosenberg Self Esteem Scale	1

What Works Clearinghouse Rating

Showing 1 to 15 of 33 results Save | Export

The Impact of Rater Variability on Relationships among Different Effect-Size Indices for Inter-Rater Agreement between Human and Automated Essay Scoring

Direct link

Yun, Jiyeo – ProQuest LLC, 2017

Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…

Descriptors: Interrater Reliability, Essays, Scoring, Evaluators

Using Subjective and Objective Measures to Predict Level of Reading Fluency at the End of First Grade

Peer reviewed

Direct link

Morris, Darrell; Pennell, Ashley M.; Perney, Jan; Trathen, Woodrow – Reading Psychology, 2018

This study compared reading rate to reading fluency (as measured by a rating scale). After listening to first graders read short passages, we assigned an overall fluency rating (low, average, or high) to each reading. We then used predictive discriminant analyses to determine which of five measures--accuracy, rate (objective); accuracy, phrasing,…

Descriptors: Reading Fluency, Prediction, Grade 1, Elementary School Students

Effects of Analytical and Holistic Scoring Patterns on Scorer Reliability in Biology Essay Tests

Peer reviewed
PDF on ERIC

Download full text

Ebuoh, Casmir N. – World Journal of Education, 2018

Literature revealed that the patterns/methods of scoring essay tests had been criticized for not being reliable and this unreliability is more likely to be more in internal examinations than in the external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…

Descriptors: Holistic Approach, Scoring, Essay Tests, Biology

Evaluating Comparative Judgment as an Approach to Essay Scoring

Peer reviewed

Direct link

Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016

As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…

Descriptors: Essays, Scoring, Comparative Analysis, Evaluators

Dual Outcomes of Psychology Assignments: Perceived Learning and Feelings of Prideful Accomplishment

Peer reviewed

Direct link

Pines, Harvey A.; Larkin, Judith E.; Murray, Molly P. – Teaching of Psychology, 2016

Two studies explored properties of psychology assignments from an atypical perspective: students' own perceptions of what they learned and their emotional reactions to the assignments, specifically feelings of pride in their work. Study 1 showed that assignments vary in their likelihood of generating prideful accomplishment and identified three…

Descriptors: Assignments, Psychology, Student Attitudes, Correlation

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Functional Adequacy in L2 Writing: Towards a New Rating Scale

Peer reviewed

Direct link

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017

The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…

Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse

Task and Rater Effects in L2 Speaking and Writing: A Synthesis of Generalizability Studies

Peer reviewed

Direct link

In'nami, Yo; Koizumi, Rie – Language Testing, 2016

We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…

Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language

How Do Raters Judge Spoken Vocabulary?

Peer reviewed
PDF on ERIC

Download full text

Li, Hui – English Language Teaching, 2016

The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…

Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making

Linguistic Features of Humor in Academic Writing

Peer reviewed
PDF on ERIC

Download full text

Skalicky, Stephen; Berger, Cynthia M.; Crossley, Scott A.; McNamara, Danielle S. – Advances in Language and Literary Studies, 2016

A corpus of 313 freshman college essays was analyzed in order to better understand the forms and functions of humor in academic writing. Human ratings of humor and wordplay were statistically aggregated using Factor Analysis to provide an overall "Humor" component score for each essay in the corpus. In addition, the essays were also…

Descriptors: Discourse Analysis, Academic Discourse, Humor, Writing (Composition)

Holistic versus Analytic Evaluation of EFL Writing: A Case Study

Peer reviewed
PDF on ERIC

Download full text

Ghalib, Thikra K.; Al-Hattami, Abdulghani A. – English Language Teaching, 2015

This paper investigates the performance of holistic and analytic scoring rubrics in the context of EFL writing. Specifically, the paper compares EFL students' scores on a writing task using holistic and analytic scoring rubrics. The data for the study was collected from 30 participants attending an English undergraduate program in a Yemeni…

Descriptors: Writing Evaluation, Student Evaluation, English (Second Language), Second Language Learning

Automated Session-Quality Assessment for Human Tutoring Based on Expert Ratings of Tutoring Success

Download full text

Nye, Benjamin D.; Morrison, Donald M.; Samei, Borhan – International Educational Data Mining Society, 2015

Archived transcripts from tens of millions of online human tutoring sessions potentially contain important knowledge about how online tutors help, or fail to help, students learn. However, without ways of automatically analyzing these large corpora, any knowledge in this data will remain buried. One way to approach this issue is to train an…

Descriptors: Tutoring, Instructional Effectiveness, Tutors, Models

How Do Utterance Measures Predict Raters' Perceptions of Fluency in French as a Second Language?

Peer reviewed

Direct link

Préfontaine, Yvonne; Kormos, Judit; Johnson, Daniel Ezra – Language Testing, 2016

While the research literature on second language (L2) fluency is replete with descriptions of fluency and its influence with regard to English as an additional language, little is known about what fluency features influence judgments of fluency in L2 French. This study reports the results of an investigation that analyzed the relationship between…

Descriptors: Prediction, French, Second Language Learning, Evaluators

Grounding Lexical Diversity in Human Judgments

Peer reviewed

Direct link

Jarvis, Scott – Language Testing, 2017

The present study discusses the relevance of measures of lexical diversity (LD) to the assessment of learner corpora. It also argues that existing measures of LD, many of which have become specialized for use with language corpora, are fundamentally measures of lexical repetition, are based on an etic perspective of language, and lack construct…

Descriptors: Computational Linguistics, English (Second Language), Second Language Learning, Native Speakers

Understanding the Growth of ESL Paragraph Writing Skills and Its Relationships with Linguistic Features

Peer reviewed

Direct link

Aryadoust, Vahid – Educational Psychology, 2016

This study sought to examine the development of paragraph writing skills of 116 English as a second language university students over the course of 12 weeks and the relationship between the linguistic features of students' written texts as measured by Coh-Metrix--a computational system for estimating textual features such as cohesion and…

Descriptors: English (Second Language), Second Language Learning, Writing Skills, College Students

Previous Page | Next Page »

Pages: 1 | 2 | 3

Language Testing	5
Advances in Language and…	2
ETS Research Report Series	2
English Language Teaching	2
Language Assessment Quarterly	2
ProQuest LLC	2
Applied Measurement in…	1
Contemporary Issues in…	1
Early Education and…	1
Educational Psychology	1
Educational Research and…	1
Hispania	1
International Educational…	1
International Journal of…	1
Journal of Clinical Child and…	1
Journal of Early Adolescence	1
Journal of Effective Teaching	1
Journal on English Language…	1
Modern Language Journal	1
New Horizons in Education	1
Reading Psychology	1
System: An International…	1
Teaching of Psychology	1
World Journal of Education	1
More ▼

Coniam, David	3
Al-Hattami, Abdulghani A.	1
Aryadoust, Vahid	1
Berger, Cynthia M.	1
Bridgeman, Brent	1
Brown, Michelle Stallone	1
Carlsson, Gunilla	1
Crossley, Scott A.	1
Davey, Tim	1
Davis, Larry	1
Downer, Jason	1
Ebuoh, Casmir N.	1
Engels, Rutger C. M. E.	1
Ferrara, Steve	1
Fortune, Tara W.	1
Gentile, Claudia	1
Ghalib, Thikra K.	1
Glew, David	1
Haak, Maria	1
Hoang, Giang Thi Linh	1
In'nami, Yo	1
Iwarsson, Susanne	1
Jarvis, Scott	1
Jia, Yujie	1
Johnson, Daniel Ezra	1
More ▼