ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Source

English Teaching	1
Language Testing	1
National Center for Education…	1
Online Submission	1
Review of Educational Research	1
Working Papers in TESOL &…	1

Author

Chang, Yuh-Fang	1
Dillow, Sally	1
Ginther, April	1
Han, Qie	1
Jiyeo Yun	1
Lv, Jing	1
Maeda, Yukiko	1
White, Sheida	1
Yan, Xun	1
van der Linden, Wim J.	1

Publication Type

Information Analyses	6
Journal Articles	4
Reports - Evaluative	3
Reports - Research	1

Education Level

Adult Basic Education	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Netherlands	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of Adult…

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Rater Cognition in L2 Speaking Assessment: A Review of the Literature

Peer reviewed
PDF on ERIC

Download full text

Han, Qie – Working Papers in TESOL & Applied Linguistics, 2016

This literature review attempts to survey representative studies within the context of L2 speaking assessment that have contributed to the conceptualization of rater cognition. Two types of studies are looked at: 1) studies that examine "how" raters differ (and sometimes agree) in their cognitive processes and rating behaviors, in terms…

Descriptors: Second Language Learning, Student Evaluation, Evaluators, Speech Tests

Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis

Peer reviewed

Direct link

Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016

Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…

Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size

Variations in the Analysis of Written Recall Protocols

Download full text

Chang, Yuh-Fang – Online Submission, 2009

While researchers generally quantify the amount of information that learners recall correctly in order to measure reading comprehension, the unit of analysis adopted to score the recall protocol differs. Whether and how different scoring systems bring about a different picture of L2 reading comprehension, however, remains unexplored. This study…

Descriptors: Reading Comprehension, Measures (Individuals), Scoring, Recall (Psychology)

Key Concepts and Features of the 2003 National Assessment of Adult Literacy. NCES 2006-471

Peer reviewed
PDF on ERIC

Download full text

White, Sheida; Dillow, Sally – National Center for Education Statistics, 2005

The 2003 NAAL is a complex assessment with several components and various types of data. The primary purpose of this publication is to describe the assessment's key features and data types. Thus, the publication covers the critical concepts and features carried over from the 1992 assessment, as well as those new to the 2003 assessment--for…

Descriptors: Adult Literacy, National Surveys, Evaluation, Evaluation Methods

A Latent Trait Look at Pretest-Posttest Validation of Criterion-referenced Test Items.

Peer reviewed

van der Linden, Wim J. – Review of Educational Research, 1981

Using criterion-referenced test item data collected in an empirical study, differences in item selection between Cox and Vargas' pretest-posttest validity index and a latent trait approach (evaluation of the item information function for the mastery score) are analyzed. (Author/GK)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Foreign Countries, Latent Trait Theory

Comparative Analysis	6
Scoring	6
Second Language Learning	4
Correlation	2
Effect Size	2
English (Second Language)	2
Evaluators	2
Foreign Countries	2
Interrater Reliability	2
Meta Analysis	2
Test Validity	2
Adult Literacy	1
College Students	1
Computational Linguistics	1
Computer Assisted Testing	1
Construct Validity	1
Criterion Referenced Tests	1
Decision Making	1
Definitions	1
Essays	1
Evaluation	1
Evaluation Criteria	1
Evaluation Methods	1
Evaluation Research	1
Imitation	1
More ▼