Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 4 |
Descriptor
Comparative Analysis | 6 |
Scoring | 6 |
Second Language Learning | 4 |
Correlation | 2 |
Effect Size | 2 |
English (Second Language) | 2 |
Evaluators | 2 |
Foreign Countries | 2 |
Interrater Reliability | 2 |
Meta Analysis | 2 |
Test Validity | 2 |
Source
English Teaching | 1 |
Language Testing | 1 |
National Center for Education… | 1 |
Online Submission | 1 |
Review of Educational Research | 1 |
Working Papers in TESOL &… | 1 |
Author
Chang, Yuh-Fang | 1 |
Dillow, Sally | 1 |
Ginther, April | 1 |
Han, Qie | 1 |
Jiyeo Yun | 1 |
Lv, Jing | 1 |
Maeda, Yukiko | 1 |
White, Sheida | 1 |
Yan, Xun | 1 |
van der Linden, Wim J. | 1 |
Publication Type
Information Analyses | 6 |
Journal Articles | 4 |
Reports - Evaluative | 3 |
Reports - Research | 1 |
Education Level
Adult Basic Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Location
Netherlands | 1 |
Taiwan | 1 |
Assessments and Surveys
National Assessment of Adult… | 1 |
Jiyeo Yun – English Teaching, 2023
Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…
Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring
Han, Qie – Working Papers in TESOL & Applied Linguistics, 2016
This literature review attempts to survey representative studies within the context of L2 speaking assessment that have contributed to the conceptualization of rater cognition. Two types of studies are looked at: 1) studies that examine "how" raters differ (and sometimes agree) in their cognitive processes and rating behaviors, in terms…
Descriptors: Second Language Learning, Student Evaluation, Evaluators, Speech Tests
Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis
Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016
Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…
Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size
Chang, Yuh-Fang – Online Submission, 2009
While researchers generally quantify the amount of information that learners recall correctly in order to measure reading comprehension, the unit of analysis adopted to score the recall protocol differs. Whether and how different scoring systems bring about a different picture of L2 reading comprehension, however, remains unexplored. This study…
Descriptors: Reading Comprehension, Measures (Individuals), Scoring, Recall (Psychology)
White, Sheida; Dillow, Sally – National Center for Education Statistics, 2005
The 2003 NAAL is a complex assessment with several components and various types of data. The primary purpose of this publication is to describe the assessment's key features and data types. Thus, the publication covers the critical concepts and features carried over from the 1992 assessment, as well as those new to the 2003 assessment--for…
Descriptors: Adult Literacy, National Surveys, Evaluation, Evaluation Methods

van der Linden, Wim J. – Review of Educational Research, 1981
Using criterion-referenced test item data collected in an empirical study, differences in item selection between Cox and Vargas' pretest-posttest validity index and a latent trait approach (evaluation of the item information function for the mastery score) are analyzed. (Author/GK)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Foreign Countries, Latent Trait Theory