Showing all 6 results
Jiyeo Yun – English Teaching, 2023
Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…
Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring
Peer reviewed
Han, Qie – Working Papers in TESOL & Applied Linguistics, 2016
This literature review attempts to survey representative studies within the context of L2 speaking assessment that have contributed to the conceptualization of rater cognition. Two types of studies are looked at: 1) studies that examine "how" raters differ (and sometimes agree) in their cognitive processes and rating behaviors, in terms…
Descriptors: Second Language Learning, Student Evaluation, Evaluators, Speech Tests
Peer reviewed
Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016
Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…
Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size
Chang, Yuh-Fang – Online Submission, 2009
While researchers generally quantify the amount of information that learners recall correctly in order to measure reading comprehension, the unit of analysis adopted to score the recall protocol differs. Whether and how different scoring systems bring about a different picture of L2 reading comprehension, however, remains unexplored. This study…
Descriptors: Reading Comprehension, Measures (Individuals), Scoring, Recall (Psychology)
Peer reviewed
White, Sheida; Dillow, Sally – National Center for Education Statistics, 2005
The 2003 NAAL is a complex assessment with several components and various types of data. The primary purpose of this publication is to describe the assessment's key features and data types. Thus, the publication covers the critical concepts and features carried over from the 1992 assessment, as well as those new to the 2003 assessment--for…
Descriptors: Adult Literacy, National Surveys, Evaluation, Evaluation Methods
Peer reviewed
van der Linden, Wim J. – Review of Educational Research, 1981
Using criterion-referenced test item data collected in an empirical study, differences in item selection between Cox and Vargas' pretest-posttest validity index and a latent trait approach (evaluation of the item information function for the mastery score) are analyzed. (Author/GK)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Foreign Countries, Latent Trait Theory