Showing 1 to 15 of 16 results
Peer reviewed
Yan, Xun; Chuang, Ping-Lin – Language Testing, 2023
This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program.…
Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Certification
Peer reviewed
Jin, Tan; Mak, Barley – Language Testing, 2013
For Chinese as a second language (L2 Chinese), there has been little research into "distinguishing features" (Fulcher, 1996; Iwashita et al., 2008) used in scoring L2 Chinese speaking performance. The study reported here investigates the relationship between the distinguishing features of L2 Chinese spoken performances and the scores…
Descriptors: Second Languages, Second Language Learning, Chinese, Holistic Evaluation
Peer reviewed
Llosa, Lorena – Language Testing, 2011
With the United States' adoption of a standards-based approach to education, most attention has focused on the large-scale, high-stakes assessments intended to measure students' mastery of standards for accountability purposes. Less attention has been paid to the role of standards-based assessments in the classroom. The purpose of this paper is to…
Descriptors: Urban Schools, Student Evaluation, Language Tests, Second Language Learning
Peer reviewed
Butler, Yuko Goto; Lee, Jiyoon – Language Testing, 2010
This study examined the effectiveness of self-assessment among 254 young learners of English as a foreign language. This study looked at 6th grade students in South Korea, who were asked to perform self-assessments on a regular basis for a semester during their English classes. The students improved their ability to self-assess their performance…
Descriptors: Second Language Learning, Program Effectiveness, Effect Size, Foreign Countries
Peer reviewed
In'nami, Yo; Koizumi, Rie – Language Testing, 2009
A meta-analysis was conducted on the effects of multiple-choice and open-ended formats on L1 reading, L2 reading, and L2 listening test performance. Fifty-six data sources located in an extensive search of the literature were the basis for the estimates of the mean effect sizes of test format effects. The results using the mixed effects model of…
Descriptors: Test Format, Listening Comprehension Tests, Multiple Choice Tests, Program Effectiveness
Peer reviewed
Wagner, Elvis – Language Testing, 2010
Video is widely used in the teaching of L2 listening, and SLA researchers have argued that the visual components of spoken texts are useful for the listener in comprehending aural information. Yet video texts are rarely used on tests of L2 listening ability, perhaps in part due to the belief that including the visual channel involves assessing…
Descriptors: Experimental Groups, Control Groups, Listening Comprehension, Quasiexperimental Design
Peer reviewed
Van Moere, Alistair – Language Testing, 2009
The purpose of BEST Plus is to assess the ability to understand and use unprepared, conversational, everyday language within topic areas generally covered in adult education courses. It is one of several standardized assessments approved by the National Reporting System (NRS, 2008), which is the accountability system for federally funded ESL and…
Descriptors: Standardized Tests, Oral Language, Language Tests, Adult Education
Peer reviewed
Bunch, Michael B. – Language Testing, 2011
Title III of Public Law 107-110 (No Child Left Behind; NCLB) provided for creation of assessments of English language learners (ELLs) and established, through the Enhanced Assessment Grant program, a platform from which four consortia of states developed ELL tests aligned to rigorous statewide content standards. Those four tests (ACCESS for ELLs,…
Descriptors: Test Items, Student Evaluation, Federal Legislation, Formative Evaluation
Peer reviewed
Munoz, Ana P.; Alvarez, Marta E. – Language Testing, 2010
This article reports the results of a research study to determine the washback effect of an oral assessment system on some areas of the teaching and learning of English as a Foreign Language (EFL). The research combined quantitative and qualitative research methods within a comparative study between an experimental group and a comparison group.…
Descriptors: Experimental Groups, Qualitative Research, Student Surveys, Program Effectiveness
Peer reviewed
di Gennaro, Kristen – Language Testing, 2009
Practitioners working closely with second language (L2) writers in the US recognize at least two types of L2 students: international (IL2) and Generation 1.5 (G1.5) students. Some argue that specific differences in each group's writing performance are evident (cf. Harklau, 2003; Reid, 2006); however, investigations into observable and measurable…
Descriptors: English (Second Language), Second Language Learning, Student Placement, Writing (Composition)
Peer reviewed
Schaefer, Edward – Language Testing, 2008
The present study employed multi-faceted Rasch measurement (MFRM) to explore the rater bias patterns of native English-speaker (NES) raters when they rate EFL essays. Forty NES raters rated 40 essays written by female Japanese university students on a single topic adapted from the TOEFL Test of Written English (TWE). The essays were assessed using…
Descriptors: Writing Evaluation, Writing Tests, Program Effectiveness, Essays
Peer reviewed
Brooks, Lindsay – Language Testing, 2009
This study, framed within sociocultural theory, examines the interaction of adult ESL test-takers in two tests of oral proficiency: one in which they interacted with an examiner (the individual format) and one in which they interacted with another student (the paired format). The data for the eight pairs in this study were drawn from a larger…
Descriptors: Testing, Rating Scales, Program Effectiveness, Interaction
Peer reviewed
East, Martin – Language Testing, 2007
Whether test takers should be allowed access to dictionaries when taking L2 tests has been the subject of debate for a good number of years. Opinions differ according to how the test construct is understood and whether the underlying value system favours process-orientated assessment for learning, with its concern to elicit the test takers' best…
Descriptors: Writing Tests, Reading Tests, Program Effectiveness, Dictionaries
Peer reviewed
Llosa, Lorena – Language Testing, 2007
The use of standards-based classroom assessments to test English learners' language proficiency is increasingly prevalent in the United States and many other countries. In a large urban school district in California, for example, a classroom assessment is used to make high-stakes decisions about English learners' progress from one level to the…
Descriptors: Urban Schools, Multitrait Multimethod Techniques, Standardized Tests, Construct Validity
Peer reviewed
Sawaki, Yasuyo – Language Testing, 2007
This is a construct validation study of a second language speaking assessment that reported a language profile based on analytic rating scales and a composite score. The study addressed three key issues: score dependability, convergent/discriminant validity of analytic rating scales and the weighting of analytic ratings in the composite score.…
Descriptors: Generalizability Theory, Speech Communication, Student Placement, Construct Validity