Showing all 15 results
Peer reviewed
Lim, Gad S. – Language Testing, 2011
Raters are central to writing performance assessment, and rater development--training, experience, and expertise--involves a temporal dimension. However, few studies have examined new and experienced raters' rating performance longitudinally over multiple time points. This study uses operational data from the writing section of the MELAB (n =…
Descriptors: Expertise, Writing Evaluation, Performance Based Assessment, Writing Tests
Peer reviewed
Huang, Shu-Chen – Language Testing, 2011
This study examined two types of classroom assessment events, the more closed convergent assessments (CA) versus the more open-ended divergent assessments (DA), to see if they influence learners differently in terms of motivation and learning strategies. Participants were 105 college freshmen in Taiwan with the same instructor placed under one…
Descriptors: College Freshmen, Speech Communication, Self Efficacy, Performance Based Assessment
Peer reviewed
Kim, Youn-Hee – Language Testing, 2011
Despite the increasing interest in and need for test information for use in instructional practice and student learning, there have been few attempts to systematically link a diagnostic approach to English for academic purposes (EAP) writing instruction and assessment. In response to this need for research, this study examined the extent to which…
Descriptors: Performance Based Assessment, Performance Tests, Diagnostic Tests, Discriminant Analysis
Peer reviewed
Ginther, April; Dimova, Slobodanka; Yang, Rui – Language Testing, 2010
Information provided by examination of the skills that underlie holistic scores can be used not only as supporting evidence for the validity of inferences associated with performance tests but also as a way to improve the scoring rubrics, descriptors, and benchmarks associated with scoring scales. As fluency is considered a critical, perhaps…
Descriptors: Performance Tests, Scoring Rubrics, Measures (Individuals), Scoring
Peer reviewed
Plough, India C.; Briggs, Sarah L.; Van Bonn, Sarah – Language Testing, 2010
The study reported here examined the evaluation criteria used to assess the proficiency and effectiveness of the language produced in an oral performance test of English conducted in an American university context. Empirical methods were used to analyze qualitatively and quantitatively transcriptions of the Oral English Tests (OET) of 44…
Descriptors: Graduate Students, Listening Comprehension, Evaluators, Performance Tests
Peer reviewed
Johnson, Jeff S.; Lim, Gad S. – Language Testing, 2009
Language performance assessments typically require human raters, introducing possible error. In international examinations of English proficiency, rater language background is an especially salient factor that needs to be considered. The existence of rater language background-related bias in writing performance assessment is the object of this…
Descriptors: Performance Based Assessment, Performance Tests, Native Speakers, English (Second Language)
Peer reviewed
Knoch, Ute – Language Testing, 2009
Alderson (2005) suggests that diagnostic tests should identify strengths and weaknesses in learners' use of language and focus on specific elements rather than global abilities. However, rating scales used in performance assessment have been repeatedly criticized for being imprecise and therefore often resulting in holistic marking by raters…
Descriptors: Feedback (Response), Language Usage, Performance Based Assessment, Performance Tests
Peer reviewed
Lee, Hee-Kyung; Anderson, Carolyn – Language Testing, 2007
The goal of the current study was to examine the validity and topic generality of a writing performance test designed to place international students into appropriate ESL courses at a large mid-western university. Because for each test administration the test randomly rotates three academic topics integrated with listening and reading sources, it…
Descriptors: Majors (Students), Student Placement, Performance Tests, Test Validity
Peer reviewed
Pollitt, Alastair; Hutchinson, Carolyn – Language Testing, 1987
Describes the use of the partial credit form of the Rasch model in the analysis and calibration of a set of writing tasks in which assessment scales and criteria were adapted to suit each task's specific demands. Potential applications of the partial credit model in language testing are discussed. (Author/CB)
Descriptors: Evaluation Criteria, Language Tests, Performance Tests, Second Language Learning
Peer reviewed
Upshur, John A.; Turner, Carolyn E. – Language Testing, 1999
Research on two approaches to assessment of second-language performance--second-language acquisition and language testing--is examined and compared with regard to systematic effects on language tests. Findings incidental to a test development project are then presented. It is concluded that a full account of performance testing requires a paradigm…
Descriptors: Discourse Analysis, Language Tests, Performance Tests, Second Language Learning
Peer reviewed
Seaton, Ian – Language Testing, 1987
Suggests that language testing should be more performance-based and realigned with a current and richer understanding of the language teaching and learning processes. This kind of testing has questionable validity until a wide range of variables are successfully defined and delimited. (CB)
Descriptors: English for Academic Purposes, Language Proficiency, Language Tests, Performance Tests
Peer reviewed
Kozaki, Y. – Language Testing, 2004
This article presents a standard-setting procedure for performance assessment in a foreign language, through which some of the major problems facing performance assessment in criterion-referenced testing can be addressed. The procedure, which was geared to revealing and accommodating inter-judge variability, employed the synergy of multiple…
Descriptors: Data Analysis, Testing, Performance Tests, Generalizability Theory
Peer reviewed
Stansfield, Charles W.; Hewitt, William E. – Language Testing, 2005
The United States Court Interpreters Act (US Congress, 1978) requires that interpreters in US federal courts be certified through a criterion-referenced performance test. The Federal Court Interpreter Certification Examination (FCICE) is a two-phase certification battery for federal court interpreters. Phase I is a multiple-choice Written…
Descriptors: Program Effectiveness, Certification, Screening Tests, Predictive Validity
Peer reviewed
Shameem, Nikhat – Language Testing, 1998
Examined the validity of aural and oral self-report scales for determining the Fiji Hindi proficiency of new adolescent immigrants in New Zealand. Participants completed self-reports and performance tests (oral interviews, listening-comprehension tests, and vocabulary tests). Performance tests correlated strongly with self-reports. Respondents…
Descriptors: Adolescents, Foreign Countries, Immigrants, Language Proficiency
Peer reviewed
Wesche, Marjorie Bingham – Language Testing, 1987
Discusses a recently developed post-admissions testing battery, for Ontario colleges and universities, that tests students' listening, speaking, reading, and writing skills through integrated texts and tasks that simulate academic language use. The battery yields placement and diagnostic information about students' readiness to undertake academic…
Descriptors: Educational Diagnosis, English for Academic Purposes, Foreign Countries, Higher Education