Showing all 8 results
Peer reviewed
Kermad, Alyssa; Bogorevich, Valeria – Language Teaching Research Quarterly, 2022
The practice of second language (L2) speech perception has traditionally relied on equal-interval perceptual scales and novice listeners' (NLs) impressionistic judgments of constructs such as accentedness and comprehensibility (Munro & Derwing, 2011). However, issues have surfaced with respect to how well NLs can use these scales, whether they…
Descriptors: Speech Communication, Second Language Learning, Intelligibility, Rating Scales
Peer reviewed
Papageorgiou, Spiros; Xi, Xiaoming; Morgan, Rick; So, Youngsoon – Language Assessment Quarterly, 2015
This study presents the development and empirical validation of score levels and descriptors specifically designed for reporting purposes to provide test takers with more than just a number on a score scale. In the context of a test primarily intended for 11- to 15-year-old students learning English as a second/foreign language, the study examined…
Descriptors: Scores, Validity, Scaling, Classification
Fazeli, Seyed Hossein – Online Submission, 2012
The current study aims to analyze the psychometric qualities of the Persian adapted version of Strategy Inventory for Language Learning (SILL) developed by Rebecca L. Oxford (1990). Three instruments were used: Persian adapted version of SILL, a Background Questionnaire, and Test of English as a Foreign Language. Two hundred and thirteen Iranian…
Descriptors: Psychometrics, Measures (Individuals), Indo European Languages, Females
Davis, Lawrence Edward – ProQuest LLC, 2012
Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…
Descriptors: Evaluators, Expertise, Scores, Second Language Learning
Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002
This paper reports the results of generalizability theory (G) analyses done for new writing and speaking tasks for the Test of English as a Foreign Language (TOEFL). For writing, a special focus was placed on evaluating the impact on the reliability of the number of raters (or ratings) per essay (one or two) and the number of tasks (one, two, or…
Descriptors: English (Second Language), Generalizability Theory, Reliability, Scores
Peer reviewed
DeVincenzi, Felicia – TESOL Quarterly, 1995
Argues that teachers need to become "informed consumers" of standardized tests in order to influence decisions about test use and about ways to help students perform at their best. Six strategies for considering the content of a test form are presented. (LR)
Descriptors: Content Analysis, English (Second Language), Evaluation, Guidelines
Peer reviewed
Stankov, Lazar; Lee, Jihyun – ETS Research Report Series, 2007
This paper examines the nature of confidence in relation to cognitive abilities, personality traits, and metacognition. Confidence was measured as it was expressed in answers to each test item during the administration of reading and listening sections of the TOEFL® iBT. The confidence scores were correlated with the accuracy scores from the TOEFL…
Descriptors: English (Second Language), Grade Point Average, High Schools, Personality Traits
Peer reviewed
Attali, Yigal; Burstein, Jill – ETS Research Report Series, 2005
The e-rater® system has been used by ETS for automated essay scoring since 1999. This paper describes a new version of e-rater (v.2.0) that differs from the previous one (v.1.3) with regard to the feature set and model building approach. The paper describes the new version, compares the new and previous versions in terms of performance, and…
Descriptors: Essay Tests, Automation, Scoring, Comparative Analysis