Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
English (Second Language) | 8 |
Reliability | 8 |
Language Tests | 6 |
Second Language Learning | 6 |
Validity | 6 |
Scores | 4 |
Accuracy | 2 |
College Students | 2 |
Correlation | 2 |
Evaluators | 2 |
Rating Scales | 2 |
More ▼ |
Source
ETS Research Report Series | 2 |
Language Assessment Quarterly | 1 |
Language Teaching Research… | 1 |
Online Submission | 1 |
ProQuest LLC | 1 |
TESOL Quarterly | 1 |
Author
Attali, Yigal | 1 |
Bogorevich, Valeria | 1 |
Burstein, Jill | 1 |
Davis, Lawrence Edward | 1 |
DeVincenzi, Felicia | 1 |
Fazeli, Seyed Hossein | 1 |
Kantor, Robert | 1 |
Kermad, Alyssa | 1 |
Lee, Jihyun | 1 |
Lee, Yong-Won | 1 |
Mollaun, Pam | 1 |
More ▼ |
Publication Type
Journal Articles | 6 |
Reports - Research | 6 |
Dissertations/Theses -… | 1 |
Guides - Classroom - Teacher | 1 |
Numerical/Quantitative Data | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 4 |
Postsecondary Education | 4 |
High Schools | 2 |
Secondary Education | 2 |
Elementary Education | 1 |
Grade 10 | 1 |
Grade 11 | 1 |
Grade 12 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
More ▼ |
Audience
Location
Iran | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 8 |
Graduate Management Admission… | 1 |
International English… | 1 |
Strategy Inventory for… | 1 |
What Works Clearinghouse Rating
Kermad, Alyssa; Bogorevich, Valeria – Language Teaching Research Quarterly, 2022
The practice of second language (L2) speech perception has traditionally relied on equal-interval perceptual scales and novice listeners' (NLs) impressionistic judgments of constructs such as accentedness and comprehensibility (Munro & Derwing, 2011). However, issues have surfaced with respect to how well NLs can use these scales, whether they…
Descriptors: Speech Communication, Second Language Learning, Intelligibility, Rating Scales
Papageorgiou, Spiros; Xi, Xiaoming; Morgan, Rick; So, Youngsoon – Language Assessment Quarterly, 2015
This study presents the development and empirical validation of score levels and descriptors specifically designed for reporting purposes to provide test takers with more than just a number on a score scale. In the context of a test primarily intended for 11- to 15-year-old students learning English as a second/foreign language, the study examined…
Descriptors: Scores, Validity, Scaling, Classification
Fazeli, Seyed Hossein – Online Submission, 2012
The current study aims to analyze the psychometric qualities of the Persian adapted version of Strategy Inventory for Language Learning (SILL) developed by Rebecca L. Oxford (1990). Three instruments were used: Persian adapted version of SILL, a Background Questionnaire, and Test of English as a Foreign Language. Two hundred and thirteen Iranian…
Descriptors: Psychometrics, Measures (Individuals), Indo European Languages, Females
Davis, Lawrence Edward – ProQuest LLC, 2012
Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…
Descriptors: Evaluators, Expertise, Scores, Second Language Learning
Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002
This paper reports the results of generalizability theory (G) analyses done for new writing and speaking tasks for the Test of English as a Foreign Language (TOEFL). For writing, a special focus was placed on evaluating the impact on the reliability of the number of raters (or ratings) per essay (one or two) and the number of tasks (one, two, or…
Descriptors: English (Second Language), Generalizability Theory, Reliability, Scores

DeVincenzi, Felicia – TESOL Quarterly, 1995
Argues that teachers need to become "informed consumers" of standardized tests in order to influence decisions about test use and about ways to help students perform at their best. Six strategies for considering the content of a test form are presented. (LR)
Descriptors: Content Analysis, English (Second Language), Evaluation, Guidelines
Stankov, Lazar; Lee, Jihyun – ETS Research Report Series, 2007
This paper examines the nature of confidence in relation to cognitive abilities, personality traits, and metacognition. Confidence was measured as it was expressed in answers to each test item during the administration of reading and listening sections of the TOEFL® iBT. The confidence scores were correlated with the accuracy scores from the TOEFL…
Descriptors: English (Second Language), Grade Point Average, High Schools, Personality Traits
Attali, Yigal; Burstein, Jill – ETS Research Report Series, 2005
The e-rater® system has been used by ETS for automated essay scoring since 1999. This paper describes a new version of e-rater (v.2.0) that differs from the previous one (v.1.3) with regard to the feature set and model building approach. The paper describes the new version, compares the new and previous versions in terms of performance, and…
Descriptors: Essay Tests, Automation, Scoring, Comparative Analysis