Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 11 |
Source
ETS Research Report Series | 4 |
Language Assessment Quarterly | 3 |
Language Testing | 2 |
Acta Educationis Generalis | 1 |
Educational Testing Service | 1 |
International Journal of… | 1 |
New Directions for… | 1 |
Author
Attali, Yigal | 1 |
Boonsuk, Yusop | 1 |
Bridgeman, Brent | 1 |
Brown, Annie | 1 |
Carlson, Sybil B. | 1 |
Davey, Tim | 1 |
Erdosy, M. Usman | 1 |
Gu, Lin | 1 |
Haberman, Shelby J. | 1 |
Hsieh, Ching-Ni | 1 |
Iwashita, Noriko | 1 |
Publication Type
Journal Articles | 12 |
Reports - Research | 11 |
Reports - Descriptive | 2 |
Tests/Questionnaires | 2 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Elementary Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Australia | 1 |
Boonsuk, Yusop; Karakas, Ali – Acta Educationis Generalis, 2020
Introduction: In recent years, the number of test-takers of international tests of English has grown at an exponential rate. Those whose first language is not English, i.e. non-native English speakers (NNES), constitute the predominant majority of these test-takers, largely based in non-Anglophone contexts. Thus, the state of whether the…
Descriptors: Language Variation, English (Second Language), Second Language Learning, Second Language Instruction
Gu, Lin; Hsieh, Ching-Ni – Language Assessment Quarterly, 2019
Examining spoken features across proficiency levels allows researchers to explore the nature of speaking proficiency as it develops. This line of research has thus far primarily focused on adult second language (L2) learners. Using cross-sectional data based on a large-scale language assessment intended for young L2 learners, in this study, we…
Descriptors: Oral Language, Speech Communication, English (Second Language), Second Language Learning
Yu, Guoxing – Language Assessment Quarterly, 2013
This article reports the lexical diversity of summaries written by experts and test takers in an empirical study and then interrogates the (in)congruity between the conceptualisations of "summary" and "summarize" in the literature of educational research and the operationalization of summarization tasks in three international…
Descriptors: Documentation, Writing Tests, Language Usage, Language Tests
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score. The four trait scores are word choice, grammatical…
Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)
Youn, Soo Jung – Language Testing, 2015
This study investigates the validity of assessing L2 pragmatics in interaction using mixed methods, focusing on the evaluation inference. Open role-plays that are meaningful and relevant to the stakeholders in an English for Academic Purposes context were developed for classroom assessment. For meaningful score interpretations and accurate…
Descriptors: Second Language Learning, Pragmatics, Validity, Mixed Methods Research
Wei, Jing; Llosa, Lorena – Language Assessment Quarterly, 2015
This article reports on an investigation of the role raters' language background plays in raters' assessment of test takers' speaking ability. Specifically, this article examines differences between American and Indian raters in their scores and scoring processes when rating Indian test takers' responses to the Test of English as a Foreign…
Descriptors: North Americans, Indians, Evaluators, English (Second Language)
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Scoring models for the "e-rater"® system were built and evaluated for the "TOEFL"® exam's independent and integrated writing prompts. Prompt-specific and generic scoring models were built, and evaluation statistics, such as weighted kappas, Pearson correlations, standardized differences in mean scores, and correlations with…
Descriptors: Scoring, Prompting, Evaluators, Computer Software
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater® to score the TOEFL iBT® Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013
Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…
Descriptors: Oral Language, Language Proficiency, Scaling, Scores
Yin, Alexander C.; Volkwein, J. Fredericks – New Directions for Institutional Research, 2010
After surveying 1,827 students in their final year at eighty randomly selected two-year and four-year public and private institutions, American Institutes for Research (2006) reported that approximately 30 percent of students in two-year institutions and nearly 20 percent of students in four-year institutions have only basic quantitative…
Descriptors: Standardized Tests, Basic Skills, College Admission, Educational Testing
Xi, Xiaoming – Language Testing, 2007
This study explores the utility of analytic scoring for TAST in providing useful and reliable diagnostic information for operational use in three aspects of candidates' performance: delivery, language use and topic development. One hundred and forty examinees' responses to six TAST tasks were scored analytically on these three aspects of speech. G…
Descriptors: Scoring, Profiles, Performance Based Assessment, Academic Discourse

Erdosy, M. Usman – International Journal of English Studies, 2001
Describes how some background factors influenced the way in which one experienced rater dealt with a number of operations involved in setting up and applying scoring criteria in the assessment of 60 Test of English as a Foreign Language essays. Implications are drawn for both future research into interrater variability and for rater training.…
Descriptors: English (Second Language), Evaluation Criteria, Interrater Reliability, Language Research
Carlson, Sybil B.; And Others – 1985
Four writing samples were obtained from 638 foreign college applicants who represented three major foreign language groups (Arabic, Chinese, and Spanish), and from 60 native English speakers. All four were scored holistically, two were also scored for sentence-level and discourse-level skills, and some were scored by the Writer's Workbench…
Descriptors: Arabic, Chinese, College Entrance Examinations, Computer Software
Brown, Annie; Iwashita, Noriko; McNamara, Tim – ETS Research Report Series, 2005
This report documents two coordinated exploratory studies into the nature of oral English-for-academic-purposes (EAP) proficiency. Study I used verbal-report methodology to examine field experts' rating orientations, and Study II investigated the quality of test-taker discourse on two different Test of English as a Foreign Language™ (TOEFL®) task…
Descriptors: Evaluators, English (Second Language), Language Tests, Second Language Learning