ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	16

Descriptor

Correlation	18
Language Tests	18
English (Second Language)	17
Second Language Learning	17
Test Reliability	11
Scores	10
Computer Assisted Testing	9
Scoring	8
Interrater Reliability	6
Oral Language	6
Foreign Countries	5
Statistical Analysis	5
Accuracy	4
Evaluators	4
Factor Analysis	4
Foreign Students	4
Language Proficiency	4
Native Speakers	4
Profiles	4
Reliability	4
Test Validity	4
Computer Software	3
Decision Making	3
Essays	3
Grade Point Average	3
More ▼

Source

ETS Research Report Series	7
Language Testing	3
Applied Linguistics	1
Cogent Education	1
Educational Testing Service	1
English Language Teaching	1
International Journal of…	1
ProQuest LLC	1
Psicologica: International…	1

Publication Type

Journal Articles	15
Reports - Research	14
Tests/Questionnaires	4
Reports - Evaluative	2
Dissertations/Theses -…	1
Reports - Descriptive	1

Education Level

Higher Education	7
Postsecondary Education	6
Elementary Education	1
High Schools	1
Secondary Education	1

Audience

Researchers

Location

Iran	2
China	1
Colombia	1
Germany	1
India	1
Japan	1
Jordan	1
Kenya	1
Mexico	1
Pennsylvania (Philadelphia)	1
South Korea	1
Turkey	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	18
ACTFL Oral Proficiency…	1
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Adaptation and Assessment of a Public Speaking Rating Scale

Peer reviewed

Direct link

Iberri-Shea, Gina – Cogent Education, 2017

Prominent spoken language assessments such as the Oral Proficiency Interview and the Test of Spoken English have been primarily concerned with speaking ability as it relates to conversation. This paper looks at an additional aspect of spoken language ability, namely public speaking. This study used an adapted form of a public speaking rating scale…

Descriptors: Public Speaking, Rating Scales, Adoption (Ideas), English Instruction

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed
PDF on ERIC

Download full text

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Evaluating Subscore Uses across Multiple Levels: A Case of Reading and Listening Subscores for Young EFL Learners

Peer reviewed

Direct link

Choi, Ikkyu; Papageorgiou, Spiros – Language Testing, 2020

Stakeholders of language tests are often interested in subscores. However, reporting a subscore is not always justified; a subscore should provide reliable and distinct information to be worth reporting. When a subscore is used for decisions across multiple levels (e.g., individual test takers and schools), it needs to be justified for its…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

Peer reviewed
PDF on ERIC

Download full text

Baghaei, Purya; Ravand, Hamdollah – Psicologica: International Journal of Methodology and Experimental Psychology, 2016

In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…

Descriptors: Cloze Procedure, Reading, Reading Comprehension, Reading Skills

Placement of International English Language Learners: How Different Is It?

Peer reviewed
PDF on ERIC

Download full text

Bostian, Brad – International Journal of Multidisciplinary Perspectives in Higher Education, 2017

Amid profound changes to student placement systems at universities and colleges, the placement of English language learners has remained largely the same. Generally speaking, international students, and in some places other English language learners, face single measure testing and required remediation. Single measure high stakes testing goes…

Descriptors: Student Placement, English (Second Language), Second Language Learning, College Students

Modern English Drama and the Students' Fluency and Accuracy of Speaking

Peer reviewed
PDF on ERIC

Download full text

Pishkar, Kian; Moinzadeh, Ahmad; Dabaghi, Azizallah – English Language Teaching, 2017

Speaking a language involves more than simply knowing the linguistic components of the message, and developing language skills requires more than grammatical comprehension and vocabulary memorization. In teaching-learning processes, drama method may have some positive effects on ELL students' speaking fluency and accuracy. This study attempts to…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Undergraduate Students

Predicting Grades from an English Language Assessment: The Importance of Peeling the Onion

Peer reviewed

Direct link

Bridgeman, Brent; Cho, Yeonsuk; DiPietro, Stephen – Language Testing, 2016

Data from 787 international undergraduate students at an urban university in the United States were used to demonstrate the importance of separating a sample into meaningful subgroups in order to demonstrate the ability of an English language assessment to predict the first-year grade point average (GPA). For example, when all students were pooled…

Descriptors: Grade Prediction, English Curriculum, Language Tests, Undergraduate Students

Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

Download full text

Haberman, Shelby J. – Educational Testing Service, 2011

Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

Descriptors: Writing Tests, Scoring, Essays, Language Tests

Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

Peer reviewed
PDF on ERIC

Download full text

Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013

This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation

Rater Expertise in a Second Language Speaking Assessment: The Influence of Training and Experience

Direct link

Davis, Lawrence Edward – ProQuest LLC, 2012

Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…

Descriptors: Evaluators, Expertise, Scores, Second Language Learning

Toward Automated Multi-Trait Scoring of Essays: Investigating Links among Holistic, Analytic, and Text Feature Scores

Peer reviewed

Direct link

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – Applied Linguistics, 2010

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multi-trait) rating dimensions and their relationships to holistic scores and "e-rater"[R] essay feature variables in the context of the TOEFL[R] computer-based test (TOEFL CBT) writing assessment. Data analyzed in the study were holistic…

Descriptors: Writing Evaluation, Writing Tests, Scoring, Essays

Analytic Scoring of TOEFL® CBT Essays: Scores from Humans and "E-rater"®. TOEFL® Research Reports. RR-81. ETS RR-08-01

Peer reviewed
PDF on ERIC

Download full text

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – ETS Research Report Series, 2008

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multitrait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (CBT) writing assessment. Data analyzed in the study were analytic and holistic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scoring

Relationship of Admission Test Scores to Writing Performance of Native and Nonnative Speakers of English.

Download full text

Carlson, Sybil B.; And Others – 1985

Four writing samples were obtained from 638 foreign college applicants who represented three major foreign language groups (Arabic, Chinese, and Spanish), and from 60 native English speakers. All four were scored holistically, two were also scored for sentence-level and discourse-level skills, and some were scored by the Writer's Workbench…

Descriptors: Arabic, Chinese, College Entrance Examinations, Computer Software

Construct Validity of "e-rater"® in Scoring TOEFL® Essays. Research Report. ETS RR-07-21

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal – ETS Research Report Series, 2007

This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…

Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)

Previous Page | Next Page »

Pages: 1 | 2

Davis, Larry	2
Gentile, Claudia	2
Kantor, Robert	2
Lee, Yong-Won	2
Attali, Yigal	1
Baghaei, Purya	1
Bostian, Brad	1
Bridgeman, Brent	1
Carlson, Sybil B.	1
Cho, Yeonsuk	1
Choi, Ikkyu	1
Dabaghi, Azizallah	1
Davis, Lawrence Edward	1
DiPietro, Stephen	1
Haberman, Shelby J.	1
Iberri-Shea, Gina	1
Lee, Jihyun	1
Manalo, Jonathan R.	1
Moinzadeh, Ahmad	1
Mollaun, Pam	1
Norris, John	1
Papageorgiou, Spiros	1
Pishkar, Kian	1
Ravand, Hamdollah	1
More ▼