ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	4

Descriptor

Computer Assisted Testing	5
Second Language Learning	5
Language Tests	4
Oral Language	4
Scoring	4
Correlation	3
English (Second Language)	3
Test Validity	3
Classification	2
Evaluators	2
Foreign Students	2
Scores	2
Scoring Rubrics	2
Speech	2
Academic Discourse	1
Accuracy	1
Audio Equipment	1
College Admission	1
Comparative Analysis	1
Computer System Design	1
Criterion Referenced Tests	1
Cutting Scores	1
Database Design	1
Database Management Systems	1
Diagnostic Tests	1
More ▼

Source

ETS Research Report Series	3
Language Testing	2

Author

Xi, Xiaoming	5
Higgins, Derrick	2
Zechner, Klaus	2
Ling, Guangming	1
Mollaun, Pam	1
Mollaun, Pamela	1
Williamson, David	1
Williamson, David M.	1

Publication Type

Journal Articles	5
Reports - Research	4
Tests/Questionnaires	3
Reports - Evaluative	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

California (Los Angeles)	1
Florida	1
North Carolina (Charlotte)	1
Pennsylvania (Philadelphia)	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 5 results Save | Export

A Study on the Impact of Fatigue on Human Raters When Scoring Speaking Responses

Peer reviewed

Direct link

Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014

The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…

Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance

A Comparison of Two Scoring Methods for an Automated Speech Scoring System

Peer reviewed

Direct link

Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David – Language Testing, 2012

This paper compares two alternative scoring methods--multiple regression and classification trees--for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation and empirical performance in predicting human scores. The empirical performance of the two scoring models…

Descriptors: Scoring, Classification, Weighted Scores, Comparative Analysis

Automated Scoring of Spontaneous Speech Using SpeechRater? v1.0. Research Report. ETS RR-08-62

Peer reviewed
PDF on ERIC

Download full text

Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David M. – ETS Research Report Series, 2008

This report presents the results of a research and development effort for SpeechRater? Version 1.0 (v1.0), an automated scoring system for the spontaneous speech of English language learners used operationally in the Test of English as a Foreign Language™ (TOEFL®) Practice Online assessment (TPO). The report includes a summary of the validity…

Descriptors: Speech, Scoring, Scoring Rubrics, Scoring Formulas

Investigating the Criterion-Related Validity of the TOEFL® Speaking Scores for ITA Screening and Setting Standards for ITAS. TOEFL iBT Research Report. TOEFL iBT-03. ETS RR-08-02

Peer reviewed
PDF on ERIC

Download full text

Xi, Xiaoming – ETS Research Report Series, 2008

Although the primary use of the speaking section of the Test of English as a Foreign Language™ Internet-based test (TOEFL® iBT Speaking test) is to inform admissions decisions at English medium universities, it may also be useful as an initial screening measure for international teaching assistants (ITAs). This study provides criterion-related…

Descriptors: Test Validity, Criterion Referenced Tests, English (Second Language), Language Tests

Investigating the Utility of Analytic Scoring for the TOEFL Academic Speaking Test (TAST). TOEFL iBT Research Report. TOEFL iBT-01. ETS RR-06-07

Peer reviewed
PDF on ERIC

Download full text

Xi, Xiaoming; Mollaun, Pam – ETS Research Report Series, 2006

This study explores the utility of analytic scoring for the TOEFL® Academic Speaking Test (TAST) in providing useful and reliable diagnostic information in three aspects of candidates' performance: delivery, language use, and topic development. G studies were used to investigate the dependability of the analytic scores, the distinctness of the…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language