ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	15

Descriptor

Scoring	22
English (Second Language)	21
Language Tests	18
Second Language Learning	14
Test Validity	14
Computer Assisted Testing	11
Essay Tests	7
Test Construction	7
Correlation	6
Evaluators	6
Interrater Reliability	6
Language Proficiency	6
Scores	6
Comparative Analysis	5
Test Reliability	5
Writing Evaluation	5
Writing Tests	5
Accuracy	4
Construct Validity	4
Essays	4
Factor Analysis	4
Higher Education	4
Oral Language	4
Rating Scales	4
Validity	4
More ▼

Source

ETS Research Report Series	5
Educational Testing Service	3
Language Testing	3
Assessing Writing	1
College Entrance Examination…	1
Education and Information…	1
Language Education &…	1
ProQuest LLC	1
SAGE Open	1
TESL Canada Journal	1

Publication Type

Reports - Research	15
Journal Articles	13
Tests/Questionnaires	5
Reports - Evaluative	4
Reports - Descriptive	2
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Education	1
Elementary Secondary Education	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Researchers

Location

China	2
Japan	2
Brazil	1
Colombia	1
Germany	1
India	1
Iran	1
Jordan	1
Kenya	1
Mexico	1
South Korea	1
Turkey	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	22
Graduate Record Examinations	3
Graduate Management Admission…	2
International English…	2
Computer Attitude Scale	1
Law School Admission Test	1
Medical College Admission Test	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 22 results Save | Export

Towards More Valid Scoring Criteria for Integrated Reading-Writing and Listening-Writing Summary Tasks

Peer reviewed

Direct link

Chan, Sathena; May, Lyn – Language Testing, 2023

Despite the increased use of integrated tasks in high-stakes academic writing assessment, research on rating criteria which reflect the unique construct of integrated summary writing skills is comparatively rare. Using a mixed-method approach of expert judgement, text analysis, and statistical analysis, this study examines writing features that…

Descriptors: Scoring, Writing Evaluation, Reading Tests, Listening Skills

Design Framework for the "TOEFL® Essentials"™ Test 2021. Research Memorandum. ETS RM-21-03

Download full text

Papageorgiou, Spiros; Davis, Larry; Norris, John M.; Garcia Gomez, Pablo; Manna, Venessa F.; Monfils, Lora – Educational Testing Service, 2021

The "TOEFL® Essentials"™ test is a new English language proficiency test in the "TOEFL"® family of assessments. It measures foundational language skills and communication abilities in academic and general (daily life) contexts. The test covers the four language skills of reading, listening, writing, and speaking and is intended…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Language Proficiency

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed
PDF on ERIC

Download full text

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Direct link

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Screener Tests Need Validation Too: Weighing an Argument for Test Use against Practical Concerns

Peer reviewed

Direct link

Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018

In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…

Descriptors: Test Validity, Test Use, Test Construction, Language Tests

For a Greater Good: Bias Analysis in Writing Assessment

Peer reviewed

Direct link

Ahmadi Shirazi, Masoumeh – SAGE Open, 2019

Threats to construct validity should be reduced to a minimum. If true, sources of bias, namely raters, items, tests as well as gender, age, race, language background, culture, and socio-economic status need to be spotted and removed. This study investigates raters' experience, language background, and the choice of essay prompt as potential…

Descriptors: Foreign Countries, Language Tests, Test Bias, Essay Tests

Building a Validity Argument for the Use of Academic Language Tests for Immigration Purposes: Evidence from Immigration-Seeking Test-Takers

Peer reviewed
PDF on ERIC

Download full text

Hoang, Ngoc Thi Huyen – Language Education & Assessment, 2019

As validity pertains to test use rather than the test itself, using a test for unintended purposes requires a new validation program using additional evidence from relevant sources. This small-scale study contributes to the validation of the use of originally academic language tests--the International English Language Testing System and the Test…

Descriptors: Language Tests, Immigrants, Immigration, Testing Problems

A Study on the Impact of Fatigue on Human Raters When Scoring Speaking Responses

Peer reviewed

Direct link

Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014

The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…

Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

English Language Learners and Automated Scoring of Essays: Critical Considerations

Peer reviewed

Direct link

Weigle, Sara Cushing – Assessing Writing, 2013

This article presents considerations for using automated scoring systems to evaluate second language writing. A distinction is made between English language learners in English-medium educational systems and those studying English in their own countries for a variety of purposes, and between learning-to-write and writing-to-learn in a second…

Descriptors: Scoring, Second Language Learning, Second Languages, English Language Learners

Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

Download full text

Haberman, Shelby J. – Educational Testing Service, 2011

Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

Descriptors: Writing Tests, Scoring, Essays, Language Tests

Rater Expertise in a Second Language Speaking Assessment: The Influence of Training and Experience

Direct link

Davis, Lawrence Edward – ProQuest LLC, 2012

Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…

Descriptors: Evaluators, Expertise, Scores, Second Language Learning

Evaluating the Construct-Coverage of the e-rater[R] Scoring Engine. Research Report. ETS RR-09-01

Download full text

Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009

This report evaluates the construct coverage of the e-rater[R[ scoring engine. The matter of construct coverage depends on whether one defines writing skill, in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…

Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests

Problems in Developing an Alternative to the TOEFL.

Peer reviewed
PDF on ERIC

Download full text

Des Brisay, Margaret – TESL Canada Journal, 1994

Data from the Canadian Test of English for Scholars and Trainees (CanTEST) are compared to data from the Test of English as a Foreign Language (TOEFL) to establish CanTEST as a valid admissions tool for English-as-a-Second Language college applicants. Data are taken from four groups of examinees who took both tests. (eight references) (LR)

Descriptors: Admission Criteria, Comparative Analysis, Comparative Testing, Correlation

Construct Validity of "e-rater"® in Scoring TOEFL® Essays. Research Report. ETS RR-07-21

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal – ETS Research Report Series, 2007

This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…

Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)

Previous Page | Next Page »

Pages: 1 | 2

Attali, Yigal	3
Davis, Larry	2
Ahmadi Shirazi, Masoumeh	1
Bejar, Isaac I.	1
Breland, Hunter M.	1
Bridgeman, Brent	1
Burstein, Jill	1
Camp, Roberta	1
Carlson, Sybil B.	1
Chan, Sathena	1
Davis, Lawrence Edward	1
Des Brisay, Margaret	1
Fowles, Mary E.	1
Garcia Gomez, Pablo	1
Getman, Edward P.	1
Haberman, Shelby J.	1
Hemat, Ramin	1
Henning, Grant	1
Hicks, Marilyn M.	1
Higgins, Derrick	1
Hoang, Ngoc Thi Huyen	1
Ling, Guangming	1
Manna, Venessa F.	1
May, Lyn	1
More ▼