ERIC - Search Results

Showing 1 to 15 of 28 results

Complementary Strengths? Evaluation of a Hybrid Human-Machine Scoring Approach for a Test of Oral Academic English

Peer reviewed

Davis, Larry; Papageorgiou, Spiros – Assessment in Education: Principles, Policy & Practice, 2021

Human raters and machine scoring systems potentially have complementary strengths in evaluating language ability; specifically, it has been suggested that automated systems might be used to make consistent measurements of specific linguistic phenomena, whilst humans evaluate more global aspects of performance. We report on an empirical study that…

Descriptors: Scoring, English for Academic Purposes, Oral English, Speech Tests

Of Standardized Student Measurements and Tests in the Dominican Republic

Tavarez Da Costa, Pedro; Reyes Arias, Fransheska – Online Submission, 2021

The present work seeks to establish a comparison between two different and distant evaluation tools applied to the Dominican student population in order to measure the efficiency of our educational system in the recent years, one of them measured the quality of Dominican education in three areas (the PISA Test), whereas the other tested the…

Descriptors: Foreign Countries, Standardized Tests, Student Evaluation, International Assessment

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Adding Value to Second-Language Listening and Reading Subscores: Using a Score Augmentation Approach

Peer reviewed

Papageorgiou, Spiros; Choi, Ikkyu – International Journal of Testing, 2018

This study examined whether reporting subscores for groups of items within a test section assessing a second-language modality (specifically reading or listening comprehension) added value from a measurement perspective to the information already provided by the section scores. We analyzed the responses of 116,489 test takers to reading and…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Language Tests

Evaluating Subscore Uses across Multiple Levels: A Case of Reading and Listening Subscores for Young EFL Learners

Peer reviewed

Choi, Ikkyu; Papageorgiou, Spiros – Language Testing, 2020

Stakeholders of language tests are often interested in subscores. However, reporting a subscore is not always justified; a subscore should provide reliable and distinct information to be worth reporting. When a subscore is used for decisions across multiple levels (e.g., individual test takers and schools), it needs to be justified for its…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Do the TOEFL iBT® Section Scores Provide Value-Added Information to Stakeholders?

Peer reviewed

Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018

The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

Development and Validation of a Researcher Constructed Psycho-Motor Mechanism Scale for Evaluating the Quality of Translation Works

Peer reviewed

Hanifehzadeh, Sepeedeh; Farahzad, Farzaneh – International Journal of Language Testing, 2016

The present study was designed basically to develop a psycho-motor mechanism scale based on the theory of translation competence proposed by PACTE (2003), and then to assess the validity and reliability of the constructed scale. In this quantitative research, after designing the scale, two translation tasks were given to 90 M.A. students majoring…

Descriptors: Translation, Language Tests, Test Construction, Test Reliability

Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

Peer reviewed

Baghaei, Purya; Ravand, Hamdollah – Psicologica: International Journal of Methodology and Experimental Psychology, 2016

In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…

Descriptors: Cloze Procedure, Reading, Reading Comprehension, Reading Skills

Modern English Drama and the Students' Fluency and Accuracy of Speaking

Peer reviewed

Pishkar, Kian; Moinzadeh, Ahmad; Dabaghi, Azizallah – English Language Teaching, 2017

Speaking a language involves more than simply knowing the linguistic components of the message, and developing language skills requires more than grammatical comprehension and vocabulary memorization. In teaching-learning processes, drama method may have some positive effects on ELL students' speaking fluency and accuracy. This study attempts to…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Undergraduate Students

Adaptation and Assessment of a Public Speaking Rating Scale

Peer reviewed

Iberri-Shea, Gina – Cogent Education, 2017

Prominent spoken language assessments such as the Oral Proficiency Interview and the Test of Spoken English have been primarily concerned with speaking ability as it relates to conversation. This paper looks at an additional aspect of spoken language ability, namely public speaking. This study used an adapted form of a public speaking rating scale…

Descriptors: Public Speaking, Rating Scales, Adoption (Ideas), English Instruction

Predicting Grades from an English Language Assessment: The Importance of Peeling the Onion

Peer reviewed

Bridgeman, Brent; Cho, Yeonsuk; DiPietro, Stephen – Language Testing, 2016

Data from 787 international undergraduate students at an urban university in the United States were used to demonstrate the importance of separating a sample into meaningful subgroups in order to demonstrate the ability of an English language assessment to predict the first-year grade point average (GPA). For example, when all students were pooled…

Descriptors: Grade Prediction, English Curriculum, Language Tests, Undergraduate Students

Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

Peer reviewed

Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013

This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation

Analytic Scoring of TOEFL® CBT Essays: Scores from Humans and "E-rater"®. TOEFL® Research Reports. RR-81. ETS RR-08-01

Peer reviewed

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – ETS Research Report Series, 2008

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multitrait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (CBT) writing assessment. Data analyzed in the study were analytic and holistic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scoring

Factor Structure of the TOEFL Internet-Based Test across Subgroups. TOEFL iBT Research Report. TOEFL iBT-07. ETS Research Report. RR-08-66

Peer reviewed

Stricker, Lawrence J.; Rock, Donald A. – ETS Research Report Series, 2008

This study assessed the invariance in the factor structure of the "Test of English as a Foreign Language"™ Internet-based test (TOEFL® iBT) across subgroups of test takers who differed in native language and exposure to the English language. The subgroups were defined by (a) Indo-European and Non-Indo-European language family, (b)…

Descriptors: Factor Structure, English (Second Language), Language Tests, Computer Assisted Testing

Decision Dependability of Subtests, Tests, and the Overall TOEFL Test Battery.

Brown, James Dean; Ross, Jacqueline A. – 1993

This study investigates the Test of English as a Foreign Language (TOEFL), in particular the relative contributions to score dependability (analogous to classical theory reliability) of various numbers of items and subtests as well as the decision dependability at different cut points. Research questions that apply to the overall TOEFL battery and…

Descriptors: English (Second Language), Language Tests, Statistical Analysis, Test Reliability
