ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	7

Source

Language Testing	2
Education and Information…	1
Educational Testing Service	1
InSight: A Journal of…	1
International Journal of…	1
Language Testing in Asia	1
Online Submission	1
TESL-EJ	1
Thought Currents in English…	1

Publication Type

Reports - Research	9
Journal Articles	8
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	2
Elementary Education	1
Postsecondary Education	1

Audience

Location

California	1
Canada	1
China	1
Connecticut	1
Georgia	1
Indiana	1
Iowa	1
Japan	1
Japan (Tokyo)	1
Michigan	1
United States	1
Wisconsin	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	12
Computer Attitude Scale	1
International English…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

A Cognitive Diagnostic Assessment Study of the Reading Comprehension Section of the Preliminary English Test (PET)

Peer reviewed
PDF on ERIC

Download full text

Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023

Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…

Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)

Exploring the Validity Evidence of a High-Stake, Second Language Reading Test: An Eye-Tracking Study

Peer reviewed

Direct link

Lim, Hyojung – Language Testing in Asia, 2020

The current study aims to explore the cognitive validity of the iBT TOEFL reading test by investigating test takers' eye movements on individual items. It is assumed that successful test takers would adopt the intended reading processes, the same types and levels of cognitive processes that they would use for real-world reading tasks. Forty-seven…

Descriptors: Test Validity, High Stakes Tests, Second Language Learning, Language Tests

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Direct link

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Topic Familiarity Matters: A Critical Analysis of TOEFL iBT Reading Section

Peer reviewed
PDF on ERIC

Download full text

Toker, Deniz – TESL-EJ, 2019

The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

Making Better Tests with the Rasch Measurement Model

Peer reviewed
PDF on ERIC

Download full text

Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018

This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…

Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy

Age, Task Characteristics, and Acoustic Indicators of Engagement: Investigations into the Validity of a Technology-Enhanced Speaking Test for Young Language Learners

Download full text

Edward Paul Getman – Online Submission, 2020

Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement

Download full text

Sawaki, Yasuyo; Nissan, Susan – Educational Testing Service, 2009

The study investigated the criterion-related validity of the "Test of English as a Foreign Language"[TM] Internet-based test (TOEFL[R] iBT) Listening section by examining its relationship to a criterion measure designed to reflect language-use tasks that university students encounter in everyday academic life: listening to academic…

Descriptors: Test Validity, Language Tests, English (Second Language), Computer Assisted Testing

Crossvalidation of Item Response Curve Models Using TOEFL Data.

Peer reviewed

Boldt, Robert F. – Language Testing, 1992

The assumption called PIRC (proportional item response curve) was tested in which PIRC was used to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logist model, and a modified Rasch model. (12 references) (Author/LB)

Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory

The Relationship Between TOEFL Vocabulary Items and Meaning, Association, Collocation, and Word-Class Knowledge.

Peer reviewed

Schmitt, Norbert – Language Testing, 1999

One way of determining construct validity of vocabulary items in language tests is to interview subjects directly after taking the items to ascertain what is known about the target words in question. This approach was combined within the framework of lexical competency in a study of the behavior of lexical items on the Test of English as a Foreign…

Descriptors: Associative Learning, Construct Validity, English (Second Language), Foreign Countries

An Investigation of Proposed Revisions to Section 3 of the TOEFL Test. TOEFL Research Report 47.

Download full text

Schedl, Mary; And Others – 1995

The Test of English as a Foreign Language (TOEFL) program is exploring a change in Section 3 of the TOEFL test that would replace the vocabulary subpart with additional reading comprehension questions. This study investigated the proposed revision in terms of the length and timing that would be necessary to address concerns of test speededness of…

Descriptors: Adult Students, English (Second Language), Language Tests, Psychometrics

Context Bias in the Test of English as a Foreign Language.

Download full text

Angoff, William H. – 1989

This study was undertaken to test the hypothesis that items of the Test of English as a Foreign Language (TOEFL) containing reference to American people, places, customs, etc., tend to favor examinees who have spent some time living in the United States. Two samples of examinees were drawn from the March 1987 TOEFL administration, one tested in…

Descriptors: Context Effect, English (Second Language), Evaluators, Foreign Nationals

A Survey of Issues and Item Writing in Language Testing.

Download full text

Strong, Gregory – Thought Currents in English Literature, 1995

This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning…

Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries

English (Second Language)	12
Test Items	12
Test Validity	12
Language Tests	11
Second Language Learning	7
Foreign Countries	6
Reading Tests	6
Computer Assisted Testing	5
Test Construction	5
Reading Comprehension	4
Scores	4
Test Reliability	4
Difficulty Level	3
High Stakes Tests	3
Interrater Reliability	3
College Students	2
Computer Software	2
Construct Validity	2
Item Response Theory	2
Models	2
Multiple Choice Tests	2
Reading Skills	2
Second Language Instruction	2
Statistical Analysis	2
Test Format	2
More ▼

Alghazali, Tawfeeq	1
Angoff, William H.	1
Boldt, Robert F.	1
Dawood, Abdul Kareem Shareef	1
Edward Paul Getman	1
Kadhim, Qasim Khlaif	1
Karlin, Omar	1
Karlin, Sayaka	1
Lim, Hyojung	1
Mohammed, Aisha	1
Nissan, Susan	1
Sabit, Shaker Holh	1
Sabti, Ahmed Abdulateef	1
Sawaki, Yasuyo	1
Schedl, Mary	1
Schmitt, Norbert	1
Strong, Gregory	1
Toker, Deniz	1
Toroujeni, Seyyed Morteza…	1
More ▼