ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Item Response Theory	14
Test Items	14
English (Second Language)	13
Language Tests	11
Second Language Learning	9
Reading Comprehension	5
Difficulty Level	4
Scores	4
Test Construction	4
Comparative Analysis	3
Foreign Countries	3
Language Proficiency	3
Models	3
Statistical Analysis	3
Cloze Procedure	2
Computer Assisted Testing	2
Estimation (Mathematics)	2
Factor Analysis	2
Grammar	2
Identification	2
Item Analysis	2
Language Skills	2
Listening Comprehension	2
Multiple Choice Tests	2
Psychometrics	2
More ▼

Source

ETS Research Report Series	3
Language Testing	2
Educational and Psychological…	1
InSight: A Journal of…	1
Language Assessment Quarterly	1
Psicologica: International…	1

Publication Type

Reports - Research	10
Journal Articles	9
Reports - Evaluative	4

Education Level

Higher Education	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

France	1
Greece	1
Iran	1
Japan (Tokyo)	1
South Korea	1
Vietnam	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 14 results Save | Export

The Impact of Using Synthetically Generated Listening Stimuli on Test-Taker Performance: A Case Study with Multiple-Choice, Single-Selection Items. TOEFL® Research Reports. RR-98. ETS?RR-22-05

Peer reviewed
PDF on ERIC

Download full text

Choi, Ikkyu; Zu, Jiyun – ETS Research Report Series, 2022

Synthetically generated speech (SGS) has become an integral part of our oral communication in a wide variety of contexts. It can be generated instantly at a low cost and allows precise control over multiple aspects of output, all of which can be highly appealing to second language (L2) assessment developers who have traditionally relied upon human…

Descriptors: Test Wiseness, Multiple Choice Tests, Test Items, Difficulty Level

Making Better Tests with the Rasch Measurement Model

Peer reviewed
PDF on ERIC

Download full text

Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018

This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…

Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy

Modeling Local Item Dependence in Cloze and Reading Comprehension Test Items Using Testlet Response Theory

Peer reviewed
PDF on ERIC

Download full text

Baghaei, Purya; Ravand, Hamdollah – Psicologica: International Journal of Methodology and Experimental Psychology, 2016

In this study the magnitudes of local dependence generated by cloze test items and reading comprehension items were compared and their impact on parameter estimates and test precision was investigated. An advanced English as a foreign language reading comprehension test containing three reading passages and a cloze test was analyzed with a…

Descriptors: Cloze Procedure, Reading, Reading Comprehension, Reading Skills

Assessing the Test Information Function and Differential Item Functioning for the "TOEFL Junior"® Standard Test. Research Report. ETS RR-13-17. "TOEFL Junior"® Research Report. TOEFL JR-01

Peer reviewed
PDF on ERIC

Download full text

Young, John W.; Morgan, Rick; Rybinski, Paul; Steinberg, Jonathan; Wang, Yuan – ETS Research Report Series, 2013

The "TOEFL Junior"® Standard Test is an assessment that measures the degree to which middle school-aged students learning English as a second language have attained proficiency in the academic and social English skills representative of English-medium instructional environments. The assessment measures skills in three areas: listening…

Descriptors: Item Response Theory, Test Items, Language Tests, Second Language Learning

Q-Matrix Construction: Defining the Link between Constructs and Test Items in Large-Scale Reading and Listening Comprehension Assessments

Peer reviewed

Direct link

Sawaki, Yasuyo; Kim, Hae-Jin; Gentile, Claudia – Language Assessment Quarterly, 2009

In cognitive diagnosis a Q-matrix (Tatsuoka, 1983, 1990), which is an incidence matrix that defines the relationships between test items and constructs of interest, has great impact on the nature of performance feedback that can be provided to score users. The purpose of the present study was to identify meaningful skill coding categories that…

Descriptors: Feedback (Response), Test Items, Test Content, Identification

How Reliable Are TOEFL Scores?

Peer reviewed

Wainer, Howard; Lukhele, Robert – Educational and Psychological Measurement, 1997

The reliability of scores from four forms of the Test of English as a Foreign Language (TOEFL) was estimated using a hybrid item response theory model. It was found that there was very little difference between overall reliability when the testlet items were assumed to be independent and when their dependence was modeled. (Author/SLD)

Descriptors: English (Second Language), Item Response Theory, Scores, Second Language Learning

An Investigation of IRT-Based Assembly of the TOEFL Test. TOEFL Technical Report.

Download full text

Chyn, Susan; And Others – 1995

The current study, carried out jointly by Test Development and Statistical Analysis staff at Educational Testing Service investigated the feasibility of the Automated Item Selection (AIS) procedure for the Test of English as a Foreign Language (TOEFL). Item-response theory (IRT)-based statistical specifications were developed. Two TOEFL test forms…

Descriptors: English (Second Language), Item Banks, Item Response Theory, Language Tests

Analyzing the Option Effects of Difficult TOEFL Items with Low Biserials: Methods Developed for Use by Test Assemblers.

Download full text

Hicks, Marilyn M. – 1988

Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…

Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis

Crossvalidation of Item Response Curve Models Using TOEFL Data.

Peer reviewed

Boldt, Robert F. – Language Testing, 1992

The assumption called PIRC (proportional item response curve) was tested in which PIRC was used to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logist model, and a modified Rasch model. (12 references) (Author/LB)

Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory

Simulated Equating Using Several Item Response Curves.

Download full text

Boldt, R. F. – 1994

The comparison of item response theory models for the Test of English as a Foreign Language (TOEFL) was extended to an equating context as simulation trials were used to "equate the test to itself." Equating sample data were generated from administration of identical item sets. Equatings that used procedures based on each model (simple…

Descriptors: Comparative Analysis, Cutting Scores, English (Second Language), Equated Scores

An Exploratory Study of Characteristics Related to IRT Item Parameter Invariance with the Test of English as a Foreign Language. TOEFL Technical Report.

Download full text

Way, Walter D.; And Others – 1992

This study provided an exploratory investigation of item features that might contribute to a lack of invariance of item parameters for the Test of English as a Foreign Language (TOEFL). Data came from seven forms of the TOEFL administered in 1989. Subjective and quantitative measures developed for the study provided consistent information related…

Descriptors: Ability, English (Second Language), Goodness of Fit, Item Response Theory

Multiple-Choice Cloze Items and the Test of English as a Foreign Language. TOEFL Research Reports 26.

Download full text

Hale, Gordon A.; And Others – 1988

This study examined the relation of performance on the Test of English as a Foreign Language (TOEFL) to a widely used variant of the cloze procedure, the multiple choice (MC) cloze method. Examinees taking an operational TOEFL (n=11,290) were given three basic sections of the test along with a section containing prepared MC cloze items, and…

Descriptors: Adults, Cloze Procedure, English (Second Language), Estimation (Mathematics)

The Factor Structure of Test Task Characteristics and Examinee Performance

Peer reviewed

Direct link

Carr, Nathan T. – Language Testing, 2006

The present study focuses on the task characteristics of reading passages and key sentences in a test of second language reading. Using a new methodological approach to describe variation in test task characteristics and explore how differences in these characteristics might relate to examinee performance, it posed the two following research…

Descriptors: English for Academic Purposes, Sentences, Reading Comprehension, Factor Analysis

A General Diagnostic Model Applied to Language Testing Data. Research Report. ETS RR-05-16

Peer reviewed
PDF on ERIC

Download full text

von Davier, Matthias – ETS Research Report Series, 2005

Probabilistic models with more than one latent variable are designed to report profiles of skills or cognitive attributes. Testing programs want to offer additional information beyond what a single test score can provide using these skill profiles. Many recent approaches to skill profile models are limited to dichotomous data and have made use of…

Descriptors: Models, Diagnostic Tests, Language Tests, Language Proficiency

Baghaei, Purya	1
Boldt, R. F.	1
Boldt, Robert F.	1
Carr, Nathan T.	1
Choi, Ikkyu	1
Chyn, Susan	1
Gentile, Claudia	1
Hale, Gordon A.	1
Hicks, Marilyn M.	1
Karlin, Omar	1
Karlin, Sayaka	1
Kim, Hae-Jin	1
Lukhele, Robert	1
Morgan, Rick	1
Ravand, Hamdollah	1
Rybinski, Paul	1
Sawaki, Yasuyo	1
Steinberg, Jonathan	1
Wainer, Howard	1
Wang, Yuan	1
Way, Walter D.	1
Young, John W.	1
Zu, Jiyun	1
von Davier, Matthias	1
More ▼