ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	3

Source

ETS Research Report Series	1
Language Assessment Quarterly	1
Language Testing	1
Language Testing in Asia	1

Author

Hicks, Marilyn M.	2
Ghaemi, Hamed	1
Jamieson, Joan	1
Jiang, Hai	1
Morgan, Rick	1
Oltman, Phillip K.	1
Papageorgiou, Spiros	1
Perkins, Kyle	1
Poonpon, Kornwipa	1
Reese, Clyde M.	1
So, Youngsoon	1
Stricker, Lawrence J.	1
Tang, K. Linda	1
Way, Walter D.	1
Xi, Xiaoming	1
More ▼

Publication Type

Reports - Research	6
Journal Articles	4
Reports - Evaluative	4
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2

Audience

Location

Iran

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Application of Nonparametric Item Response Theory in Determining the One-Dimensionality and Scalability of TOEFL iBT Listening Test

Peer reviewed

Direct link

Ghaemi, Hamed – Language Testing in Asia, 2022

Listening comprehension in English, as one of the most fundamental skills, has an essential role in the process of learning English. Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) which determines the one-dimensionality and scalability of test. Mokken scaling techniques are a useful tool for…

Descriptors: Second Language Learning, English (Second Language), Nonparametric Statistics, Item Response Theory

Developing and Validating Band Levels and Descriptors for Reporting Overall Examinee Performance

Peer reviewed

Direct link

Papageorgiou, Spiros; Xi, Xiaoming; Morgan, Rick; So, Youngsoon – Language Assessment Quarterly, 2015

This study presents the development and empirical validation of score levels and descriptors specifically designed for reporting purposes to provide test takers with more than just a number on a score scale. In the context of a test primarily intended for 11- to 15-year-old students learning English as a second/foreign language, the study examined…

Descriptors: Scores, Validity, Scaling, Classification

Developing Analytic Rating Guides for "TOEFL iBT"® Integrated Speaking Tasks. "TOEFL iBT"® Research Report, TOEFL iBT-20. ETS Research Report. RR-13-13

Peer reviewed
PDF on ERIC

Download full text

Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013

Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…

Descriptors: Oral Language, Language Proficiency, Scaling, Scores

Developing Homogeneous TOEFL Scales by Multidimensional Scaling.

Peer reviewed

Oltman, Phillip K.; Stricker, Lawrence J. – Language Testing, 1990

A recent multidimensional scaling analysis of the Test of English-as-a-Foreign-Language (TOEFL) item response data identified clusters of items in the test sections that, being more homogeneous than their parent sections, might be better for diagnostic use. The analysis was repeated using different scoring techniques. Results diverged only for…

Descriptors: English (Second Language), Item Analysis, Language Tests, Scaling

A Scalable Set of ESL Reading Comprehension Items.

Download full text

Perkins, Kyle – 2002

Guttman implicational scaling techniques were used to identify a unidimensional set of English as a Second Language reading comprehension items. Data were analyzed from 202 students who sat for an institutional administration of the Test of English as a Foreign Language (TOEFL). The examinees who contributed to the scalable set had significantly…

Descriptors: Adults, Classification, English (Second Language), Limited English Speaking

Estimation of Score Distributions for TOEFL Concordance Tables.

Download full text

Jiang, Hai – 1999

The purpose of this paper is to describe the techniques used in establishing the concordance tables between the Test of English as a Foreign Language (TOEFL), paper and pencil (P&P), and computer-based testing (CBT) sections and total reported score scales. Listening, reading, and composite structure and essay scores plus a total score are…

Descriptors: Computer Assisted Testing, English (Second Language), Estimation (Mathematics), Scaling

An Investigation of the Use of Simplified IRT Models for Scaling and Equating the TOEFL Test. TOEFL Technical Report TR-2.

Download full text

Way, Walter D.; Reese, Clyde M. – 1991

The use of two alternative item response theory (IRT) estimation models in the scaling and equating of the Test of English as a Foreign Language (TOEFL) was explored; and item scaling and test equating results based on these models were compared with results based on the three-parameter (3PL) model currently being used with the TOEFL. Models were…

Descriptors: Correlation, Equated Scores, Estimation (Mathematics), Goodness of Fit

The Effect of Small Calibration Sample Sizes on TOEFL IRT-Based Equating.

Download full text

Tang, K. Linda; And Others – 1993

This study compared the performance of the LOGIST and BILOG computer programs on item response theory (IRT) based scaling and equating for the Test of English as a Foreign Language (TOEFL) using real and simulated data and two calibration structures. Applications of IRT for the TOEFL program are based on the three-parameter logistic (3PL) model.…

Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Estimation (Mathematics)

A Comparative Study of Methods of Equating TOEFL Test Scores.

Download full text

Hicks, Marilyn M. – 1984

Six methods of equating Test of English as a Foreign Language (TOEFL) test scores for samples consisting of the usual groups of examinees and groups controlled for native language representation were evaluated in terms of scale stability. The equating methods included three item response theory (IRT) variants (fixed b's scaling, a one-parameter…

Descriptors: College Entrance Examinations, Comparative Analysis, English (Second Language), Equated Scores

The TOEFL Computerized Placement Test: Adaptive Conventional Measurement. TOEFL Research Reports, Report 31.

Download full text

Hicks, Marilyn M. – 1989

Methods of computerized adaptive testing using conventional scoring methods in order to develop a computerized placement test for the Test of English as a Foreign Language (TOEFL) were studied. As a consequence of simulation studies during the first phase of the study, the multilevel testing paradigm was adopted to produce three test levels…

Descriptors: Adaptive Testing, Adults, Algorithms, Computer Assisted Testing

Scaling	10
English (Second Language)	8
Language Tests	5
Equated Scores	4
Item Response Theory	4
Scores	4
Computer Assisted Testing	3
Estimation (Mathematics)	3
Simulation	3
Test Items	3
Adults	2
Classification	2
Comparative Analysis	2
Sample Size	2
Second Language Learning	2
Statistical Analysis	2
Test Construction	2
Test Validity	2
Adaptive Testing	1
Algorithms	1
Chinese	1
College Admission	1
College Entrance Examinations	1
College Students	1
Computer Literacy	1
More ▼