Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 3 |
Descriptor
Scaling | 10 |
English (Second Language) | 8 |
Language Tests | 5 |
Equated Scores | 4 |
Item Response Theory | 4 |
Scores | 4 |
Computer Assisted Testing | 3 |
Estimation (Mathematics) | 3 |
Simulation | 3 |
Test Items | 3 |
Adults | 2 |
More ▼ |
Source
ETS Research Report Series | 1 |
Language Assessment Quarterly | 1 |
Language Testing | 1 |
Language Testing in Asia | 1 |
Author
Hicks, Marilyn M. | 2 |
Ghaemi, Hamed | 1 |
Jamieson, Joan | 1 |
Jiang, Hai | 1 |
Morgan, Rick | 1 |
Oltman, Phillip K. | 1 |
Papageorgiou, Spiros | 1 |
Perkins, Kyle | 1 |
Poonpon, Kornwipa | 1 |
Reese, Clyde M. | 1 |
So, Youngsoon | 1 |
More ▼ |
Publication Type
Reports - Research | 6 |
Journal Articles | 4 |
Reports - Evaluative | 4 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Audience
Location
Iran | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 10 |
What Works Clearinghouse Rating
Ghaemi, Hamed – Language Testing in Asia, 2022
Listening comprehension in English, as one of the most fundamental skills, has an essential role in the process of learning English. Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) which determines the one-dimensionality and scalability of test. Mokken scaling techniques are a useful tool for…
Descriptors: Second Language Learning, English (Second Language), Nonparametric Statistics, Item Response Theory
Papageorgiou, Spiros; Xi, Xiaoming; Morgan, Rick; So, Youngsoon – Language Assessment Quarterly, 2015
This study presents the development and empirical validation of score levels and descriptors specifically designed for reporting purposes to provide test takers with more than just a number on a score scale. In the context of a test primarily intended for 11- to 15-year-old students learning English as a second/foreign language, the study examined…
Descriptors: Scores, Validity, Scaling, Classification
Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013
Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…
Descriptors: Oral Language, Language Proficiency, Scaling, Scores

Oltman, Phillip K.; Stricker, Lawrence J. – Language Testing, 1990
A recent multidimensional scaling analysis of the Test of English-as-a-Foreign-Language (TOEFL) item response data identified clusters of items in the test sections that, being more homogeneous than their parent sections, might be better for diagnostic use. The analysis was repeated using different scoring techniques. Results diverged only for…
Descriptors: English (Second Language), Item Analysis, Language Tests, Scaling
Perkins, Kyle – 2002
Guttman implicational scaling techniques were used to identify a unidimensional set of English as a Second Language reading comprehension items. Data were analyzed from 202 students who sat for an institutional administration of the Test of English as a Foreign Language (TOEFL). The examinees who contributed to the scalable set had significantly…
Descriptors: Adults, Classification, English (Second Language), Limited English Speaking
Jiang, Hai – 1999
The purpose of this paper is to describe the techniques used in establishing the concordance tables between the Test of English as a Foreign Language (TOEFL), paper and pencil (P&P), and computer-based testing (CBT) sections and total reported score scales. Listening, reading, and composite structure and essay scores plus a total score are…
Descriptors: Computer Assisted Testing, English (Second Language), Estimation (Mathematics), Scaling
Way, Walter D.; Reese, Clyde M. – 1991
The use of two alternative item response theory (IRT) estimation models in the scaling and equating of the Test of English as a Foreign Language (TOEFL) was explored; and item scaling and test equating results based on these models were compared with results based on the three-parameter (3PL) model currently being used with the TOEFL. Models were…
Descriptors: Correlation, Equated Scores, Estimation (Mathematics), Goodness of Fit
Tang, K. Linda; And Others – 1993
This study compared the performance of the LOGIST and BILOG computer programs on item response theory (IRT) based scaling and equating for the Test of English as a Foreign Language (TOEFL) using real and simulated data and two calibration structures. Applications of IRT for the TOEFL program are based on the three-parameter logistic (3PL) model.…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Estimation (Mathematics)
Hicks, Marilyn M. – 1984
Six methods of equating Test of English as a Foreign Language (TOEFL) test scores for samples consisting of the usual groups of examinees and groups controlled for native language representation were evaluated in terms of scale stability. The equating methods included three item response theory (IRT) variants (fixed b's scaling, a one-parameter…
Descriptors: College Entrance Examinations, Comparative Analysis, English (Second Language), Equated Scores
Hicks, Marilyn M. – 1989
Methods of computerized adaptive testing using conventional scoring methods in order to develop a computerized placement test for the Test of English as a Foreign Language (TOEFL) were studied. As a consequence of simulation studies during the first phase of the study, the multilevel testing paradigm was adopted to produce three test levels…
Descriptors: Adaptive Testing, Adults, Algorithms, Computer Assisted Testing