Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 4 |
Descriptor
Source
Educational Measurement:… | 1 |
Educational and Psychological… | 1 |
Journal of Educational… | 1 |
Language Assessment Quarterly | 1 |
Language Testing | 1 |
TESL-EJ | 1 |
Author
Wainer, Howard | 2 |
Bailey, Alison L. | 1 |
Berke, Sally | 1 |
Boldt, R. F. | 1 |
Bridgeman, Brent | 1 |
Cohen, Andrew D. | 1 |
Frantz, Roger S. | 1 |
Hicks, Marilyn M. | 1 |
Lukhele, Robert | 1 |
Nissan, Susan | 1 |
Perea, Luis | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 14 |
Journal Articles | 6 |
Historical Materials | 1 |
Information Analyses | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Test of English as a Foreign… | 14 |
Graduate Record Examinations | 1 |
Test of Written English | 1 |
What Works Clearinghouse Rating
Toker, Deniz – TESL-EJ, 2019
The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Bridgeman, Brent – Educational Measurement: Issues and Practice, 2016
Scores on essay-based assessments that are part of standardized admissions tests are typically given relatively little weight in admissions decisions compared to the weight given to scores from multiple-choice assessments. Evidence is presented to suggest that more weight should be given to these assessments. The reliability of the writing scores…
Descriptors: Multiple Choice Tests, Scores, Standardized Tests, Comparative Analysis
Frantz, Roger S.; Bailey, Alison L.; Starr, Laura; Perea, Luis – Language Assessment Quarterly, 2014
The current focus across the U.S. on student college and career readiness standards makes clear that both instruction and assessment of academic English will continue to be important for school-age English learner (EL) students. This article presents an overview and summary of key literature on academic language (usually academic English);…
Descriptors: Academic Discourse, English Language Learners, State Standards, Language Proficiency

Wainer, Howard; Wang, Xiaohui – Journal of Educational Measurement, 2000
Modified the three-parameter model to include an additional random effect for items nested within the same testlet. Fitted the new model to 86 testlets from the Test of English as a Foreign Language (TOEFL) and compared standard parameters (discrimination, difficulty, and guessing) with those obtained through traditional modeling. Discusses the…
Descriptors: English (Second Language), Language Tests, Scoring, Statistical Analysis
Cohen, Andrew D.; Upton, Thomas A. – Language Testing, 2007
This study describes the reading and test-taking strategies that test takers used on the "Reading" section of the "LanguEdge Courseware" (2002) materials developed to familiarize prospective respondents with the new TOEFL. The investigation focused on strategies used to respond to more traditional "single selection"…
Descriptors: Courseware, Language Tests, Test Wiseness, Language Teachers
Secolsky, Charles – 1989
The usual assessment of speededness for rights-only scored tests does not account for the possibility that examinees respond in a random or patterned fashion to the items at the end of the test as the time limit approaches. This study represented an attempt to determine if Sections 2 and 3 of the Test of English as a Foreign Language (TOEFL) are…
Descriptors: Adults, English (Second Language), Language Tests, Pretests Posttests

Wainer, Howard; Lukhele, Robert – Educational and Psychological Measurement, 1997
The reliability of scores from four forms of the Test of English as a Foreign Language (TOEFL) was estimated using a hybrid item response theory model. It was found that there was very little difference between overall reliability when the testlet items were assumed to be independent and when their dependence was modeled. (Author/SLD)
Descriptors: English (Second Language), Item Response Theory, Scores, Second Language Learning
Berke, Sally – 1979
Item content of the Test of English as a Foreign Language (TOEFL) is catagorized using 100 items from a half-length sample TOEFL published by Educational Testing Service (ETS), 120 items from Section II and 150 from the Section IV of Test of English as a Foreign Language (Gruber and Gruber), and 200 items from How to Prepare for the TOEFL by…
Descriptors: English (Second Language), Item Analysis, Language Tests, Research Needs
Hicks, Marilyn M. – 1988
Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…
Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis
Boldt, R. F. – 1994
The comparison of item response theory models for the Test of English as a Foreign Language (TOEFL) was extended to an equating context as simulation trials were used to "equate the test to itself." Equating sample data were generated from administration of identical item sets. Equatings that used procedures based on each model (simple…
Descriptors: Comparative Analysis, Cutting Scores, English (Second Language), Equated Scores
Nissan, Susan; And Others – 1996
One of the item types in the Listening Comprehension section of the Test of English as a Foreign Language (TOEFL) test is the dialogue. Because the dialogue item pool needs to have an appropriate balance of items at a range of difficulty levels, test developers have examined items at various difficulty levels in an attempt to identify their…
Descriptors: Classification, Dialogs (Language), Difficulty Level, English (Second Language)
Way, Walter D.; And Others – 1992
This study provided an exploratory investigation of item features that might contribute to a lack of invariance of item parameters for the Test of English as a Foreign Language (TOEFL). Data came from seven forms of the TOEFL administered in 1989. Subjective and quantitative measures developed for the study provided consistent information related…
Descriptors: Ability, English (Second Language), Goodness of Fit, Item Response Theory
Schedl, Mary; And Others – 1996
The issue of what exactly is measured by different types of reading items has been a matter of interest in the field of reading research for many years. Language teaching and testing specialists have raised the question of whether a reading test for foreign students wishing to enter a university in the United States should include questions…
Descriptors: Adults, English (Second Language), Factor Analysis, Factor Structure
Stansfield, Charles W. – 1986
A history of the Test of Written English (TWE), a section of the Test of English as a Foreign Language (TOEFL), describes its inception and development process. The new test is a thirty-minute essay test providing a measure of a non-native English-speaker's ability to perform academic writing tasks similar to those required of international…
Descriptors: Educational History, English (Second Language), Essay Tests, Foreign Students