ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Source

Educational Measurement:…	1
Educational and Psychological…	1
Journal of Educational…	1
Language Assessment Quarterly	1
Language Testing	1
TESL-EJ	1

Publication Type

Reports - Evaluative	14
Journal Articles	6
Historical Materials	1
Information Analyses	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Test of English as a Foreign…	14
Graduate Record Examinations	1
Test of Written English	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Topic Familiarity Matters: A Critical Analysis of TOEFL iBT Reading Section

Peer reviewed
PDF on ERIC

Download full text

Toker, Deniz – TESL-EJ, 2019

The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

Can a Two-Question Test Be Reliable and Valid for Predicting Academic Outcomes?

Peer reviewed

Direct link

Bridgeman, Brent – Educational Measurement: Issues and Practice, 2016

Scores on essay-based assessments that are part of standardized admissions tests are typically given relatively little weight in admissions decisions compared to the weight given to scores from multiple-choice assessments. Evidence is presented to suggest that more weight should be given to these assessments. The reliability of the writing scores…

Descriptors: Multiple Choice Tests, Scores, Standardized Tests, Comparative Analysis

Measuring Academic Language Proficiency in School-Age English Language Proficiency Assessments under New College and Career Readiness Standards in the United States

Peer reviewed

Direct link

Frantz, Roger S.; Bailey, Alison L.; Starr, Laura; Perea, Luis – Language Assessment Quarterly, 2014

The current focus across the U.S. on student college and career readiness standards makes clear that both instruction and assessment of academic English will continue to be important for school-age English learner (EL) students. This article presents an overview and summary of key literature on academic language (usually academic English);…

Descriptors: Academic Discourse, English Language Learners, State Standards, Language Proficiency

Using a New Statistical Model for Testlets To Score TOEFL.

Peer reviewed

Wainer, Howard; Wang, Xiaohui – Journal of Educational Measurement, 2000

Modified the three-parameter model to include an additional random effect for items nested within the same testlet. Fitted the new model to 86 testlets from the Test of English as a Foreign Language (TOEFL) and compared standard parameters (discrimination, difficulty, and guessing) with those obtained through traditional modeling. Discusses the…

Descriptors: English (Second Language), Language Tests, Scoring, Statistical Analysis

"I Want to Go Back to the Text": Response Strategies on the Reading Subtest of the New TOEFL[R]

Peer reviewed

Direct link

Cohen, Andrew D.; Upton, Thomas A. – Language Testing, 2007

This study describes the reading and test-taking strategies that test takers used on the "Reading" section of the "LanguEdge Courseware" (2002) materials developed to familiarize prospective respondents with the new TOEFL. The investigation focused on strategies used to respond to more traditional "single selection"…

Descriptors: Courseware, Language Tests, Test Wiseness, Language Teachers

Accounting for Random Responding at the End of the Test in Assessing Speededness on the Test of English as a Foreign Language. TOEFL Research Reports, Report 30.

Download full text

Secolsky, Charles – 1989

The usual assessment of speededness for rights-only scored tests does not account for the possibility that examinees respond in a random or patterned fashion to the items at the end of the test as the time limit approaches. This study represented an attempt to determine if Sections 2 and 3 of the Test of English as a Foreign Language (TOEFL) are…

Descriptors: Adults, English (Second Language), Language Tests, Pretests Posttests

How Reliable Are TOEFL Scores?

Peer reviewed

Wainer, Howard; Lukhele, Robert – Educational and Psychological Measurement, 1997

The reliability of scores from four forms of the Test of English as a Foreign Language (TOEFL) was estimated using a hybrid item response theory model. It was found that there was very little difference between overall reliability when the testlet items were assumed to be independent and when their dependence was modeled. (Author/SLD)

Descriptors: English (Second Language), Item Response Theory, Scores, Second Language Learning

My Encounter with the TOEFL: Studying the Structure and Written Expression Section.

Berke, Sally – 1979

Item content of the Test of English as a Foreign Language (TOEFL) is catagorized using 100 items from a half-length sample TOEFL published by Educational Testing Service (ETS), 120 items from Section II and 150 from the Section IV of Test of English as a Foreign Language (Gruber and Gruber), and 200 items from How to Prepare for the TOEFL by…

Descriptors: English (Second Language), Item Analysis, Language Tests, Research Needs

Analyzing the Option Effects of Difficult TOEFL Items with Low Biserials: Methods Developed for Use by Test Assemblers.

Download full text

Hicks, Marilyn M. – 1988

Several exploratory analyses of the fifths data generated by Test of English as a Foreign Language (TOEFL) item analyses were developed in order to evaluate the effects of options on the discriminability of difficult items and to identify difficult items with low, unreliable biserials that had been rejected by test developers, but for which…

Descriptors: Difficulty Level, Estimation (Mathematics), Identification, Item Analysis

Simulated Equating Using Several Item Response Curves.

Download full text

Boldt, R. F. – 1994

The comparison of item response theory models for the Test of English as a Foreign Language (TOEFL) was extended to an equating context as simulation trials were used to "equate the test to itself." Equating sample data were generated from administration of identical item sets. Equatings that used procedures based on each model (simple…

Descriptors: Comparative Analysis, Cutting Scores, English (Second Language), Equated Scores

An Analysis of Factors Affecting the Difficulty of Dialogue Items in TOEFL Listening Comprehension. TOEFL Research Reports, 51.

Download full text

Nissan, Susan; And Others – 1996

One of the item types in the Listening Comprehension section of the Test of English as a Foreign Language (TOEFL) test is the dialogue. Because the dialogue item pool needs to have an appropriate balance of items at a range of difficulty levels, test developers have examined items at various difficulty levels in an attempt to identify their…

Descriptors: Classification, Dialogs (Language), Difficulty Level, English (Second Language)

An Exploratory Study of Characteristics Related to IRT Item Parameter Invariance with the Test of English as a Foreign Language. TOEFL Technical Report.

Download full text

Way, Walter D.; And Others – 1992

This study provided an exploratory investigation of item features that might contribute to a lack of invariance of item parameters for the Test of English as a Foreign Language (TOEFL). Data came from seven forms of the TOEFL administered in 1989. Subjective and quantitative measures developed for the study provided consistent information related…

Descriptors: Ability, English (Second Language), Goodness of Fit, Item Response Theory

An Analysis of the Dimensionality of TOEFL Reading Comprehension Items. TOEFL Research Reports, 53.

Download full text

Schedl, Mary; And Others – 1996

The issue of what exactly is measured by different types of reading items has been a matter of interest in the field of reading research for many years. Language teaching and testing specialists have raised the question of whether a reading test for foreign students wishing to enter a university in the United States should include questions…

Descriptors: Adults, English (Second Language), Factor Analysis, Factor Structure

A History of the Test of Written English: The Developmental Year.

Download full text

Stansfield, Charles W. – 1986

A history of the Test of Written English (TWE), a section of the Test of English as a Foreign Language (TOEFL), describes its inception and development process. The new test is a thirty-minute essay test providing a measure of a non-native English-speaker's ability to perform academic writing tasks similar to those required of international…

Descriptors: Educational History, English (Second Language), Essay Tests, Foreign Students

Test Items	14
English (Second Language)	12
Language Tests	12
Second Language Learning	5
Test Construction	5
Item Response Theory	4
Test Format	4
Item Analysis	3
Language Proficiency	3
Multiple Choice Tests	3
Reading Comprehension	3
Reading Tests	3
Standardized Tests	3
Adults	2
Comparative Analysis	2
Difficulty Level	2
Higher Education	2
Pretests Posttests	2
Regression (Statistics)	2
Scores	2
Scoring	2
Statistical Analysis	2
Test Wiseness	2
Ability	1
Academic Discourse	1
More ▼

Wainer, Howard	2
Bailey, Alison L.	1
Berke, Sally	1
Boldt, R. F.	1
Bridgeman, Brent	1
Cohen, Andrew D.	1
Frantz, Roger S.	1
Hicks, Marilyn M.	1
Lukhele, Robert	1
Nissan, Susan	1
Perea, Luis	1
Schedl, Mary	1
Secolsky, Charles	1
Stansfield, Charles W.	1
Starr, Laura	1
Toker, Deniz	1
Upton, Thomas A.	1
Wang, Xiaohui	1
Way, Walter D.	1
More ▼