Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Henning, Grant – 1991
Criticisms of the Test of English as a Foreign Language (TOEFL) have included speculation that the listening test places too much burden on short-term memory as compared with comprehension, that a knowledge of reading is required to respond successfully, and that many items appear to require mere recall and matching rather than higher-order…
Descriptors: Adults, Auditory Stimuli, Cognitive Processes, Educational Assessment
Webb, Melvin W., II; Miller, Eva R. – 1995
As constructed-response items become an integral part of educational assessments, setting student performance standards on constructed-response items has become an important issue. Two standard-setting methods, one used for setting standards on the National Assessment of Educational Progress (NAEP) in reading in grade 8 and the other used to set…
Descriptors: Comparative Analysis, Constructed Response, Criteria, Educational Assessment
Allen, Nancy L.; Donoghue, John R. – 1995
This Monte Carlo study examined the effect of complex sampling of items on the measurement of differential item functioning (DIF) using the Mantel-Haenszel procedure. Data were generated using a three-parameter logistic item response theory model according to the balanced incomplete block (BIB) design used in the National Assessment of Educational…
Descriptors: Computer Assisted Testing, Difficulty Level, Elementary Secondary Education, Identification
Way, Walter D.; And Others – 1992
This study provided an exploratory investigation of item features that might contribute to a lack of invariance of item parameters for the Test of English as a Foreign Language (TOEFL). Data came from seven forms of the TOEFL administered in 1989. Subjective and quantitative measures developed for the study provided consistent information related…
Descriptors: Ability, English (Second Language), Goodness of Fit, Item Response Theory
Frary, Robert B. – 1995
This digest presents a list of recommendations for writing multiple-choice test items, based on psychometrics and logical deduction. Questions should ask more than mere knowledge of facts and should not contain superfluous information as an introduction to the question. Each question should focus on some specific aspect of the course, and the item…
Descriptors: Culture Fair Tests, Distractors (Tests), Educational Assessment, Item Bias
Angoff, William H. – 1989
This study was undertaken to test the hypothesis that items of the Test of English as a Foreign Language (TOEFL) containing reference to American people, places, customs, etc., tend to favor examinees who have spent some time living in the United States. Two samples of examinees were drawn from the March 1987 TOEFL administration, one tested in…
Descriptors: Context Effect, English (Second Language), Evaluators, Foreign Nationals
Buser, Karen – 1996
Most seasoned test developers recognize the importance of thoughtful decision making when constructing a test. Unfortunately, many classroom achievement tests are created by novice test developers who have not received sufficient instruction in item writing (G. Gulliksen, 1986; R. J. Stiggins, 1991). The result is often a test that is poorly…
Descriptors: Achievement Tests, Decision Making, Educational Planning, Evaluation Methods
Carlson, Sybil B. – 1988
The reasoning skills tapped by the analytical measure of the Graduate Record Examinations were studied by examining how performance on its constituent item types relates to alternative criteria. Another objective was to ascertain the extent to which additional information on examinees' analytical skills might be obtained from further analyses of…
Descriptors: Arabic, Chinese, College Entrance Examinations, College Students
Hale, Gordon A.; And Others – 1988
This study examined the relation of performance on the Test of English as a Foreign Language (TOEFL) to a widely used variant of the cloze procedure, the multiple choice (MC) cloze method. Examinees taking an operational TOEFL (n=11,290) were given three basic sections of the test along with a section containing prepared MC cloze items, and…
Descriptors: Adults, Cloze Procedure, English (Second Language), Estimation (Mathematics)
Kaiser, Paul D.; Brull, Harry – 1994
The design, administration, scoring, and results of the 1993 New York State Correctional Captain Examination are described. The examination was administered to 405 candidates. As in previous Sergeant and Lieutenant examinations, candidates also completed latent image written simulation problems and open/closed book multiple choice test components.…
Descriptors: Competitive Selection, Correctional Rehabilitation, Decision Making, Educational Innovation
Roos, Linda L.; And Others – 1992
Computerized adaptive (CA) testing uses an algorithm to match examinee ability to item difficulty, while self-adapted (SA) testing allows the examinee to choose the difficulty of his or her items. Research comparing SA and CA testing has shown that examinees experience lower anxiety and improved performance with SA testing. All previous research…
Descriptors: Ability Identification, Adaptive Testing, Algebra, Algorithms
Fan, Xitao; And Others – 1994
The hypothesis that faulty classical psychometric and sampling procedures in test construction could generate systematic bias against ethnic groups with smaller representation in the test construction sample was studied empirically. Two test construction models were developed: one with differential representation of ethnic groups (White, African…
Descriptors: Ethnic Groups, Genetics, High School Students, High Schools
Cardoso, Rosana M. F. – Texas Papers in Foreign Language Education, 1998
This study analyzed English language tests administered in Brazil as part of a university entrance examination, focusing on the authenticity of its tests of second language reading comprehension, the concept of reading as an interactive process between reader and text, a proficiency-based view of language instruction, and the psychometric…
Descriptors: College Entrance Examinations, English (Second Language), Foreign Countries, Higher Education
Angelo, Thomas A.; Cross, K. Patricia – 1993
This handbook has been written for college teachers regardless of their prior training in pedagogy, assessment, or education. It is a practical handbook, designed for easy reference. Part 1 can provide either an introduction to Classroom Assessment or a comprehensive review, depending on the reader's prior experience. The first chapter explains…
Descriptors: Case Studies, Classroom Techniques, College Faculty, Educational Assessment
Parshall, Cynthia G.; Stewart, Rob; Ritter, Judy – 1996
While computer-based tests might be as simple as computerized versions of paper-and-pencil examinations, more innovative applications also exist. Examples of innovations in computer-based assessment include the use of graphics or sound, some measure of interactivity, a change in the means by which examinees respond to items, and the application…
Descriptors: College Students, Computer Assisted Testing, Educational Innovation, Graphic Arts


