Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Yovanoff, Paul; Tindal, Gerald – Exceptional Children, 2007
Alternatives to the standard statewide assessment often are necessary for valid measurement of students with significant disabilities. These alternate assessments must be carefully developed and evaluated with respect to generally accepted psychometric standards. Ideally, these measures should be sensitive to growth and scaled to state tests that…
Descriptors: Scaling, Early Reading, Grade 3, Test Results
OECD Publishing (NJ1), 2009
The Organisation for Economic Cooperation and Development's (OECD's) Programme for International Student Assessment (PISA) surveys, which take place every three years, have been designed to collect information about 15-year-old students in participating countries. PISA examines how well students are prepared to meet the challenges of the future,…
Descriptors: Policy Formation, Scaling, Academic Achievement, Interrater Reliability
Fuller, Robert G., Ed.; Campbell, Thomas C., Ed.; Dykstra, Dewey I., Jr., Ed.; Stevens, Scott M., Ed. – IAP - Information Age Publishing, Inc., 2009
This book is intended to offer college faculty members the insights of the development of reasoning movement that enlighten physics educators in the late 1970s and led to a variety of college programs directed at improving the reasoning patterns used by college students. While the original materials were directed at physics concepts, they quickly…
Descriptors: Constructivism (Learning), College Instruction, College Students, Textbooks
Ferrao, Maria – Assessment & Evaluation in Higher Education, 2010
The Bologna Declaration brought reforms into higher education that imply changes in teaching methods, didactic materials and textbooks, infrastructures and laboratories, etc. Statistics and mathematics are disciplines that traditionally have the worst success rates, particularly in non-mathematics core curricula courses. This research project,…
Descriptors: Foreign Countries, Computer Assisted Testing, Educational Technology, Educational Assessment
Qian, David D. – Language Assessment Quarterly, 2008
In the last 15 years or so, language testing practitioners have increasingly favored assessing vocabulary in context. The discrete-point vocabulary measure used in the old version of the Test of English as a Foreign Language (TOEFL) has long been criticized for encouraging test candidates to memorize wordlists out of context although test items…
Descriptors: Predictive Validity, Context Effect, Vocabulary, English (Second Language)
Allen, Nancy L.; And Others – 1993
A special case of examinee choice, the Optional Essay Problem, is examined from the point of view of test equating. The Optional Essay Problem involves equating essay scores when the examinees are required to select an optional essay topic from a list of topics in addition to taking a mandatory test required of all examinees. The conditions that…
Descriptors: Difficulty Level, Equated Scores, Essay Tests, Essays
Pashley, Peter J. – 1992
The detection of differential item functioning (DIF) has become an important psychometric research topic in recent years. A number of item response theory (IRT) methods for solving this problem have been suggested. A common approach is to calculate some function of the area between item response curves estimated from the subpopulations of…
Descriptors: Ability, Estimation (Mathematics), Identification, Item Bias
Dorans, Neil J.; Holland, Paul W. – 1992
At the Educational Testing Service, the Mantel-Haenszel procedure is used for differential item functioning (DIF) detection, and the standardization procedure is used to describe DIF. This report describes these procedures. First, an important distinction is made between DIF and impact, pointing to the need to compare the comparable. Then, these…
Descriptors: Comparative Analysis, Distractors (Tests), Identification, Item Bias
Gershon, Richard C. – 1991
The Johnson O'Connor Research Foundation, which produces vocabulary instructional materials for test takers, is in the process of determining the difficulty values of nontechnical words in the English language. To this end, the Foundation writes test items for vocabulary words and tests them in schools. The items are then calibrated using the…
Descriptors: Ability, Difficulty Level, Goodness of Fit, Item Response Theory
Lawrence, Ida M. – 1995
This study examined to what extent, if any, estimates of reliability for a multiple choice test are affected by the presence of large item sets where each set shares common reading material. The purpose of this research was to assess the effect of local item dependence on estimates of reliability for verbal portions of seven forms of the old and…
Descriptors: Estimation (Mathematics), High Schools, Multiple Choice Tests, Reading Tests
Enright, Mary K.; Bejar, Isaac I. – 1989
In this study, the ability of test development staff to predict the difficulty of analogy items was explored. The nature of the item attributes that contributed to test writers' predictions of difficulty as well as actual item difficulty was also investigated. The two expert test writers studied were quite good at predicting item difficulty. Item…
Descriptors: Analogy, Construct Validity, Difficulty Level, Models
Wightman, Lawrence E.; De Champlain, Andre F. – 1994
Two different methods of obtaining three parameter logistic item response theory (IRT) pretest item parameter estimated for the Graduate Management Admissions Testing Program. The first method consisted of calibrating pretest and operational items simultaneously in a LOGIST run, that is a concurrent calibration design. The second approach entailed…
Descriptors: Ability, Comparative Analysis, Estimation (Mathematics), Item Banks
Herman, William E. – 1996
Marks made by students on test item booklets were analyzed as a clue to better understanding of the metacognitive strategies employed during the completion of a 100-question multiple-choice final examination. Test item booklets of 56 undergraduates were scrutinized for the frequency of the following item markings; (1) no markings at all; (2)…
Descriptors: Higher Education, Metacognition, Multiple Choice Tests, Responses
Holweger, Nancy; Weston, Timothy – 1998
This study compares logistic discriminant function analysis for differential item functioning (DIF) with a technique for the detection of DIF that is based on item response theory rather than the Mantel-Haenszel procedure. In this study, the areas between the two item characteristic curves, also called the item characteristic curve method is…
Descriptors: Item Bias, Item Response Theory, Performance Based Assessment, State Programs
Kromrey, Jeffrey D.; Parshall, Cynthia G.; Yi, Qing – 1998
The effects of anchor test characteristics in the accuracy and precision of test equating in the "common items, nonequivalent groups design" were studied. The study also considered the effects of nonparallel based and new forms on the equating solution, and it investigated the effects of differential weighting on the success of equating…
Descriptors: Equated Scores, High Schools, Item Response Theory, Monte Carlo Methods

Peer reviewed
Direct link
