Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedScheuneman, Janice – Journal of Educational Measurement, 1979
This paper presents a chi square method for assessing bias in test items. In this procedure an unbiased item is defined as an item for which the probability of a correct response is the same for any person of a given ability level regardless of that person's ethnic group. (Author/CTM)
Descriptors: Cultural Differences, Culture Fair Tests, Item Analysis, Primary Education
Peer reviewedPlake, Barbara S.; And Others – Journal of Experimental Education, 1981
Number right and elimination scores were analyzed on a college level mathematics exam assembled from pretest data. Anxiety measures were administered along with the experimental forms to undergraduates. Results suggest that neither test scores nor attitudes are influenced by item order knowledge thereof, or anxiety level. (Author/GK)
Descriptors: College Mathematics, Difficulty Level, Higher Education, Multiple Choice Tests
Imrie, B. W. – Assessment in Higher Education, 1979
Three case studies are presented that describe variations used to obtain student perceptions of examinations and examination questions. Student perceptions can provide feedback about the quality of the examination. Results are presented and discussed and some examples of test evaluation questions are given. (Author/MLW)
Descriptors: Case Studies, Educational Testing, Feedback, Higher Education
Peer reviewedRoid, Gale; Haladyna, Tom – Review of Educational Research, 1980
A continuum of item-writing methods is proposed ranging from informal-subjective methods to algorithmic-objective methods. Examples of techniques include objective-based item writing, amplified objectives, item forms, facet design, domain-referenced concept testing, and computerized techniques. (Author/CP)
Descriptors: Achievement Tests, Algorithms, Computer Assisted Testing, Criterion Referenced Tests
Peer reviewedForsyth, Robert A.; Spratt, Kevin F. – Journal of Educational Measurement, 1980
The effects of two item formats on item difficulty and item discrimination indices for mathematics problem solving multiple-choice tests were investigated. One format required identifying the proper "set-up" for the item; the other format required complete solving of the item. (Author/JKS)
Descriptors: Difficulty Level, Junior High Schools, Multiple Choice Tests, Problem Solving
Peer reviewedAlderman, Donald L.; Powers, Donald E. – American Educational Research Journal, 1980
Control groups' scores on the Scholastic Aptitude Test-Verbal were compared to scores of secondary school students who were exposed to their school's special test preparation programs. Special preparation resulted in an overall difference of eight points on the total score, corresponding to one additional correctly answered item. (Author/GDC)
Descriptors: College Bound Students, College Entrance Examinations, High Schools, Program Effectiveness
Peer reviewedPlake, Barbara S. – Educational and Psychological Measurement, 1980
Analysis of variance and subjective rating by curriculum specialists were used to identify biased items on the Iowa Tests of Basic Skills. Results show little agreement between statistical and subjective methods. Test developers should statistically support a reviewer's selection of biased items. (Author/CP)
Descriptors: Achievement Tests, Analysis of Variance, Elementary Education, Evaluation Methods
Peer reviewedMarr, Mary Beth; Lyon, Kathleen R. – Reading Psychology, 1981
Reveals that the "Classroom Reading Inventory" is the most passage independent of three informal reading inventories administered to fourth grade students. Shows that the general information, vocabulary meaning, and affective question categories tended to be particularly passage independent. (FL)
Descriptors: Content Analysis, Grade 4, Informal Reading Inventories, Intermediate Grades
Peer reviewedWhite, Edward M. – Journal of Basic Writing, 1978
Presents remarks on essay testing made by Edward M. White, an account of procedures for developing essay questions for the California State University and Colleges (CSUC) English Equivalency Examination, the 1974 essay scale with accompanying writing samples, and the questions and scale used in the 1975 test. (RL)
Descriptors: Equivalency Tests, Essay Tests, Higher Education, Models
Peer reviewedAlexander, John J. – Journal of Chemical Education, 1979
Describes an exam question which challenges college freshmen, enrolled in chemistry, to derive temperature dependence of an equilibrium constant. The question requires cognitive response at the level of synthesis. (Author/SA)
Descriptors: Chemical Equilibrium, Chemical Reactions, Chemistry, College Freshmen
Peer reviewedBennett, Randy Elliot; Steffen, Manfred; Singley, Mark Kevin; Morley, Mary; Jacquemin, Daniel – Journal of Educational Measurement, 1997
Scoring accuracy and item functioning were studied for an open-ended response type test in which correct answers can take many different surface forms. Results with 1,864 graduate school applicants showed automated scoring to approximate the accuracy of multiple-choice scoring. Items functioned similarly to other item types being considered. (SLD)
Descriptors: Adaptive Testing, Automation, College Applicants, Computer Assisted Testing
Peer reviewedSolano-Flores, Guillermo; Shavelson, Richard J. – Educational Measurement: Issues and Practice, 1997
Conceptual, practical, and logistical issues in the development of science performance assessments (SPAs) are discussed. The conceptual framework identifies task, response format, and scoring system as components, and conceives of SPAs as tasks that attempt to recreate conditions in which scientists work. Developing SPAs is a sophisticated effort…
Descriptors: Elementary Secondary Education, Performance Based Assessment, Science Education, Science Tests
Peer reviewedScheuneman, Janice Dowd; Grima, Angela – Applied Measurement in Education, 1997
Differential item functioning for female-male and black-white groups and two samples of white males who differed in mathematical training were studied using data from 104 items from the Graduate Record Examination. Verbal properties of items were associated with differential performance of men and women but not of blacks and whites. (SLD)
Descriptors: Black Students, Females, Item Bias, Mathematics Achievement
Learning about Students' Knowledge and Thinking in Science through Large-Scale Quantitative Studies.
Peer reviewedOlsen, Rolf V.; Turmo, Are; Lie, Svein – European Journal of Psychology of Education, 2001
Discusses how responses to multiple-choice items could be interpreted, demonstrates how responses on constructed-response items can be analyzed, and examines interactions between item characteristics and student responses. Uses information, specifically items and student responses, from the Third International Mathematics and Science Study…
Descriptors: Educational Research, Higher Education, Mathematics Education, Science Education
Peer reviewedO'Leary, Michael – Educational Measurement: Issues and Practice, 2002
Examined the performance of Irish students on multiple-choice, short-answer, and extended-response item sets from the Third International Mathematics and Science Study to determine whether Ireland's relative rank among the more than 40 countries involved remained stable. Findings provide additional evidence that comparing student achievement…
Descriptors: Comparative Analysis, Foreign Countries, International Education, Mathematics Achievement


