Publication Date
| In 2026 | 0 |
| Since 2025 | 200 |
| Since 2022 (last 5 years) | 1070 |
| Since 2017 (last 10 years) | 2580 |
| Since 2007 (last 20 years) | 4941 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedWainer, Howard; Kiely, Gerard L. – Journal of Educational Measurement, 1987
The testlet, a bundle of test items, alleviates some problems associated with computerized adaptive testing: context effects, lack of robustness, and item difficulty ordering. While testlets may be linear or hierarchical, the most useful ones are four-level hierarchical units, containing 15 items and partitioning examinees into 16 classes. (GDC)
Descriptors: Adaptive Testing, Computer Assisted Testing, Context Effect, Item Banks
Peer reviewedBond, Lloyd – Educational Measurement: Issues and Practice, 1987
This article suggests that mechanical application of Golden Rule-like procedures is inappropriate. The fundamental idea embodied in them, namely, that of taking issues of equity into account in test construction, may reasonably be done without doing violence to test validity. (JAZ)
Descriptors: Court Litigation, Item Analysis, Minority Groups, Standards
Peer reviewedJaeger, Richard M. – Educational Measurement: Issues and Practice, 1987
This is a reprint of a 1986 letter by the former president of the National Council on Measurement in Education (NCME) to New York and California legislators. The author outlines why NCME is opposed to legislative initiatives to extend Golden Rule procedures to tests in those states. (JAZ)
Descriptors: Item Analysis, Letters (Correspondence), Minority Groups, Standards
Peer reviewedMcDermott, Dana P. – Journal of Chemical Education, 1988
Examines three errors and learning difficulties found to be typical in general chemistry, gives the source of each problem, and presents a method of correcting each problem. Suggests using analysis of examination errors for the purpose of altering instruction for the improvement of student learning. (CW)
Descriptors: Chemistry, Cognitive Processes, College Science, Higher Education
Peer reviewedPhysics Teacher, 1988
Presented is an electricity problem on simple series circuit. Provided are the problem diagram and answers with explanation. (YP)
Descriptors: Demonstrations (Educational), Electricity, Physics, Science Activities
Peer reviewedCliff, Norman; And Others – Applied Psychological Measurement, 1988
A method for ordering persons and items when all responses are ordinal was developed and applied to several sets of questionnaire data (from undergraduates) and one set of archeological data. The method provides a possible nonparametric treatment of data usually treated by more traditional psychometric methods. (SLD)
Descriptors: Archaeology, Data Analysis, Data Collection, Higher Education
Peer reviewedRocklin, Thomas; O'Donnell, Angela M. – Journal of Educational Psychology, 1987
An experiment was conducted that contrasted a variant of computerized adaptive testing, self-adapted testing, with two traditional tests. Participants completed a self-report of text anxiety and were randomly assigned to take one of the three tests of verbal ability. Subjects generally chose more difficult items as the test progressed. (Author/LMO)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level
Peer reviewedMehrens, William A.; Phillips, S. E. – Journal of Educational Measurement, 1987
A taxonomic matrix classification was used to assess the curricular validity of the Stanford Achievement Tests for the mathematics textbooks used in a school district's fifth and sixth grades. Rasch item difficulty was also examined. Results indicated only small differences between textbooks. (GDC)
Descriptors: Difficulty Level, Elementary School Mathematics, Intermediate Grades, Item Analysis
Peer reviewedDiamond, James J.; McCormick, Janet – Evaluation and the Health Professions, 1986
Using item responses from an in-training examination in diagnostic radiology, the application of a strength of association statistic to the general problem of item analysis is illustrated. Criteria for item selection, general issues of reliability, and error of measurement are discussed. (Author/LMO)
Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Graduate Medical Education
Peer reviewedDivgi, D. R. – Journal of Educational Measurement, 1986
This paper discusses various issues involved in using the Rasch Model with multiple-choice tests and questions the suitability of this model for multiple-choice items. Results of some past studies supporting the model are shown to be irrelevant. The effects of the model's misfit on test equating are demonstrated. (Author JAZ)
Descriptors: Equated Scores, Goodness of Fit, Latent Trait Theory, Mathematical Models
Peer reviewedMovshovitz-Hadar, Nitsa; And Others – Journal for Research in Mathematics Education, 1987
A content-oriented analysis of written solutions to test items in Israeli high school graduation examinations in mathematics yielded six error categories: misused data; misinterpreted language; logically invalid inference; distorted theorem or definition; unverified solution; and technical error. (Authors/MNS)
Descriptors: Educational Research, Error Patterns, Mathematics Instruction, Research Reports
Peer reviewedHuynh, Huynh – Journal of Educational Statistics, 1986
Under the assumptions of classical measurement theory and the condition of normality, a formula is derived for the reliability of composite scores. The formula represents an extension of the Spearman-Brown formula to the case of truncated data. (Author/JAZ)
Descriptors: Computer Simulation, Error of Measurement, Expectancy Tables, Scoring Formulas
Butler, Helen Lewis – Use of English, 1987
Argues that though college examiners seem to desire candidates' personal responses to literature, the examination questions require students to reply to the opinions of others rather than explain their own opinions. (SRT)
Descriptors: College Entrance Examinations, Higher Education, Literary Criticism, Literature Appreciation
Peer reviewedKlein, Lawrence W.; Jarjoura, David – Journal of Educational Measurement, 1985
The test equating accuracy of content-representative anchors (subsets of items in common) versus nonrepresentative, but substantially longer, anchors was compared for a professional certification examination. Through a chain of equatings, it was found that content representation in anchors was critical. (Author/GDC)
Descriptors: Adults, Equated Scores, Higher Education, Item Banks
Peer reviewedHuntley, Diane E. – Journal of Allied Health, 1985
Response changes of two classes of 30 dental hygiene students each were tabulated on multiple-choice questions on quizzes, midterms, and final examinations. Response changes were classified as wrong to right, right to wrong, or wrong to wrong. Significantly more responses were changed from wrong to right than from right to wrong. (Author/CT)
Descriptors: Class Rank, Dental Hygienists, Grades (Scholastic), Higher Education


