Bezirhan, Ummugul; von Davier, Matthias; Grabovsky, Irina – Educational and Psychological Measurement, 2021
This article presents a new approach to the analysis of how students answer tests and how they allocate resources in terms of time on task and revisiting previously answered questions. Previous research has shown that in high-stakes assessments, most test takers do not end the testing session early, but rather spend all of the time they were…
Descriptors: Response Style (Tests), Accuracy, Reaction Time, Ability
Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017
The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…
Descriptors: Scores, Item Analysis, Classification, Decision Making
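As a rough illustration of the Angoff method named in this abstract: in its common form, each content expert estimates the probability that a minimally proficient examinee answers each item correctly, and the cut score is the sum of the per-item mean ratings. The sketch below assumes that common form; the function name `angoff_cut_score` and the ratings are hypothetical and are not taken from the article.

```python
from statistics import mean


def angoff_cut_score(ratings):
    """Return an Angoff-style cut score (hypothetical helper).

    ratings[j][i] is judge j's estimated probability that a minimally
    proficient examinee answers item i correctly.
    """
    n_items = len(ratings[0])
    if any(len(judge) != n_items for judge in ratings):
        raise ValueError("every judge must rate every item")
    # Average the judges' ratings for each item, then sum across items.
    item_means = [mean(judge[i] for judge in ratings) for i in range(n_items)]
    return sum(item_means)


# Made-up ratings: two judges, three items.
ratings = [
    [0.5, 0.75, 0.25],
    [0.5, 0.25, 0.75],
]
cut = angoff_cut_score(ratings)
print(cut)
```

The result is expressed in raw-score units, that is, the expected number of items a minimally proficient examinee would answer correctly.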

Cizek, Gregory J.; O'Day, Dennis M. – Educational and Psychological Measurement, 1994
Two investigations involving 700 candidates for medical specialty certification suggest that test items with only four options perform as well as the same items with five options. Results also suggest that five-option multiple-choice items can be reduced to four-option items by removing a nonfunctioning option. (SLD)
Descriptors: Certification, Difficulty Level, Distractors (Tests), Licensing Examinations (Professions)

Cizek, Gregory J. – Educational and Psychological Measurement, 1994
This study examined the performance of a common set of test items on an examination in which the order of options for one test form was experimentally manipulated. Results for 759 medical specialty board examinees indicate that reordering item options has significant but unpredictable effects on item difficulty. (SLD)
Descriptors: Change, Difficulty Level, Equated Scores, Licensing Examinations (Professions)