Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014
For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…
Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing
Lei, Pui-Wa; Wu, Qiong; DiPerna, James C.; Morgan, Paul L. – Educational and Psychological Measurement, 2009
Currently, few measures are available to monitor young children's progress in acquiring key early academic skills. In response to this need, the authors have begun developing measures (i.e., the Early Arithmetic, Reading and Learning Indicators, or EARLI) of preschoolers' numeracy skills. To accurately and efficiently monitor acquisition of early…
Descriptors: Preschool Children, Measures (Individuals), Numeracy, Emergent Literacy

Ivens, Stephen H. – Educational and Psychological Measurement, 1971
Descriptors: Difficulty Level, Item Analysis, Nonparametric Statistics, Statistical Analysis

Rogers, Paul W. – Educational and Psychological Measurement, 1978
Two procedures for the display of item analysis statistics are described. One procedure allows for investigation of difficulty; the second plots item difficulty against item discrimination. (Author/JKS)
Descriptors: Difficulty Level, Graphs, Guidelines, Item Analysis

Cizek, Gregory J.; Robinson, K. Lynne; O'Day, Denis M. – Educational and Psychological Measurement, 1998
The effect of removing nonfunctioning options from multiple-choice tests was studied by examining change in difficulty, discrimination, and dimensionality. Results provide additional support for the benefits of eliminating nonfunctioning options, such as enhanced score reliability, reduced testing time, potential for broader domain sampling, and…
Descriptors: Difficulty Level, Multiple Choice Tests, Sampling, Scores

Roid, G. H.; Haladyna, Thomas M. – Educational and Psychological Measurement, 1978
Two techniques for writing achievement test items to accompany instructional materials are contrasted: writing items from statements of instructional objectives, and writing items from semi-automated rules for transforming instructional statements. Both systems resulted in about the same number of faulty items. (Author/JKS)
Descriptors: Achievement Tests, Comparative Analysis, Criterion Referenced Tests, Difficulty Level

Blumberg, Phyllis; And Others – Educational and Psychological Measurement, 1982
First year medical students answered parallel multiple-choice questions at different taxonomic levels as part of their diagnostic examinations. The results show that when content is held constant, students perform as well on interpretation and problem-solving questions as on recall questions. (Author/BW)
Descriptors: Classification, Cognitive Processes, Difficulty Level, Higher Education

Cizek, Gregory J.; O'Day, Denis M. – Educational and Psychological Measurement, 1994
Two investigations involving 700 candidates for medical specialty certification suggest that test items with only 4 options perform as well as the same items with 5 options. Results also suggest that five-option multiple-choice items can be reduced to four-option items by removing a nonfunctioning option. (SLD)
Descriptors: Certification, Difficulty Level, Distractors (Tests), Licensing Examinations (Professions)

Knowles, Susan L.; Welch, Cynthia A. – Educational and Psychological Measurement, 1992
A meta-analysis of the difficulty and discrimination of the "none-of-the-above" (NOTA) test option was conducted with 12 articles (20 effect sizes) for difficulty and 7 studies (11 effect sizes) for discrimination. Findings indicate that using the NOTA option does not result in items of lesser quality. (SLD)
Descriptors: Difficulty Level, Effect Size, Meta Analysis, Multiple Choice Tests

Van der Ven, Ad H. G. S. – Educational and Psychological Measurement, 1992
The dichotomous Rasch model was applied to verbal subtest scores on the Intelligence Structure Test Battery for 905 12- to 15-year-old secondary school students in the Netherlands. Results suggest that, if any factor is used to increase difficulty of items, that factor should be used on all items. (SLD)
Descriptors: Difficulty Level, Foreign Countries, Intelligence Tests, Secondary Education

Ace, Merle C.; Dawis, Rene V. – Educational and Psychological Measurement, 1973
Because no previous study was found in which both blank position in the item stem and positional placement of the correct response were studied simultaneously, it was decided to investigate the influence of these two factors, alone and in combination, on the difficulty level of verbal analogy items. (Authors)
Descriptors: Analysis of Variance, Data Analysis, Difficulty Level, Disadvantaged

Cizek, Gregory J. – Educational and Psychological Measurement, 1994
The performance of a common set of test items was studied on an examination in which the order of options for one test form was experimentally manipulated. Results for 759 medical specialty board examinees show that reordering item options has significant but unpredictable effects on item difficulty. (SLD)
Descriptors: Change, Difficulty Level, Equated Scores, Licensing Examinations (Professions)

Crehan, Kevin D.; And Others – Educational and Psychological Measurement, 1993
Studies with 220 college students found that multiple-choice test items with 3 options are more difficult than those with 4 options, and items with the none-of-these option are more difficult than those without this option. Neither format manipulation affected item discrimination. Implications for test construction are discussed. (SLD)
Descriptors: College Students, Comparative Testing, Difficulty Level, Distractors (Tests)

Lord, Frederic M. – Educational and Psychological Measurement, 1971
Descriptors: Ability, Adaptive Testing, Computer Oriented Programs, Difficulty Level