Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 3 |
Descriptor
Item Analysis | 13 |
Multiple Choice Tests | 13 |
Test Theory | 13 |
Test Items | 8 |
Higher Education | 6 |
Test Construction | 4 |
Test Format | 4 |
Comparative Testing | 3 |
Difficulty Level | 3 |
Psychometrics | 3 |
Test Reliability | 3 |
Source
Journal of Educational… | 2 |
Advances in Health Sciences… | 1 |
European Journal of… | 1 |
Journal of Economic Education | 1 |
ProQuest LLC | 1 |
Author
Brice, Julie | 1 |
Budescu, David V. | 1 |
Coombes, Lee | 1 |
DeCarlo, Lawrence T. | 1 |
Ellis, David P. | 1 |
Fenna, Doug S. | 1 |
Hamm, Debra W. | 1 |
Lancaster, Diana M. | 1 |
Melancon, Janet G. | 1 |
Miao, Chang Yu | 1 |
Myers, Charles T. | 1 |
Publication Type
Reports - Research | 8 |
Journal Articles | 5 |
Speeches/Meeting Papers | 4 |
Reports - Evaluative | 2 |
Dissertations/Theses -… | 1 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Assessments and Surveys
Comprehensive Tests of Basic… | 1 |
Embedded Figures Test | 1 |
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
Ellis, David P. – ProQuest LLC, 2011
The current version of the International Language Testing Association (ILTA) Guidelines for Practice requires language testers to pretest items before including them on an exam, or when pretesting is not possible, to conduct post-hoc item analysis to ensure any malfunctioning items are excluded from scoring. However, the guidelines are devoid of…
Descriptors: Item Response Theory, High Stakes Tests, College Entrance Examinations, Item Analysis
Ricketts, Chris; Brice, Julie; Coombes, Lee – Advances in Health Sciences Education, 2010
The purpose of multiple choice tests of medical knowledge is to estimate as accurately as possible a candidate's level of knowledge. However, concern is sometimes expressed that multiple choice tests may also discriminate in undesirable and irrelevant ways, such as between minority ethnic groups or by sex of candidates. There is little literature…
Descriptors: Medical Students, Testing Accommodations, Ethnic Groups, Learning Disabilities
Fenna, Doug S. – European Journal of Engineering Education, 2004
Multiple-choice testing (MCT) has several advantages which are becoming more relevant in the current financial climate. In particular, they can be machine marked. As an objective testing method it is particularly relevant to engineering and other factual courses, but MCTs are not widely used in engineering because students can benefit from…
Descriptors: Guessing (Tests), Testing, Multiple Choice Tests, Engineering Education
Myers, Charles T. – 1978
The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction
Ryan, Joseph P.; Hamm, Debra W. – 1976
A procedure is described for increasing the reliability of tests after they have been given and for developing shorter but more reliable tests. Eight tests administered to 200 graduate students studying educational research are analyzed. The analysis considers the original tests, the items loading on the first factor of the test, and the items…
Descriptors: Career Development, Factor Analysis, Factor Structure, Item Analysis
Seong, Tae-Je; Subkoviak, Michael J. – 1987
The purpose of this research was to reinvestigate the accuracy of three item bias detection procedures: (1) Linn and Harnisch's pseudo-IRT(Z) method; (2) Camilli's chi-square technique; and (3) Angoff's revised transformed item difficulty method. These methods are applied when the minority group sample size is too small to obtain stable estimates…
Descriptors: Blacks, Difficulty Level, Higher Education, Item Analysis
Lancaster, Diana M.; And Others – 1987
Difficulty and discrimination ability were compared between multiple choice and short answer items in midterm and final examinations for the internal medicine course at Louisiana State University School of Dentistry. The examinations were administered to 67 sophomore dental students in that course. Additionally, the impact of the source of the…
Descriptors: Dental Schools, Dentistry, Difficulty Level, Discriminant Analysis

Budescu, David V.; Nevo, Baruch – Journal of Educational Measurement, 1985
The proportionality model assumes that total testing time is proportional to the number of test items and the number of options per multiple choice test item. This assumption was examined, using test items having from two to five options. The model was not supported. (Author/GDC)
Descriptors: College Entrance Examinations, Foreign Countries, Higher Education, Item Analysis
Yen, Wendy M. – 1979
Three test-analysis models were used to analyze three types of simulated test score data plus the results of eight achievement tests. Chi-square goodness-of-fit statistics were used to evaluate the appropriateness of the models to the four kinds of data. Data were generated to simulate the responses of 1,000 students to 36 pseudo-items by…
Descriptors: Achievement Tests, Correlation, Goodness of Fit, Item Analysis
Miao, Chang Yu – 1987
Nedelsky (1954) has suggested a procedure for determining the minimum passing score on a multiple-choice test. In this procedure expert judges estimate the probable score of a minimally competent examinee. The technique does not refer to the students' performance data. The purposes of this paper are: (1) to introduce a modification to the Nedelsky…
Descriptors: Academic Standards, Analysis of Variance, Bayesian Statistics, Cutting Scores
Melancon, Janet G.; Thompson, Bruce – 1990
Classical measurement theory was used to investigate measurement characteristics of both parts of the Finding Embedded Figures Test (FEFT) when the test was: administered in either a "no guessing" supply format or a multiple-choice selection format; administered to either undergraduate college students or middle school students; and…
Descriptors: Comparative Testing, Construct Validity, Guessing (Tests), Higher Education

Walstad, William B.; Robson, Denise – Journal of Economic Education, 1997
Applies Item Response Theory methods to data from the national norming of the Test of Economic Literacy to identify test questions with large male-female differences. Regression analysis showed a significant decrease in the magnitude of gender difference, although a difference was still present. (MJP)
Descriptors: Academic Aptitude, Comparative Testing, Economics, Economics Education