Descriptor
Scoring Formulas | 9 |
Test Items | 9 |
Multiple Choice Tests | 5 |
Guessing (Tests) | 4 |
Scoring | 3 |
Objective Tests | 2 |
Test Interpretation | 2 |
Test Theory | 2 |
Testing Problems | 2 |
Weighted Scores | 2 |
Achievement Tests | 1 |
More ▼ |
Source
Applied Measurement in… | 1 |
Assessment & Evaluation in… | 1 |
Educational and Psychological… | 1 |
Evaluation and the Health… | 1 |
Author
Frary, Robert B. | 2 |
Aghbar, Ali A. | 1 |
Angoff, William H. | 1 |
Budescu, David V. | 1 |
Burton, Richard F. | 1 |
Gross, Leon J. | 1 |
Hutchinson, T.P. | 1 |
Pomplun, Mark | 1 |
Pray, Bruce S., Sr. | 1 |
Tang, Huixing | 1 |
Publication Type
Reports - Evaluative | 9 |
Journal Articles | 4 |
Reports - Research | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Researchers | 2 |
Location
Laws, Policies, & Programs
Education for All Handicapped… | 1 |
Assessments and Surveys
SAT (College Admission Test) | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating

Frary, Robert B. – Applied Measurement in Education, 1989
Multiple-choice response and scoring methods that attempt to determine an examinee's degree of knowledge about each item in order to produce a total test score are reviewed. There is apparently little advantage to such schemes; however, they may have secondary benefits such as providing feedback to enhance learning. (SLD)
Descriptors: Knowledge Level, Multiple Choice Tests, Scoring, Scoring Formulas
Budescu, David V. – 1979
This paper outlines a technique for differentially weighting options of a multiple choice test in a fashion that maximizes the item predictive validity. The rule can be applied with different number of categories and the "optimal" number of categories can be determined by significance tests and/or through the R2 criterion. Our theoretical analysis…
Descriptors: Multiple Choice Tests, Predictive Validity, Scoring Formulas, Test Items
Angoff, William H. – 1985
This paper points out that there are certain generalizations about directions for guessing and methods of scoring that require that data be derived from random groups design. It supports the viewpoint that it is neither sufficient nor appropriate to make such generalizations on the basis of an analysis of scores obtained from the answer sheets of…
Descriptors: Correlation, Guessing (Tests), Research Design, Scoring Formulas

Gross, Leon J. – Evaluation and the Health Professions, 1982
Despite the 50 percent probability of a correctly guessed response, a multiple true-false examination should provide sufficient score variability for adequate discrimination without formula scoring. This scoring system directs examinees to respond to each item, with their scores based simply on the number of correct responses. (Author/CM)
Descriptors: Achievement Tests, Guessing (Tests), Health Education, Higher Education
Pomplun, Mark; And Others – 1992
This study evaluated the use of bivariate matching as a solution to the problem of studying differential item functioning (DIF) with formula scored tests. Using Scholastic Aptitude Test verbal data with large samples, both male/female and black/white group comparisons were investigated. Mantel-Haenszel (MH) delta-(D) DIF values and DIF category…
Descriptors: Blacks, Criteria, Females, Item Bias
Willingness to Answer Multiple-Choice Questions as Manifested Both in Genuine and in Nonsense Items.

Frary, Robert B.; Hutchinson, T.P. – Educational and Psychological Measurement, 1982
Alternate versions of Hutchinson's theory were compared, and one which implies the existence of partial knowledge was found to be better than one which implies that an appropriate measure of ability is obtained by applying the conventional correction for guessing. (Author/PN)
Descriptors: Guessing (Tests), Latent Trait Theory, Multiple Choice Tests, Scoring Formulas
Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004
The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…
Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items
Aghbar, Ali A.; Tang, Huixing – 1991
A study was undertaken to develop a partial credit scheme for scoring cloze-type questions on an English collocation test, obtain construct validity evidence for the test and the scoring scheme using the Rasch Partial Credit Model, and compare partial credit scoring with the more commonly used dichotomous scoring with the same test instrument.…
Descriptors: Cloze Procedure, College Students, English (Second Language), Language Tests
Pray, Bruce S., Sr. – 1979
Professional assessment of mental retardation (MR) and learning disabilities (LD) is crucial to the implementation of P.L. 94-142 because it necessarily precedes the employment of trained teachers and because a plan for remediation depends on accurate assessment. Intelligence is a key element in the identification of MR-LD, but competent, fair…
Descriptors: American Indian Education, American Indians, Cultural Influences, Culture Fair Tests