Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Multiple Choice Tests | 39 |
Weighted Scores | 39 |
Test Reliability | 25 |
Scoring Formulas | 17 |
Test Validity | 16 |
Guessing (Tests) | 15 |
Scoring | 11 |
Test Construction | 9 |
Test Items | 9 |
Measurement Techniques | 8 |
Confidence Testing | 7 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 12 |
Journal Articles | 9 |
Speeches/Meeting Papers | 4 |
Reports - Evaluative | 3 |
Information Analyses | 1 |
Non-Print Media | 1 |
Reference Materials - General | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Grade 10 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Secondary Education | 1 |
Audience
Location
Florida | 1 |
Michigan | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Florida Comprehensive… | 1 |
New Jersey College Basic… | 1 |
SAT (College Admission Test) | 1 |
Stanford Achievement Tests | 1 |
Test of Standard Written… | 1 |
What Works Clearinghouse Rating
Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2021
Equating the scores from different forms of a test requires collecting data that link the forms. Problems arise when the test forms to be linked are given to groups that are not equivalent and the forms share no common items by which to measure or adjust for this group nonequivalence. We compared three approaches to adjusting for group…
Descriptors: Equated Scores, Weighted Scores, Sampling, Multiple Choice Tests
Yun, Eunjeong – Research in Science & Technological Education, 2020
Background: We adopted a theoretical framework that the acquisition of a scientific concept comprises the development of connections among conceptual elements associated with a scientific term within a mental semantic network. Given this framework, the hypothesis that the surrounding words connected with a scientific term are relevant to the…
Descriptors: Correlation, Semantics, Scientific Concepts, Networks
Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018
A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…
Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests
Scafe, Marla G. – American Journal of Business Education, 2011
The purpose of this study was to evaluate the effectiveness of group testing as a pedagogical technique to enhance learning in a difficult subject such as statistics. Individual test scores were compared to their group test scores for the same, identical test. A t test was used to compare the scores for 157 randomly selected MBA students enrolled…
Descriptors: Group Testing, Individual Testing, Statistical Analysis, Comparative Analysis
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity
Shermis, Mark D.; Long, Susanne K. – Journal of Psychoeducational Assessment, 2009
This study investigated the convergent and discriminant validity of the high-stakes Florida Comprehensive Assessment Test (FCAT) in both reading and writing at grade levels 4, 8, and 10. The data from the 2006 FCAT administration were analyzed via traditional multitrait-multimethod (MTMM) analysis to identify the factor structure and structural…
Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Writing Tests, Validity

Raffeld, Paul – Journal of Educational Measurement, 1975
Results support the contention that a Guttman-weighted objective test can have psychometric properties that are superior to those of its unweighted counterpart, as long as omissions do not exist or are assigned a value equal to the mean of the k item alternative weights. (Author/BJG)
Descriptors: Multiple Choice Tests, Predictive Validity, Test Reliability, Test Validity
Ahlgren, Andrew – 1970
A hand-scoring system for a three-level confidence-marking scheme for short answer and multiple-choice tests is described. The scoring system is for a test where the student is asked to indicate for each answer whether the probability of his being correct is more than 1/2 (sure), 1/2 (neutral), or less than 1/2 (guess). The effect of the system is…
Descriptors: Answer Keys, Guessing (Tests), Multiple Choice Tests, Scores

Hendrickson, Gerry F. – Journal of Educational Measurement, 1971
Descriptors: Correlation, Guessing (Tests), Multiple Choice Tests, Sex Differences
Budescu, David V. – 1979
This paper outlines a technique for differentially weighting options of a multiple choice test in a fashion that maximizes the item predictive validity. The rule can be applied with different number of categories and the "optimal" number of categories can be determined by significance tests and/or through the R2 criterion. Our theoretical analysis…
Descriptors: Multiple Choice Tests, Predictive Validity, Scoring Formulas, Test Items

Waters, Brian K. – Journal of Educational Research, 1976
This pilot study compared two empirically-derived, option-weighting methods and the resultant effect on the reliability and validity of multiple choice test scores as compared with conventional rights-only scoring. (MM)
Descriptors: Guessing (Tests), Measurement, Multiple Choice Tests, Scoring

Abu-Sayf, F. K. – Educational Review, 1979
The purpose of this article is to discuss some recent developments in the scoring of multiple-choice items from two angles. The first consists of the recent developments in the test instructions of the conventional scoring procedures, and the second consists of a discussion of new scoring methods and formulas. (Author)
Descriptors: Confidence Testing, Guessing (Tests), Measurement Objectives, Multiple Choice Tests
Smith, Richard M. – 1981
One of the recurrent themes of the psychometric literature has been the idea that the incorrect responses a person makes to test items contain information that might be useful in determining the person's position on the variable the items are intended to define. The "Partial Credit" model, a member of the family of latent trait models…
Descriptors: Algebra, High Schools, Latent Trait Theory, Multiple Choice Tests

Krauft, Conrad C.; Beggs, Donald L. – Journal of Experimental Education, 1973
The purpose of the study was to determine whether a subject weighted (SW) multiple-choice test taking procedure would result in higher and more reliable scores than the conventional (C) multiple-choice test taking procedure in general at different levels of risk taking. (Author)
Descriptors: Attitudes, Educational Research, Multiple Choice Tests, Questionnaires

Reilly, Richard R.; Dynarski, Barbara J. – Educational and Psychological Measurement, 1972
Descriptors: Answer Keys, Branching, Computer Programs, Multiple Choice Tests