NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kentaro Fukushima; Nao Uchida; Kensuke Okada – Journal of Educational and Behavioral Statistics, 2025
Diagnostic tests are typically administered in a multiple-choice (MC) format due to their advantages of objectivity and time efficiency. The MC-deterministic input, noisy "and" gate (DINA) family of models, a representative class of cognitive diagnostic models for MC items, efficiently and parsimoniously estimates the mastery profiles of…
Descriptors: Diagnostic Tests, Cognitive Measurement, Multiple Choice Tests, Educational Assessment
Peer reviewed Peer reviewed
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2001
Describes four measures of test unreliability that quantify effects of question selection and guessing, both separately and together--three chosen for immediacy and one for greater mathematical elegance. Quantifies their dependence on test length and number of answer options per question. Concludes that many multiple choice tests are unreliable…
Descriptors: Guessing (Tests), Mathematical Models, Multiple Choice Tests, Objective Tests
Peer reviewed Peer reviewed
Murphy, R. J. L. – British Journal of Educational Psychology, 1982
To study sex differences in test performance, the performance of males and females on 16 General Certificate of Education exams was analyzed in England. Results show that males perform better on objective tests than females. (Author/JJD)
Descriptors: Achievement, Foreign Countries, Objective Tests, Prediction
Peer reviewed Peer reviewed
Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)
Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Handel, Richard W.; Arnau, Randolph C.; Archer, Robert P.; Dandy, Kristina L. – Assessment, 2006
The Minnesota Multiphasic Personality Inventory--Adolescent (MMPI-A) and Minnesota Multiphasic Personality Inventory--2 (MMPI-2) True Response Inconsistency (TRIN) scales are measures of acquiescence and nonacquiescence included among the standard validity scales on these instruments. The goals of this study were to evaluate the effectiveness of…
Descriptors: Adolescents, Protocol Analysis, Effect Size, Personality Measures
Todd, Amelia B. – 1983
An achievement test for the secondary school vocational electricity programs in South Carolina was constructed by a research coordinating unit (RCU) project. A trade and industrial supervisor and consultant, three electricity instructors, and four industry representatives comprised an advisory committee that participated in its development. Three…
Descriptors: Achievement Tests, Electricity, Multiple Choice Tests, Objective Tests
Peer reviewed Peer reviewed
Peckham, Irvin – English Journal, 1987
Criticizes the California Assessment Program (CAP) prior to l987 for testing writing skills objectively. Describes the specific improvements in the new CAP Directed Writing Assessment which focuses on the most important characteristics necessary to a particular type of writing rather than those that are common to all types.(NH)
Descriptors: Educational Assessment, Measurement Techniques, Objective Tests, Performance Based Assessment
Torrence, David R. – 1986
This was a replicative study that was initiated with a journeyman level certification instrument for an international union, when industry monitors were observed suggesting to examinees to "go with your first response." The question arose whether this was a researched-based practice. If not, wouldn't this practice inject constant error…
Descriptors: Adults, Correlation, Error of Measurement, Guessing (Tests)
Peer reviewed Peer reviewed
Mitchell, G.; And Others – Medical Teacher, 1986
Describes a study designed to determine if the amount of time allocated for answering multiple true/false type questions affects the grades of the medical students taking the tests. Students who had 2-1/4 minutes to answer each question scored significantly better than those who had 1-1/2 minutes or 3 minutes. (TW)
Descriptors: Biochemistry, College Science, Higher Education, Medical Education