NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Wang, Jianjun – 1995
Effects of blind guessing on the success of passing true-false and multiple-choice tests are investigated under a stochastic binomial model. Critical values of guessing are thresholds which signify when the effect of guessing is negligible. By checking a table of critical values assembled in this paper, one can make a decision with 95% confidence…
Descriptors: Bayesian Statistics, Grading, Guessing (Tests), Models
Peer reviewed Peer reviewed
Ebel, Robert L. – Journal of Educational Measurement, 1982
Reasonable and practical solutions to two major problems confronting the developer of any test of educational achievement (what to measure and how to measure it) are proposed, defended, and defined. (Author/PN)
Descriptors: Measurement Techniques, Objective Tests, Test Construction, Test Items
Peer reviewed Peer reviewed
Kolstad, Rosemarie K.; And Others – Journal of Research and Development in Education, 1983
A study compared college students' performance on complex multiple-choice tests with scores on multiple true-false clusters. Researchers concluded that the multiple-choice tests did not accurately measure students' knowledge and that cueing and guessing led to grade inflation. (PP)
Descriptors: Achievement Tests, Difficulty Level, Guessing (Tests), Higher Education
Terwilliger, James S. – 1991
This paper clarifies important distinctions in item writing and item scoring and considers the implications of these distinctions for developing guidelines related to test construction for training teachers. The terminology used to describe and classify paper and pencil test questions frequently confuses two distinct features of questions:…
Descriptors: Classroom Techniques, Educational Testing, Higher Education, Measurement Techniques
Peer reviewed Peer reviewed
Haladyna, Thomas A. – Applied Measurement in Education, 1992
Several multiple-choice item formats are examined in the current climate of test reform. The reform movement is discussed as it affects use of the following formats: (1) complex multiple-choice; (2) alternate choice; (3) true-false; (4) multiple true-false; and (5) the context dependent item set. (SLD)
Descriptors: Cognitive Psychology, Comparative Testing, Context Effect, Educational Change
Peer reviewed Peer reviewed
Hambleton, Ronald K.; Murphy, Edward – Applied Measurement in Education, 1992
The validity of several criticisms of objective tests is addressed, and the viability of some alternatives to objective testing is discussed. Evidence against multiple-choice tests is not as strong as has been claimed. Authentic assessments may not always be better, and research about new forms of assessment is necessary. (SLD)
Descriptors: Achievement Tests, Educational Assessment, Literature Reviews, Measurement Techniques
Anderson, Paul S.; Saliba, Alcyone – 1987
The use of optical scanners and computers in educational testing is common where objective testing methods (such as true-false, matching, and multiple-choice items) are well-established means of evaluating educational achievement. Where non-objective testing methods (such as fill-in-the-blank, short-answer, and essay items) have been more common,…
Descriptors: Computer Assisted Testing, Developed Nations, Developing Nations, Educational Technology
Samejima, Fumiko – 1990
The shortcomings of the conventional way of using and interpreting multiple-choice tests are summarized. Some theories and methodologies that can be applied for better use multiple-choice test items are described. Empirical facts are introduced to support the theoretical observations. New strategies are proposed that will reduce "noise"…
Descriptors: Ability Identification, Distractors (Tests), Equations (Mathematics), Estimation (Mathematics)
van Roosmalen, Willem M. M. – 1983
The construction of objective tests for native language reading comprehension is described. The tests were designed for the early secondary school years in several kinds of schools, vocational and non-vocational. The description focuses on the use of the Rasch model in test development, to develop a large pool of homogenous items and establish…
Descriptors: Ability Grouping, Difficulty Level, Foreign Countries, Item Banks
Neill, Monty – 1997
In this study, FairTest evaluated how well state assessment practices live up to the promise of high standards without standardization. The practices of states were measured against standards derived from the "Principles and Indicators for Student Assessment Systems," a 1995 publication of education and civil rights groups working…
Descriptors: Achievement Tests, Decision Making, Educational Assessment, Educational Change
Neill, Monty – 1997
FairTest evaluated how well state assessment practices live up to the promise of high standards without standardization. The practices of states were measured against standards derived from the "Principles and Indicators for Student Assessment Systems," a 1995 publication of education and civil rights groups working through the National…
Descriptors: Achievement Tests, Decision Making, Educational Assessment, Educational Change
de Jong, John H. A. L. – 1983
The Rasch model for test analysis is a latent-trait model, which specifies the relationship between observable test performance and the unobservable traits or abilities assumed under test performance. In most cases, the test constructor has no clue as to whether the latent traits postulated by the model are indeed the abilities he wants to…
Descriptors: Ability Grouping, Cloze Procedure, Correlation, English (Second Language)