NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Angoff, William H. – 1985
This paper points out that there are certain generalizations about directions for guessing and methods of scoring that require that data be derived from random groups design. It supports the viewpoint that it is neither sufficient nor appropriate to make such generalizations on the basis of an analysis of scores obtained from the answer sheets of…
Descriptors: Correlation, Guessing (Tests), Research Design, Scoring Formulas
Hutchinson, T. P. – 1984
One means of learning about the processes operating in a multiple choice test is to include some test items, called nonsense items, which have no correct answer. This paper compares two versions of a mathematical model of test performance to interpret test data that includes both genuine and nonsense items. One formula is based on the usual…
Descriptors: Foreign Countries, Guessing (Tests), Mathematical Models, Multiple Choice Tests
Peer reviewed Peer reviewed
Lord, Frederic M. – Journal of Educational Measurement, 1984
Four methods are outlined for estimating or approximating from a single test administration the standard error of measurement of number-right test score at specified ability levels or cutting scores. The methods are illustrated and compared on one set of real test data. (Author)
Descriptors: Academic Ability, Cutting Scores, Error of Measurement, Scoring Formulas
Aghbar, Ali A.; Tang, Huixing – 1991
A study was undertaken to develop a partial credit scheme for scoring cloze-type questions on an English collocation test, obtain construct validity evidence for the test and the scoring scheme using the Rasch Partial Credit Model, and compare partial credit scoring with the more commonly used dichotomous scoring with the same test instrument.…
Descriptors: Cloze Procedure, College Students, English (Second Language), Language Tests
Bruno, James E. – Journal of Computer-Based Instruction, 1987
Reports preliminary findings of a study which used a modified Admissible Probability Measurement (APM) test scoring system in the design of computer based instructional management systems. The use of APM for curriculum analysis is discussed, as well as its value in enhancing individualized learning. (Author/LRW)
Descriptors: Computer Assisted Testing, Computer Managed Instruction, Curriculum Evaluation, Design
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Illinois State Board of Education, Springfield. Evaluation and Assessment Section. – 1984
This report identifies the methods and issues in the evaluation of student writing in classrooms and large scale assessments using an analytical scoring system from the Illinois Inventory of Educational Progress (IIEP). This guide is written for staff in state offices of education conducting large-scale writing assessments, and also for personnel…
Descriptors: Educational Assessment, Educational Methods, Elementary Secondary Education, Essay Tests
Melican, Gerald; Plake, Barbara S. – 1984
The validity of combining a correction for guessing with the Nedelsky-based cutscore was investigated. A five option multiple choice Mathematics Achievement Test was used in the study. Items were selected to meet several criteria. These included: the capability of measuring mathematics concepts related to performance in introductory statistics;…
Descriptors: Cutting Scores, Guessing (Tests), Higher Education, Multiple Choice Tests
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Education Commission of the States, Denver, CO. National Assessment of Educational Progress. – 1983
Exercises from the National Assessment of Educational Progress (NAEP) third mathematics assessment are provided in this released exercise set. Exercises were administered to 9-year-olds, 13-year-olds, and 17-year-olds. Some exercises were administered to only one age group, others to two or more age groups. The set is divided into two parts: text…
Descriptors: Elementary School Mathematics, Elementary Secondary Education, Item Banks, Knowledge Level
Kingston, Neal M. – 1985
Birnbaum's three-parameter logistic item response model was used to study guessing behavior of low ability examinees on the Graduate Record Examinations (GRE) General Test, Verbal Measure. GRE scoring procedures had recently changed, from a scoring formula which corrected for guessing, to number-right scoring. The three-parameter theory was used…
Descriptors: Academic Aptitude, Analysis of Variance, College Entrance Examinations, Difficulty Level
Hilton, Thomas L.; And Others – 1985
Since the mean score for a sample composed of several subgroups can be viewed as the sum of the mean of each subgroup weighted by the proportional size of the subgroup, then the mean change in a time period--in this case, from 1972 to 1980--is the sum of the differences between the means for each subgroup, with each mean weighted by its…
Descriptors: Analysis of Covariance, Cohort Analysis, Cross Sectional Studies, Educational Trends