NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 16 results Save | Export
Linn, Robert – 1978
A series of studies on conceptual and design problems in competency-based measurements are explained. The concept of validity within the context of criterion-referenced measurement is reviewed. The authors believe validation should be viewed as a process rather than an end product. It is the process of marshalling evidence to support…
Descriptors: Criterion Referenced Tests, Item Analysis, Item Sampling, Test Bias
Smith, Douglas U. – 1978
This study examined the effects of certain item selection methods on the classification accuracy and classification consistency of criterion-referenced instruments. Three item response data sets, representing varying situations of instructional effectiveness, were simulated. Five methods of item selection were then applied to each data set for the…
Descriptors: Criterion Referenced Tests, Item Analysis, Item Sampling, Latent Trait Theory
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Kolakowski, Donald – 1972
Empirical results are presented as regards the implementation of a latent-trait psychometric model by means of conditional maximum likelihood estimation. Items are scored polychotomously into varying numbers of nominal categories and the test and item characteristic curves and information functions are examined. It is concluded that scoring items…
Descriptors: Error of Measurement, Item Analysis, Item Sampling, Measurement Techniques
Peer reviewed Peer reviewed
Fiske, Donald W.; Barack, Leonard I. – Educational and Psychological Measurement, 1976
The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…
Descriptors: Adjectives, Check Lists, Individual Differences, Item Analysis
Faggen, Jane – 1978
Formulas are presented for decision reliability and for classification validity for mastery/nonmastery decisions based on criterion referenced tests. Two item parameters are used: the probability of a master answering an item correctly, and the probability of a nonmaster answering an item incorrectly. The theory explores the relationships of…
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Item Analysis, Item Banks
Cliff, Norman – 1975
Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…
Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences
Kriewall, Thomas E. – Illinois School Research, 1972
Author discusses and defines criterion tests in the context of classroom needs that have created much of the interest in the theory at this time. The primary source of interest is related to the growing implementation of individualized curricula. (Author/CB)
Descriptors: Criterion Referenced Tests, Difficulty Level, Individualized Instruction, Item Analysis
Peer reviewed Peer reviewed
Lord, Frederic M. – Applied Psychological Measurement, 1977
Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusion are drawn from domain-referenced test…
Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms
Mislevy, Robert J.; And Others – 1982
An approach was developed based on item-response models defined at the level of salient subject groups rather than at the level of individuals, designed for use with multiple-matrix sampling designs. In each of three National Assessment of Educational Progress (NAEP) mathematics subtopics, Reiser's group-effects latent trait model was fitted to…
Descriptors: Educational Assessment, Item Analysis, Item Sampling, Latent Trait Theory
Harris, Chester W.; And Others – 1977
The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…
Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis
Cook, Linda L.; And Others – 1987
This study tests several explanations for discrepant results in an earlier study (Cook et al., 1985) which presented a partial pre-calibration method for equating new editions of the Scholastic Aptitude Test (SAT) to the same scale as older editions. In contrast to full pre-calibration, which seeks to equate all items from two or more editions,…
Descriptors: College Entrance Examinations, Concurrent Validity, Equated Scores, Estimation (Mathematics)
Wilcox, Rand R. – 1979
Mastery tests are analyzed in terms of the number of skills to be mastered and the number of items per skill, in order that correct decisions of mastery or nonmastery will be made to a desired degree of probability. It is assumed that a random sample of skills will be selected for measurement, that each skill will be measured by the same number of…
Descriptors: Achievement Tests, Cutting Scores, Decision Making, Equivalency Tests
Swezey, Robert W.; Pearlstein, Richard B. – 1975
This manual outlines the rationale for using the Criterion Referenced Test (CRT) approach and suggests specific guidelines for test developers to use in constructing test items. Methods for assessing the adequacy of a CRT are also provided. (Author/RC)
Descriptors: Behavioral Objectives, Check Lists, Comparative Analysis, Criterion Referenced Tests
Previous Page | Next Page ยป
Pages: 1  |  2