Descriptor
Item Sampling | 11 |
Test Interpretation | 11 |
Test Reliability | 11 |
Test Construction | 7 |
Criterion Referenced Tests | 6 |
Item Analysis | 6 |
Test Validity | 5 |
Achievement Tests | 4 |
Career Development | 4 |
Norm Referenced Tests | 4 |
Test Theory | 4 |
More ▼ |
Author
Publication Type
Reports - Research | 5 |
Speeches/Meeting Papers | 2 |
Collected Works - Proceedings | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Adjective Check List | 1 |
What Works Clearinghouse Rating
Harris, Chester W. – 1975
Achievement tests which are specifically linked to an instructional program and have been developed in relation to an objectives base and/or to an item generation rule are considered, as well as student response data. Three types of studies are outlined and the kind of procedures thought useful illustrated. As various methods for examining…
Descriptors: Achievement Tests, Instructional Programs, Item Banks, Item Sampling
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Kolakowski, Donald – 1972
Empirical results are presented as regards the implementation of a latent-trait psychometric model by means of conditional maximum likelihood estimation. Items are scored polychotomously into varying numbers of nominal categories and the test and item characteristic curves and information functions are examined. It is concluded that scoring items…
Descriptors: Error of Measurement, Item Analysis, Item Sampling, Measurement Techniques

Fiske, Donald W.; Barack, Leonard I. – Educational and Psychological Measurement, 1976
The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…
Descriptors: Adjectives, Check Lists, Individual Differences, Item Analysis
Kriewall, Thomas E.; Hirsch, Edward – 1969
As an alternative to a classical test theory basis for criterion-referenced test construction, it is proposed that a strict item-sampling model be used. The computer's role in such a model is outlined. The assumptions of the model are carefully defined and its properties reviewed. The relationship between mastery criteria and such sampling plans…
Descriptors: Arithmetic, Behavioral Objectives, Computer Assisted Instruction, Criterion Referenced Tests

Lord, Frederic M. – Applied Psychological Measurement, 1977
Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusion are drawn from domain-referenced test…
Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms
Gifford, Janice A.; Hambleton, Ronald K. – 1980
Technical considerations associated with item selection and reliability assessment are considered in relation to criterion-referenced tests constructed to provide group information. The purpose is to emphasize test building and the evaluation of test scores in program evaluation studies. It is stressed that an evaluator employ a performance or…
Descriptors: Criterion Referenced Tests, Group Testing, Item Sampling, Models
Wilcox, Rand R. – 1979
Mastery tests are analyzed in terms of the number of skills to be mastered and the number of items per skill, in order that correct decisions of mastery or nonmastery will be made to a desired degree of probability. It is assumed that a random sample of skills will be selected for measurement, that each skill will be measured by the same number of…
Descriptors: Achievement Tests, Cutting Scores, Decision Making, Equivalency Tests
Haladyna, Tom – 1976
The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…
Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis
Haladyna, Thomas – 1975
A central problem for the user of domain-referenced tests in instruction is deciding who has passed and who has failed. Two procedures were presented and discussed. The first, employing classical test theory, was found to be more useful for larger domains and where the passing standard is 70 percent or less. The sampling procedure suggested by…
Descriptors: Academic Achievement, Academic Standards, Criterion Referenced Tests, Decision Making Skills
Educational Testing Service, Princeton, NJ. – 1977
The 1976 Educational Testing Service (ETS) Invitational Conference served as a platform for individuals who have been prominent in educational measurement and research to present their views on issues surrounding the testing controversy. The 1976 ETS "The Testing Scene: Chaos and Controversy," presents a historical review of events surrounding the…
Descriptors: Achievement Tests, Adaptive Testing, Awards, Career Development