Descriptor
Item Sampling | 8 |
Measurement Techniques | 8 |
Test Reliability | 8 |
Criterion Referenced Tests | 4 |
Test Construction | 3 |
Test Interpretation | 3 |
Test Validity | 3 |
Testing | 3 |
Academic Achievement | 2 |
Error of Measurement | 2 |
Evaluation Methods | 2 |
More ▼ |
Author
Kriewall, Thomas E. | 2 |
Adams, Raymond J. | 1 |
Haladyna, Thomas | 1 |
Hirsch, Edward | 1 |
Kohr, Richard L. | 1 |
Kolakowski, Donald | 1 |
Liang, Xin | 1 |
Linehan, Marsha M. | 1 |
Publication Type
Journal Articles | 2 |
Reports - Research | 2 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Pennsylvania Educational… | 1 |
What Works Clearinghouse Rating
Adams, Raymond J. – Studies in Educational Evaluation, 2005
Test reliability is a concept central to classical test theory and it is commonly stated as a requirement that a test attain a certain level of reliability before it be considered of sufficient quality for practical use. This article discusses the role of reliability in item response theory, and in particular the role of reliability in contexts…
Descriptors: Test Reliability, Error of Measurement, Item Sampling, Item Response Theory
Kohr, Richard L. – 1976
Pennsylvania's Educational Quality Assessment Program provides each participating school with a building level report in which state percentiles are a prominent part. Multiple matrix sampling was being considered as a technique to reduce testing time. However, there was great concern that the error associated with estimating the school mean might…
Descriptors: Educational Assessment, Elementary Secondary Education, Item Sampling, Measurement Techniques
Linehan, Marsha M. – 1976
Both criterion-referenced testing and behavioral assessment share the basic assumption that test behavior is a sample rather than a sign. In addition, both types of assessment focus on response capabilities and performance in specified content domains. Although content validity has been traditionally recognized as essential to criterion-referenced…
Descriptors: Behavior Patterns, Content Analysis, Criterion Referenced Tests, Informal Assessment
Kolakowski, Donald – 1972
Empirical results are presented as regards the implementation of a latent-trait psychometric model by means of conditional maximum likelihood estimation. Items are scored polychotomously into varying numbers of nominal categories and the test and item characteristic curves and information functions are examined. It is concluded that scoring items…
Descriptors: Error of Measurement, Item Analysis, Item Sampling, Measurement Techniques
Liang, Xin – Evaluation and Research in Education, 2003
Multiple matrix sampling is a data collection technique that ensures accuracy and efficiency in group performance. It has been widely used in large-scale curriculum evaluation since the 1980s. However, the design does not always fully embrace the dynamics of local evaluation demands. The purpose of this study is to introduce a modified matrix…
Descriptors: Curriculum Evaluation, Item Sampling, Matrices, Statistical Studies
Kriewall, Thomas E.; Hirsch, Edward – 1969
As an alternative to a classical test theory basis for criterion-referenced test construction, it is proposed that a strict item-sampling model be used. The computer's role in such a model is outlined. The assumptions of the model are carefully defined and its properties reviewed. The relationship between mastery criteria and such sampling plans…
Descriptors: Arithmetic, Behavioral Objectives, Computer Assisted Instruction, Criterion Referenced Tests

Kriewall, Thomas E. – 1972
The measurement information generated by CRT's is designed for use in instructional management systems where classifications of pupils for treatment are to be decided on the basis of minimal data consistent with predetermined limits for the errors of misclassification. The measures obtained are content specific estimates of proficiency useful for…
Descriptors: Ability Grouping, Academic Achievement, Criterion Referenced Tests, Decision Making
Haladyna, Thomas – 1975
A central problem for the user of domain-referenced tests in instruction is deciding who has passed and who has failed. Two procedures were presented and discussed. The first, employing classical test theory, was found to be more useful for larger domains and where the passing standard is 70 percent or less. The sampling procedure suggested by…
Descriptors: Academic Achievement, Academic Standards, Criterion Referenced Tests, Decision Making Skills