Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Criterion Referenced Tests | 25 |
Statistical Analysis | 25 |
Test Interpretation | 25 |
Test Reliability | 12 |
Mathematical Models | 11 |
Norm Referenced Tests | 8 |
Test Construction | 8 |
Cutting Scores | 6 |
Item Analysis | 6 |
Mastery Tests | 6 |
Scores | 6 |
More ▼ |
Source
Journal of Special Education | 2 |
Journal of Early Adolescence | 1 |
Journal of Educational… | 1 |
Language Assessment Quarterly | 1 |
Author
Publication Type
Reports - Research | 14 |
Speeches/Meeting Papers | 3 |
Information Analyses | 2 |
Journal Articles | 2 |
Guides - General | 1 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 1 |
Audience
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
California Achievement Tests | 1 |
What Works Clearinghouse Rating
Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014
In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…
Descriptors: Generalizability Theory, Measurement, Reliability, Correlation
Brown, James Dean – Language Assessment Quarterly, 2008
In keeping with the theme of the International Language Testing Association/Language Testing Research Colloquium Conference in 2008, "Focusing on the Core: Justifying the Use of Language Assessments to Stakeholders," I define "stakeholder-friendly tests," "defensible testing," and "testing-context analysis."…
Descriptors: Language Usage, Curriculum Development, Testing, Language Tests
Berk, Ronald A. – 1980
Seventeen statistics for measuring the reliability of criterion-referenced tests were critically reviewed. The review was organized into two sections: (1) a discussion of preliminary considerations to provide a foundation for choosing the appropriate category of "reliability" (threshold loss function, squared-error loss-function, or…
Descriptors: Criterion Referenced Tests, Cutting Scores, Scoring Formulas, Statistical Analysis
Besel, Ronald – 1973
The contention that interpretation of a student's performance on a criterion referenced test should be independent of the performance of his classmates is challenged. The Mastery Learning Test Model, which was developed for analyzing criterion referenced test data, is described. An estimate of the proportion of students in an instructional group…
Descriptors: Criterion Referenced Tests, Mathematical Models, Measurement Instruments, Speeches

Shoemaker, David M. – Journal of Special Education, 1972
Considered is the improvement of criterion-referenced measurement as applied to individual and group assessment of handicapped and normal children. (DB)
Descriptors: Criterion Referenced Tests, Evaluation, Exceptional Child Education, Handicapped Children
Woodson, M. I. Charles E.
It has been argued that item variance and test variance are not necessary characteristics for criterion-referenced tests, although they are necessary for norm-referenced tests. This position is in error because it considers sample statistics as the criteria for evaluating items and tests. Within a particular sample, an item or test may have no…
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Item Analysis, Item Sampling

Gorth, William P.; Hambleton, Ronald K. – Journal of Special Education, 1972
Descriptors: Criterion Referenced Tests, Evaluation, Exceptional Child Education, Handicapped Children
Epstein, Kenneth I. – 1975
Since the primary purpose of classical testing is to rank order examinees consistently, the absolute value of the true score has been relatively unimportant. However, the major purpose of criterion referenced testing is to estimate the true capabilities of examinees to perform specific tasks. Hence, the problems of true score determination assume…
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Mathematical Models, Military Personnel

Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1972
In this comment a recent attempt by Samuel A. Livingston to develop a theory of reliability for criterion-referenced measures is critiqued. For Livingston's rejoinder see TM 500 560. (Authors/MB)
Descriptors: Criterion Referenced Tests, Error of Measurement, Measurement Techniques, Response Style (Tests)
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Wilcox, Rand R. – 1977
Three statistical problems related to criterion-referenced testing are investigated: estimation of the likelihood of a false-positive or false-negative decision with a mastery test, estimation of true scores in the Compound Binomial Error Model, and comparison of the examinees to a control. Two methods for estimating the likelihood of…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error Patterns, Item Sampling
Kane, Michael T.; Brennan, Robert L. – 1977
A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper, it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance, and one that is not. The special cases of…
Descriptors: Bayesian Statistics, Correlation, Criterion Referenced Tests, Cutting Scores
Besel, Ronald – 1971
The Mastery-Learning test model is extended. Methods for estimating prior probabilities are described. The use of an adjustment matrix to transform a probability of mastery measure and empirical methods for estimating adjustment matrix parameters are derived. Adjustment matrices are interpreted as indicators of instructional effectiveness and as…
Descriptors: Criterion Referenced Tests, Decision Making, Groups, Individual Testing
Long, John; And Others – 1978
An experiment was performed to evaluate the tenability of the assumption in the Elementary Secondary Education Act (ESEA) Title I proposed variance estimation procedures for criterion referenced tests. The assumption is that the ratio of the local to the national standard deviation for the national sample will be the same for the normed test as…
Descriptors: Compensatory Education, Criterion Referenced Tests, Educational Assessment, Elementary Education
Steinheiser, Frederick H., Jr.; And Others – 1978
Alternative mathematical models for scoring and decision making with criterion referenced tests are described, especially as they concern appropriate test length and methods of establishing statistically valid cutting scores. Several of these approaches are reviewed and compared on formal-analytic and empirical grounds: (1) Block's approach to…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Cutting Scores, Decision Making
Previous Page | Next Page ยป
Pages: 1 | 2