Wilcox, Rand R. – Educational and Psychological Measurement, 1981
This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)
Descriptors: Criterion Referenced Tests, Scoring, Test Reliability

Lovett, Hubert T. – Educational and Psychological Measurement, 1977
The analysis of variance model for estimating reliability in norm referenced tests is extended to criterion referenced tests. The essential modification is that the criterion or cut-off score is substituted for the population mean. An example and discussion are presented. (JKS)
Descriptors: Analysis of Variance, Criterion Referenced Tests, Cutting Scores, Test Reliability

The Effect of Violating the Assumption of Equal Item Means in Estimating the Livingston Coefficient.
Lovett, Hubert T. – Educational and Psychological Measurement, 1978
The validity of five methods of estimating the reliability of criterion-referenced tests was evaluated across nine conditions of variability among item means. The results were analyzed by analysis of variance, the Newman-Keuls test, and a nonparametric procedure. There was a tendency for all of the methods to be conservative. (Author/JKS)
Descriptors: Analysis of Variance, Criterion Referenced Tests, Item Analysis, Nonparametric Statistics
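For readers unfamiliar with the Livingston coefficient, the sketch below computes it from summary statistics using the commonly cited form k² = (r·σ² + (μ − C)²) / (σ² + (μ − C)²), where r is a conventional reliability estimate, σ² the score variance, μ the mean score, and C the cutting score. The function and parameter names are hypothetical, and this is only the textbook coefficient, not a reconstruction of the estimation methods compared in the study.

```python
def livingston_k2(reliability, variance, mean, cutoff):
    """Livingston's criterion-referenced reliability coefficient (textbook form).

    reliability: conventional (norm-referenced) reliability estimate
    variance:    variance of the observed test scores
    mean:        mean of the observed test scores
    cutoff:      criterion (cutting) score
    All argument names are illustrative assumptions.
    """
    if variance <= 0:
        raise ValueError("variance must be positive")
    # Squared distance between the mean score and the cutting score.
    shift = (mean - cutoff) ** 2
    return (reliability * variance + shift) / (variance + shift)
```

When the cutting score equals the mean, the coefficient reduces to the conventional reliability; it grows as the cutting score moves away from the mean.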

Raju, Nambury S. – Educational and Psychological Measurement, 1982
Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)
Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas

Wilcox, Rand R. – Educational and Psychological Measurement, 1979
For some situations the beta-binomial distribution might be used to describe the marginal distribution of test scores for a particular population of examinees. Several different methods of approximating the maximum likelihood estimate were investigated, and it was found that the Newton-Raphson method should be used when it yields admissible…
Descriptors: Criterion Referenced Tests, Maximum Likelihood Statistics, Measurement, Monte Carlo Methods
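As a rough illustration of the model in this abstract, the sketch below evaluates the beta-binomial log-likelihood for a set of number-correct scores and computes method-of-moments estimates, which are often used as starting values for an iterative maximum likelihood search. It is not the Newton-Raphson procedure studied in the paper; the function names and the choice of moment estimators are assumptions made for illustration.

```python
import math


def beta_binomial_log_likelihood(scores, n_items, alpha, beta):
    """Log-likelihood of number-correct scores under a beta-binomial model."""
    def log_beta(a, b):
        # log of the beta function B(a, b)
        return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)

    total = 0.0
    for x in scores:
        total += (math.log(math.comb(n_items, x))
                  + log_beta(x + alpha, n_items - x + beta)
                  - log_beta(alpha, beta))
    return total


def moment_estimates(scores, n_items):
    """Method-of-moments estimates of (alpha, beta).

    Assumes a nonzero mean score. The result can be nonpositive
    (inadmissible) when the scores vary less than a binomial would.
    """
    m1 = sum(scores) / len(scores)                 # first raw moment
    m2 = sum(x * x for x in scores) / len(scores)  # second raw moment
    denom = n_items * (m2 / m1 - m1 - 1) + m1
    alpha = (n_items * m1 - m2) / denom
    beta = (n_items - m1) * (n_items - m2 / m1) / denom
    return alpha, beta
```

A caller would typically check that both estimates are positive before using them to seed an iterative search.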

Powers, Stephen; And Others – Educational and Psychological Measurement, 1984
Spanish-speaking first graders were administered the Artes de Lenguage (ADL)--a Spanish, criterion-referenced, language arts test. Reliability analyses indicated the adequacy of three of the four subscales (Phonetic Analysis, Vocabulary Development, Comprehension Skills, and General Skills). A principal factors analysis of the intercorrelation…
Descriptors: Criterion Referenced Tests, Elementary Education, Grade 1, Hispanic Americans

Hanna, Gerald S.; Bennett, Judith A. – Educational and Psychological Measurement, 1984
The presently viewed role and utility of measures of instructional sensitivity are summarized. A case is made that the rationale for the assessment of instructional sensitivity can be applied to all achievement tests and should not be restricted to criterion-referenced mastery tests. (Author/BW)
Descriptors: Achievement Tests, Context Effect, Criterion Referenced Tests, Mastery Tests

Jaradat, Derar; Tollefson, Nona – Educational and Psychological Measurement, 1988
This study compared the reliability and validity indexes of randomly parallel tests administered under inclusion, exclusion, and correction for guessing directions, using 54 graduate students. It also compared the criterion-referenced grading decisions based on the different scoring methods. (TJH)
Descriptors: Criterion Referenced Tests, Grading, Graduate Students, Guessing (Tests)
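The correction for guessing mentioned in this abstract is conventionally computed as rights minus wrongs divided by one less than the number of options per item, with omitted items ignored. The sketch below shows that standard formula score; the function and argument names are hypothetical, and the abstract does not state the exact scoring rules the study used.

```python
def formula_score(num_right, num_wrong, num_options):
    """Standard correction-for-guessing score: R - W / (k - 1).

    num_right:   number of items answered correctly
    num_wrong:   number of items answered incorrectly (omits excluded)
    num_options: number of answer options per item (k)
    """
    if num_options < 2:
        raise ValueError("items need at least two options")
    return num_right - num_wrong / (num_options - 1)
```

For example, 40 right and 9 wrong on four-option items gives 40 - 9/3 = 37.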

Willoughby, T. Lee; Hutcheson, Sam J. – Educational and Psychological Measurement, 1978
The standard growth expectations for various levels and categories of the Quarterly Profile Examination were reported in this study as an approach to assessing edumetric validity. The results demonstrated the usefulness of computing standard growth expectations for tests used in the measurement of within-individual growth. (Author/JKS)
Descriptors: Achievement Gains, Criterion Referenced Tests, Growth Patterns, Higher Education