Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 3 |
Descriptor
Comparative Analysis | 32 |
Criterion Referenced Tests | 32 |
Test Reliability | 32 |
Norm Referenced Tests | 17 |
Test Validity | 16 |
Test Construction | 14 |
Statistical Analysis | 8 |
Item Analysis | 7 |
Test Theory | 6 |
Career Development | 5 |
Mathematical Models | 5 |
More ▼ |
Source
Author
Haladyna, Tom | 2 |
Shrock, Sharon | 2 |
Bashaw, W. L. | 1 |
Berk, Ronald A. | 1 |
Bernknopf, Stanley | 1 |
Brennan, Robert L. | 1 |
Chen, Tsuiping | 1 |
Conoyer, Sarah J. | 1 |
Coscarelli, William | 1 |
Crehan, Kevin D. | 1 |
Day, Gerald F. | 1 |
More ▼ |
Publication Type
Reports - Research | 19 |
Journal Articles | 8 |
Speeches/Meeting Papers | 5 |
Reports - Evaluative | 3 |
Opinion Papers | 2 |
Reports - Descriptive | 2 |
Dissertations/Theses -… | 1 |
Guides - General | 1 |
Information Analyses | 1 |
Education Level
Higher Education | 2 |
Grade 8 | 1 |
Audience
Counselors | 1 |
Practitioners | 1 |
Support Staff | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019
Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…
Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests
Ford, Jeremy W.; Conoyer, Sarah J.; Lembke, Erica S.; Smith, R. Alex; Hosp, John L. – Assessment for Effective Intervention, 2018
In the present study, two types of curriculum-based measurement (CBM) tools in science, Vocabulary Matching (VM) and Statement Verification for Science (SV-S), a modified Sentence Verification Technique, were compared. Specifically, this study aimed to determine whether the format of information presented (i.e., SV-S vs. VM) produces differences…
Descriptors: Curriculum Based Assessment, Evaluation Methods, Measurement Techniques, Comparative Analysis
Coscarelli, William; Shrock, Sharon – Performance Improvement Quarterly, 2002
Discusses problems in using traditional measures of reliability for criterion-referenced tests (CRTs) and describes two approaches to reliability for CRTs: estimates sensitive to all measures of error; and estimates of consistency in test outcome. Compares the two approaches and proposes recommendations for interpretation and use. (Author/LRW)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Measurement Techniques, Test Reliability

Crehan, Kevin D. – Journal of Educational Measurement, 1974
Various item selection techniques are compared on criterion-referenced reliability and validity. Techniques compared include three nominal criterion-referenced methods, a traditional point biserial selection, teacher selection, and random selection. (Author)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Item Banks
Moyer, Judith E.; Fishbein, Ronald L. – 1977
The problem that this research addressed was one of decision making. Given three sets of criterion-referenced tests which were designed to be parallel in content, would a traditional reliability coefficient produce different decisions about the reliability of those tests than would kappa? The procedure used collected statewide results on 136 test…
Descriptors: Analysis of Variance, Comparative Analysis, Criterion Referenced Tests, Measurement Techniques
Randall, Robert S. – 1972
Differences in design between norm referenced measures (NRM) and criterion referenced measures (CRM) are reviewed, and some of the procedures proposed on designing and evaluating CRM are examined. Differences in design of NRM and CRM are said to arise from the different purposes that underlie each measure. In addition, there are differences among…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Test Construction
Willoughby, Lee; And Others – 1976
This study compared a domain referenced approach with a traditional psychometric approach in the construction of a test. Results of the December, 1975 Quarterly Profile Exam (QPE) administered to 400 examinees at a university were the source of data. The 400 item QPE is a five alternative multiple choice test of information a "safe"…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Statistical Analysis
Downing, Steven M.; Mehrens, William A. – 1978
Four criterion-referenced reliability coefficicents were compared to the Kuder-Richardson estimates and to each other. The Kuder-Richardson formulas 20 and 21, the Livingston, the Subkoviak and two Huynh coefficients were computed for a random sample of 33 criterion-referenced tests. The Subkoviak coefficient yielded the highest mean value;…
Descriptors: Career Development, Comparative Analysis, Criterion Referenced Tests, Factor Analysis
Schwartz, Howard P. – 1974
Distinction between norm referenced and criterion referenced tests are explored in relationship to underlying philosophy and intent. In considering the use of a criterion referenced test for instructional purposes, consideration is given to: specification of objectives, item content and selection, reliability, and needs assessment. (Author)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Educational Assessment, Educational Needs
Berk, Ronald A. – 1979
As alternatives to the objectives-based approach to specifying content domains for test construction purposes, six strategies are proposed: (1) amplified objectives; (2) Instructional Objectives Exchange (IOX) test specifications; (3) item transformations; (4) item forms; (5) algorithms; and (6) mapping sentences. Their effectiveness is assessed…
Descriptors: Behavioral Objectives, Comparative Analysis, Criterion Referenced Tests, Evaluation Criteria
MacFarland, Thomas W. – 1985
Criterion-referenced evaluation (CRE) describes achievement in performance terms, whereas norm-referenced evaluation (NRE) compares the performance of one individual to that of others with respect to a given evaluation instrument. Vocational educators who base their programs on behaviorism commonly evaluate student performance from a CRE…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Secondary Education
Reid, Jerry B.; Roberts, Dennis M. – 1978
Comparisons of corresponding values of phi and kappa coefficients were made for 270 instances of data generated by a Monte Carlo technique to simulate a test-retest situation. Data were generated for distributions with the same mean but three different levels of standard deviation, standard error of measurement and cutting score. Ten samples of…
Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores

Lovett, Hubert T. – 1975
The reliability of a criterion referenced test was defined as a measure of the degree to which the test discriminates between an individual's level of performance and a predetermined criterion level. The variances of observed and true scores were defined as the squared deviation of the score from the criterion. Based on these definitions and the…
Descriptors: Career Development, Comparative Analysis, Criterion Referenced Tests, Mathematical Models

Brennan, Robert L.; Kane, Michael T. – Psychometrika, 1977
Using the assumption of randomly parallel tests and concepts from generalizability theory, three signal/noise ratios for domain-referenced tests are developed, discussed, and compared. The three ratios have the same noise but different signals depending upon the kind of decision to be made as a result of measurement. (Author/JKS)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Error of Measurement, Mathematical Models
Miller, Carson K. – 1984
A study was conducted at Stark Technical College to compare a normative-referenced test for mathematics placement with a criterion-referenced test that had been used by the college. The study sought to compare statistically the scores of 165 students on the Mathematics Inventory Test (MIT--a criterion-referenced test that had been developed…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Educational Diagnosis, Mathematics Skills