ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	3

Descriptor

Comparative Analysis	32
Criterion Referenced Tests	32
Test Reliability	32
Norm Referenced Tests	17
Test Validity	16
Test Construction	14
Statistical Analysis	8
Item Analysis	7
Test Theory	6
Career Development	5
Mathematical Models	5
Measurement Techniques	5
Test Interpretation	5
Behavioral Objectives	4
Cutting Scores	4
Educational Objectives	4
Error of Measurement	4
Language Tests	4
Scores	4
Test Items	4
Testing	4
Analysis of Variance	3
English (Second Language)	3
Evaluation Criteria	3
Evaluation Methods	3
More ▼

Source

Assessment for Effective…	1
Edinburgh Working Papers in…	1
Educational Measurement:…	1
Foreign Language Annals	1
Journal of Educational…	1
Journal of Science Education…	1
Language Testing	1
Performance Improvement…	1
Performance and Instruction	1
Psychometrika	1

Publication Type

Reports - Research	19
Journal Articles	8
Speeches/Meeting Papers	5
Reports - Evaluative	3
Opinion Papers	2
Reports - Descriptive	2
Dissertations/Theses -…	1
Guides - General	1
Information Analyses	1

Education Level

Higher Education	2
Grade 8	1

Audience

Counselors	1
Practitioners	1
Support Staff	1

Location

Iran	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Reliably Assessing Growth with Longitudinal Diagnostic Classification Models

Peer reviewed

Direct link

Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019

Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…

Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests

A Comparison of Two Content Area Curriculum-Based Measurement Tools

Peer reviewed

Direct link

Ford, Jeremy W.; Conoyer, Sarah J.; Lembke, Erica S.; Smith, R. Alex; Hosp, John L. – Assessment for Effective Intervention, 2018

In the present study, two types of curriculum-based measurement (CBM) tools in science, Vocabulary Matching (VM) and Statement Verification for Science (SV-S), a modified Sentence Verification Technique, were compared. Specifically, this study aimed to determine whether the format of information presented (i.e., SV-S vs. VM) produces differences…

Descriptors: Curriculum Based Assessment, Evaluation Methods, Measurement Techniques, Comparative Analysis

The Two Most Useful Approaches to Estimating Criterion-Referenced Test Reliability in a Single Test Administration.

Coscarelli, William; Shrock, Sharon – Performance Improvement Quarterly, 2002

Discusses problems in using traditional measures of reliability for criterion-referenced tests (CRTs) and describes two approaches to reliability for CRTs: estimates sensitive to all measures of error; and estimates of consistency in test outcome. Compares the two approaches and proposes recommendations for interpretation and use. (Author/LRW)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Measurement Techniques, Test Reliability

Item Analysis for Teacher-Made Mastery Tests

Peer reviewed

Crehan, Kevin D. – Journal of Educational Measurement, 1974

Various item selection techniques are compared on criterion-referenced reliability and validity. Techniques compared include three nominal criterion-referenced methods, a traditional point biserial selection, teacher selection, and random selection. (Author)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Item Banks

A Comparison of Kuder-Richardson Formula 20 and Kappa as Estimates of the Reliability of Criterion-Referenced Tests.

Moyer, Judith E.; Fishbein, Ronald L. – 1977

The problem that this research addressed was one of decision making. Given three sets of criterion-referenced tests which were designed to be parallel in content, would a traditional reliability coefficient produce different decisions about the reliability of those tests than would kappa? The procedure used collected statewide results on 136 test…

Descriptors: Analysis of Variance, Comparative Analysis, Criterion Referenced Tests, Measurement Techniques

Contrasting Norm Referenced and Criterion Referenced Measures.

Download full text

Randall, Robert S. – 1972

Differences in design between norm referenced measures (NRM) and criterion referenced measures (CRM) are reviewed, and some of the procedures proposed on designing and evaluating CRM are examined. Differences in design of NRM and CRM are said to arise from the different purposes that underlie each measure. In addition, there are differences among…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Test Construction

A Comparison of Domain-Referenced and Classic Psychometric Test Construction Methods.

Download full text

Willoughby, Lee; And Others – 1976

This study compared a domain referenced approach with a traditional psychometric approach in the construction of a test. Results of the December, 1975 Quarterly Profile Exam (QPE) administered to 400 examinees at a university were the source of data. The 400 item QPE is a five alternative multiple choice test of information a "safe"…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Statistical Analysis

Six Single-Administration Reliability Coefficients for Criterion-Referenced Tests: A Comparative Study.

Download full text

Downing, Steven M.; Mehrens, William A. – 1978

Four criterion-referenced reliability coefficicents were compared to the Kuder-Richardson estimates and to each other. The Kuder-Richardson formulas 20 and 21, the Livingston, the Subkoviak and two Huynh coefficients were computed for a random sample of 33 criterion-referenced tests. The Subkoviak coefficient yielded the highest mean value;…

Descriptors: Career Development, Comparative Analysis, Criterion Referenced Tests, Factor Analysis

Testing for Instructional Purposes: Norm Referenced--Criterion Referenced.

Download full text

Schwartz, Howard P. – 1974

Distinction between norm referenced and criterion referenced tests are explored in relationship to underlying philosophy and intent. In considering the use of a criterion referenced test for instructional purposes, consideration is given to: specification of objectives, item content and selection, reliability, and needs assessment. (Author)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Educational Assessment, Educational Needs

A Critical Review of Content Domain Specification/Item Generation Strategies for Criterion-Referenced Tests.

Berk, Ronald A. – 1979

As alternatives to the objectives-based approach to specifying content domains for test construction purposes, six strategies are proposed: (1) amplified objectives; (2) Instructional Objectives Exchange (IOX) test specifications; (3) item transformations; (4) item forms; (5) algorithms; and (6) mapping sentences. Their effectiveness is assessed…

Descriptors: Behavioral Objectives, Comparative Analysis, Criterion Referenced Tests, Evaluation Criteria

A Defense of Criterion-Referenced Evaluation for Vocational Education.

Download full text

MacFarland, Thomas W. – 1985

Criterion-referenced evaluation (CRE) describes achievement in performance terms, whereas norm-referenced evaluation (NRE) compares the performance of one individual to that of others with respect to a given evaluation instrument. Vocational educators who base their programs on behaviorism commonly evaluate student performance from a CRE…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Secondary Education

A Monte Carlo Comparison of Phi and Kappa as Measures of Criterion-Referenced Reliability.

Reid, Jerry B.; Roberts, Dennis M. – 1978

Comparisons of corresponding values of phi and kappa coefficients were made for 270 instances of data generated by a Monte Carlo technique to simulate a test-retest situation. Data were generated for distributions with the same mean but three different levels of standard deviation, standard error of measurement and cutting score. Ten samples of…

Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores

Elaboration and Application of a Theory of Criterion-Referenced Reliability.

PDF pending restoration

Lovett, Hubert T. – 1975

The reliability of a criterion referenced test was defined as a measure of the degree to which the test discriminates between an individual's level of performance and a predetermined criterion level. The variances of observed and true scores were defined as the squared deviation of the score from the criterion. Based on these definitions and the…

Descriptors: Career Development, Comparative Analysis, Criterion Referenced Tests, Mathematical Models

Signal/Noise Ratios for Domain-Referenced Tests

Peer reviewed

Brennan, Robert L.; Kane, Michael T. – Psychometrika, 1977

Using the assumption of randomly parallel tests and concepts from generalizability theory, three signal/noise ratios for domain-referenced tests are developed, discussed, and compared. The three ratios have the same noise but different signals depending upon the kind of decision to be made as a result of measurement. (Author/JKS)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Error of Measurement, Mathematical Models

Criterion-Referenced vs. Normative-Referenced Mathematics Placement Test Comparison.

Miller, Carson K. – 1984

A study was conducted at Stark Technical College to compare a normative-referenced test for mathematics placement with a criterion-referenced test that had been used by the college. The study sought to compare statistically the scores of 165 students on the Mathematics Inventory Test (MIT--a criterion-referenced test that had been developed…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Educational Diagnosis, Mathematics Skills

Previous Page | Next Page »

Pages: 1 | 2 | 3

Haladyna, Tom	2
Shrock, Sharon	2
Bashaw, W. L.	1
Berk, Ronald A.	1
Bernknopf, Stanley	1
Brennan, Robert L.	1
Chen, Tsuiping	1
Conoyer, Sarah J.	1
Coscarelli, William	1
Crehan, Kevin D.	1
Day, Gerald F.	1
Downing, Steven M.	1
Dwyer, Francis M.	1
Eignor, Daniel R.	1
Fishbein, Ronald L.	1
Ford, Jeremy W.	1
Ghonsooly, Behzad	1
Gross, Susan K.	1
Grulick, Lawrence Edward	1
Hallau, Margaret Gardner	1
Hambleton, Ronald K.	1
Hosp, John L.	1
Kane, Michael T.	1
Kunnan, Antony John	1
More ▼