ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Criterion Referenced Tests	25
Statistical Analysis	25
Test Interpretation	25
Test Reliability	12
Mathematical Models	11
Norm Referenced Tests	8
Test Construction	8
Cutting Scores	6
Item Analysis	6
Mastery Tests	6
Scores	6
Test Results	6
Testing	6
Measurement Techniques	5
Test Theory	5
Educational Objectives	4
Error of Measurement	4
Item Sampling	4
Test Validity	4
Bayesian Statistics	3
Career Development	3
Comparative Analysis	3
Decision Making	3
Evaluation	3
Evaluation Criteria	3
More ▼

Source

Journal of Special Education	2
Journal of Early Adolescence	1
Journal of Educational…	1
Language Assessment Quarterly	1

Publication Type

Reports - Research	14
Speeches/Meeting Papers	3
Information Analyses	2
Journal Articles	2
Guides - General	1
Reports - Descriptive	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Higher Education

Audience

Location

Delaware	1
Hawaii	1
Surinam	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

California Achievement Tests

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Direct link

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

Testing-Context Analysis: Assessment Is Just Another Part of Language Curriculum Development

Peer reviewed

Direct link

Brown, James Dean – Language Assessment Quarterly, 2008

In keeping with the theme of the International Language Testing Association/Language Testing Research Colloquium Conference in 2008, "Focusing on the Core: Justifying the Use of Language Assessments to Stakeholders," I define "stakeholder-friendly tests," "defensible testing," and "testing-context analysis."…

Descriptors: Language Usage, Curriculum Development, Testing, Language Tests

A Consumers' Guide to Criterion-Referenced Test "Reliability".

Berk, Ronald A. – 1980

Seventeen statistics for measuring the reliability of criterion-referenced tests were critically reviewed. The review was organized into two sections: (1) a discussion of preliminary considerations to provide a foundation for choosing the appropriate category of "reliability" (threshold loss function, squared-error loss-function, or…

Descriptors: Criterion Referenced Tests, Cutting Scores, Scoring Formulas, Statistical Analysis

Using Group Performance to Interpret Individual Responses to Criterion-Referenced Tests.

Download full text

Besel, Ronald – 1973

The contention that interpretation of a student's performance on a criterion referenced test should be independent of the performance of his classmates is challenged. The Mastery Learning Test Model, which was developed for analyzing criterion referenced test data, is described. An estimate of the proportion of students in an instructional group…

Descriptors: Criterion Referenced Tests, Mathematical Models, Measurement Instruments, Speeches

Improving Criterion-Referenced Measurement

Peer reviewed

Shoemaker, David M. – Journal of Special Education, 1972

Considered is the improvement of criterion-referenced measurement as applied to individual and group assessment of handicapped and normal children. (DB)

Descriptors: Criterion Referenced Tests, Evaluation, Exceptional Child Education, Handicapped Children

The Issue of Item and Test Variance for Criterion-Referenced Tests.

Download full text

Woodson, M. I. Charles E.

It has been argued that item variance and test variance are not necessary characteristics for criterion-referenced tests, although they are necessary for norm-referenced tests. This position is in error because it considers sample statistics as the criteria for evaluating items and tests. Within a particular sample, an item or test may have no…

Descriptors: Criterion Referenced Tests, Evaluation Criteria, Item Analysis, Item Sampling

Measurement Considerations for Criterion-Referenced Testing and Special Education

Peer reviewed

Gorth, William P.; Hambleton, Ronald K. – Journal of Special Education, 1972

Descriptors: Criterion Referenced Tests, Evaluation, Exceptional Child Education, Handicapped Children

An Empirical Investigation of Four Criterion-Referenced Testing Models.

Download full text

Epstein, Kenneth I. – 1975

Since the primary purpose of classical testing is to rank order examinees consistently, the absolute value of the true score has been relatively unimportant. However, the major purpose of criterion referenced testing is to estimate the true capabilities of examinees to perform specific tasks. Hence, the problems of true score determination assume…

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Mathematical Models, Military Personnel

Criterion-Referenced Testing: Comments on Reliability

Peer reviewed

Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1972

In this comment a recent attempt by Samuel A. Livingston to develop a theory of reliability for criterion-referenced measures is critiqued. For Livingston's rejoinder see TM 500 560. (Authors/MB)

Descriptors: Criterion Referenced Tests, Error of Measurement, Measurement Techniques, Response Style (Tests)

Criterion-Referenced Test Interpretations of "Classical" Measurement Theory.

Download full text

Epstein, Kenneth I.; Knerr, Claramae S. – 1976

The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…

Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

Statistical Techniques for Criterion-Referenced Tests. Final Report. October, 1976-October, 1977.

Wilcox, Rand R. – 1977

Three statistical problems related to criterion-referenced testing are investigated: estimation of the likelihood of a false-positive or false-negative decision with a mastery test, estimation of true scores in the Compound Binomial Error Model, and comparison of the examinees to a control. Two methods for estimating the likelihood of…

Descriptors: Criterion Referenced Tests, Cutting Scores, Error Patterns, Item Sampling

Agreement Coefficients as Indices of Dependability for Domain-Referenced Tests. ACT Technical Bulletin No. 28.

Download full text

Kane, Michael T.; Brennan, Robert L. – 1977

A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper, it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance, and one that is not. The special cases of…

Descriptors: Bayesian Statistics, Correlation, Criterion Referenced Tests, Cutting Scores

Mastery-Learning Decision Variables.

Download full text

Besel, Ronald – 1971

The Mastery-Learning test model is extended. Methods for estimating prior probabilities are described. The use of an adjustment matrix to transform a probability of mastery measure and empirical methods for estimating adjustment matrix parameters are derived. Adjustment matrices are interpreted as indicators of instructional effectiveness and as…

Descriptors: Criterion Referenced Tests, Decision Making, Groups, Individual Testing

An Empirical Investigation of the ESEA Title I Evaluation Systems' Proposed Variance Estimation Procedures for Use With Criterion Referenced Tests.

Long, John; And Others – 1978

An experiment was performed to evaluate the tenability of the assumption in the Elementary Secondary Education Act (ESEA) Title I proposed variance estimation procedures for criterion referenced tests. The assumption is that the ratio of the local to the national standard deviation for the national sample will be the same for the normed test as…

Descriptors: Compensatory Education, Criterion Referenced Tests, Educational Assessment, Elementary Education

Criterion-Referenced Testing: A Critical Analysis of Selected Models. Technical Paper 306. Final Report

Download full text

Steinheiser, Frederick H., Jr.; And Others – 1978

Alternative mathematical models for scoring and decision making with criterion referenced tests are described, especially as they concern appropriate test length and methods of establishing statistically valid cutting scores. Several of these approaches are reviewed and compared on formal-analytic and empirical grounds: (1) Block's approach to…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Cutting Scores, Decision Making

Previous Page | Next Page »

Pages: 1 | 2

Besel, Ronald	2
Brennan, Robert L.	2
Epstein, Kenneth I.	2
Berk, Ronald A.	1
Blatchford, Charles H.	1
Bormuth, John R.	1
Brown, James Dean	1
Drenth, Pieter J. D.	1
Fan, Xitao	1
Gorth, William P.	1
Haladyna, Thomas	1
Haladyna, Tom	1
Hambleton, Ronald K.	1
Izard, J. F.	1
Kane, Michael T.	1
Knerr, Claramae S.	1
Long, John	1
Millman, Jason	1
Roid, Gale	1
Shavelson, Richard J.	1
Shoemaker, David M.	1
Steinheiser, Frederick H., Jr.	1
Sun, Shaojing	1
Tatsuoka, Kikumi K.	1
More ▼