Showing 1 to 15 of 30 results
Setzer, J. Carl – GED Testing Service, 2009
The GED[R] English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional English reading skills of adults whose first…
Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills
Haladyna, Thomas M. – 1974
Classical test theory has been rejected for application to criterion-referenced (CR) tests by most psychometricians due to an expected lack of variance in scores and other difficulties. The present study was conceived to resolve the variance problem and explore the possibility that classical test theory is both appropriate and desirable for some…
Descriptors: Criterion Referenced Tests, Error of Measurement, Sampling, Test Construction
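For context (a standard classical-theory identity, not taken from this abstract): reliability is the ratio of true-score to observed-score variance,

\rho_{XX'} = \sigma_T^2 / \sigma_X^2 = (\sigma_X^2 - \sigma_E^2) / \sigma_X^2,

so when a criterion-referenced test yields little observed-score variance, \sigma_X^2 approaches \sigma_E^2 and the conventional coefficient approaches zero even if absolute measurement error is small. This is the "lack of variance" difficulty the study addresses.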
Peer reviewed
Kane, Michael T. – Journal of Educational Measurement, 1986
These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)
Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory
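A sketch of the kind of simple a priori estimate at issue, using Kelley's classical regressed-score formula (the article's exact procedure may differ):

\hat{\tau}_i = \rho_{XX'} X_i + (1 - \rho_{XX'}) \bar{X}.

When \rho_{XX'} < 0.5, the group mean carries more weight than the individual observed score, which is why a simple prior-based estimate can outperform observed scores as an estimate of the universe score.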
Livingston, Samuel A. – 1976
A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)
Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement
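For reference, the coefficient in question is usually written as follows (stated from the standard literature, with C the criterion or cut score):

k^2(X, T_X) = [\rho_{XX'} \sigma_X^2 + (\mu_X - C)^2] / [\sigma_X^2 + (\mu_X - C)^2],

so distance of the group mean from the cut score contributes to the coefficient even when observed-score variance is small.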
Kane, Michael; Wilson, Jennifer – 1982
This paper evaluates the magnitude of the total error in estimates of the difference between an examinee's domain score and the cutoff score. An observed score based on a random sample of items from the domain, and an estimated cutoff score derived from a judgmental standard setting procedure are assumed. The work of Brennan and Lockwood (1980) is…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Mastery Tests
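A hedged sketch of the error decomposition implied here, assuming the item-sampling error in the observed score X and the error in the estimated cutoff \hat{c} are independent:

\sigma_E^2(X - \hat{c}) = \sigma_E^2(X) + \sigma_E^2(\hat{c}),

so the total error in a mastery decision combines sampling error over domain items with error from the judgmental standard-setting procedure.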
Reid, Jerry B.; Roberts, Dennis M. – 1978
Comparisons of corresponding values of phi and kappa coefficients were made for 270 instances of data generated by a Monte Carlo technique to simulate a test-retest situation. Data were generated for distributions with the same mean but three different levels of standard deviation, standard error of measurement and cutting score. Ten samples of…
Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores
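A minimal Python sketch of this kind of comparison (all parameter values here are illustrative assumptions, not those of the study): simulate a test-retest mastery classification under a binomial model and compute phi and Cohen's kappa from the resulting 2 x 2 table of proportions.

import numpy as np

rng = np.random.default_rng(0)
n_examinees, n_items, cut = 1000, 40, 28   # assumed values, for illustration only

# True proportion-correct scores and two randomly parallel binomial test forms
true_p = rng.beta(8, 4, size=n_examinees)
form1 = rng.binomial(n_items, true_p)
form2 = rng.binomial(n_items, true_p)

m1, m2 = form1 >= cut, form2 >= cut              # mastery classifications
a = np.mean(m1 & m2)                             # master on both occasions
b = np.mean(m1 & ~m2)
c = np.mean(~m1 & m2)
d = np.mean(~m1 & ~m2)

p_o = a + d                                      # observed agreement
p_e = (a + b) * (a + c) + (c + d) * (b + d)      # chance agreement
kappa = (p_o - p_e) / (1 - p_e)
phi = (a * d - b * c) / np.sqrt((a + b) * (c + d) * (a + c) * (b + d))
print(f"phi = {phi:.3f}, kappa = {kappa:.3f}")

Repeating the simulation across different standard deviations and cutting scores reproduces the kind of grid of comparisons the study examines.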
Divgi, D. R. – 1978
One aim of criterion-referenced testing is to classify an examinee without reference to a norm group; therefore, any statements about the dependability of such classification ought to be group-independent also. A population-independent index is proposed in terms of the probability of incorrect classification near the cutoff true score. The…
Descriptors: Criterion Referenced Tests, Cutting Scores, Difficulty Level, Error of Measurement
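One common way to formalize such an index (a sketch under a binomial error model; the article's own definition may differ): for an examinee with true proportion-correct \pi, an n-item test, cutoff true score \pi_0, and observed cut score c,

P(misclassification | \pi) = P(X \ge c | \pi) if \pi < \pi_0, and P(X < c | \pi) if \pi \ge \pi_0, with X ~ Binomial(n, \pi),

which depends only on the test and the cutoff, not on the score distribution of any norm group.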
Peer reviewed
Brennan, Robert L.; Kane, Michael T. – Psychometrika, 1977
Using the assumption of randomly parallel tests and concepts from generalizability theory, three signal/noise ratios for domain-referenced tests are developed, discussed, and compared. The three ratios have the same noise but different signals depending upon the kind of decision to be made as a result of measurement. (Author/JKS)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Error of Measurement, Mathematical Models
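In generic terms (a sketch only; the article develops three specific variants), a generalizability-theory signal/noise ratio for domain-referenced decisions relative to a cut score \lambda can be written

S/N(\lambda) = [\sigma^2(p) + (\mu - \lambda)^2] / \sigma^2(\Delta),

where \sigma^2(\Delta) is the absolute error variance; the shared denominator is the common "noise," and the numerator (the signal) changes with the kind of decision being made.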
Peer reviewed
Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1972
In this comment a recent attempt by Samuel A. Livingston to develop a theory of reliability for criterion-referenced measures is critiqued. For Livingston's rejoinder see TM 500 560. (Authors/MB)
Descriptors: Criterion Referenced Tests, Error of Measurement, Measurement Techniques, Response Style (Tests)
Scheetz, James P.; vonFraunhofer, J. Anthony – 1980
Subkoviak suggested a technique for estimating both group reliability and the reliability associated with assigning a given individual to a mastery or non-mastery category based on a single test administration. Two assumptions underlie this model. First, it is assumed that had successive test administrations occurred, scores for each individual…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Higher Education
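For context, Subkoviak's single-administration agreement estimate is commonly written as (stated from the standard literature; the paper's application may differ):

\hat{p} = (1/N) \sum_{i=1}^{N} [\hat{P}_i^2 + (1 - \hat{P}_i)^2],

where \hat{P}_i is the estimated probability, under a binomial model with a regressed true-score estimate, that examinee i would be classified a master on a randomly parallel form.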
Bridgeman, Brent – 1974
This experiment was designed to assess the ability of item writers to construct truly parallel tests based on a "duplicate-construction experiment" in which Cronbach argues that if the universe description and sampling are ideally refined, the two independently constructed tests will be entirely equivalent, and that within the limits of item…
Descriptors: Criterion Referenced Tests, Error of Measurement, Item Analysis, Norm Referenced Tests
Harris, Chester W. – 1971
Livingston's work is a careful analysis of what occurs when one pools two populations with different means, but similar variances and reliability coefficients. However, his work fails to advance reliability theory for the special case of criterion-referenced testing. See ED 042 802 for Livingston's paper. (MS)
Descriptors: Analysis of Variance, Criterion Referenced Tests, Error of Measurement, Reliability
Peer reviewed
Schaeffer, Gary A.; And Others – Evaluation Review, 1986
The reliability of criterion-referenced tests (CRTs) used in health program evaluation can be conceptualized in different ways. Formulas are presented for estimating appropriate standard error of measurement (SEM) for CRTs. The SEM can be used in computing confidence intervals for domain score estimates and for a cut-score. (Author/LMO)
Descriptors: Accountability, Criterion Referenced Tests, Cutting Scores, Error of Measurement
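As a reference point, the standard classical formulas behind such intervals (not necessarily the exact ones presented in the article) are

SEM = \sigma_X \sqrt{1 - \rho_{XX'}}, with an approximate interval X \pm z_{1-\alpha/2} SEM,

and the same interval logic can be applied around a domain-score estimate or a cut score when judging mastery decisions.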
Peer reviewed
Livingston, Samuel A. – Journal of Educational Measurement, 1972
This article is a reply to a previous paper (see TM 500 488) interpreting Livingston's original article (see TM 500 487). (CK)
Descriptors: Criterion Referenced Tests, Error of Measurement, Norm Referenced Tests, Test Construction
Haladyna, Tom; Roid, Gale – 1981
Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…
Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction
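The latent-trait rationale for matching item difficulty to achievement level can be illustrated with the Rasch model (a standard result, added here for context):

P(correct | \theta, b) = e^{\theta - b} / (1 + e^{\theta - b}), with item information I(\theta) = P(1 - P),

which is maximized when the item difficulty b equals the examinee's achievement level \theta.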