ERIC - Search Results

Showing 1 to 15 of 17 results

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

Estimating Conditional Distributions of Scores on an Alternate Form of a Test. Research Report. ETS RR-15-18

Peer reviewed

Livingston, Samuel A.; Chen, Haiwen H. – ETS Research Report Series, 2015

Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…

Descriptors: Scores, Statistical Distributions, Research Reports, Equated Scores

A Note on the Interpretation of the Criterion-Referenced Reliability Coefficient

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1973

Article commented on a study by Harris, who presented formulas for the variance of errors of estimation (of a true score from an observed score) and the variance of errors of prediction (of an observed score from an observed score on a parallel test). (Author/RK)

Descriptors: Criterion Referenced Tests, Measurement, Norm Referenced Tests, Test Reliability

A Study of the Reliability of Nedelsky's Method for Choosing a Passing Score.

Livingston, Samuel A.; Kastrinos, William – 1982

Leo Nedelsky developed a method for determining absolute grading standards for multiple choice tests. His method required a group of judges to examine each test question and eliminate those responses which the lowest D- student should be able to reject as incorrect. The correct answer probabilities remaining were used in computing an expected test…

Descriptors: Cutting Scores, Judges, Multiple Choice Tests, Real Estate

Reply to Shavelson, Block, and Ravitch's "Criterion-Referenced Testing: Comments on Reliability"

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1972

Author replies to article TM 500 559. (MB)

Descriptors: Criterion Referenced Tests, Measurement Techniques, Norm Referenced Tests, Scoring

Estimating the Consistency and Accuracy of Classifications Based on Test Scores.

Peer reviewed

Livingston, Samuel A.; Lewis, Charles – Journal of Educational Measurement, 1995

A method is presented for estimating the accuracy and consistency of classifications based on test scores. The reliability of the score is used to estimate effective test length in terms of discrete items. The true-score distribution is estimated by fitting a four-parameter beta model. (SLD)

Descriptors: Classification, Estimation (Mathematics), Scores, Statistical Distributions

Some Observations on the Estimation of True Scores.

Livingston, Samuel A. – 1970

The procedure of estimating true scores by means of a transformation of the obtained score based on the reliability coefficient is compared with the use of the obtained score without transformation. Using the mean squared error as a criterion, the transformed score is a better estimate for most examinees but poorer for those whose true scores lie…

Descriptors: Analysis of Variance, Measurement, Raw Scores, Scores
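
The abstract above does not name the transformation it evaluates. A minimal sketch, assuming it is the standard regression (Kelley) estimate, which pulls the obtained score toward the group mean in proportion to the test's unreliability; the function and parameter names are illustrative and do not come from the report:

```python
def kelley_true_score(observed, reliability, group_mean):
    """Estimate a true score by regressing the observed score toward the mean.

    Assumption: this is the classical Kelley estimate,
    reliability * observed + (1 - reliability) * group_mean.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must be between 0 and 1")
    return reliability * observed + (1.0 - reliability) * group_mean


# With reliability 0.5, an observed score of 80 in a group averaging 60
# is pulled halfway back toward the mean.
estimate = kelley_true_score(80.0, 0.5, 60.0)
```

Under this estimate, a perfectly reliable test leaves the obtained score unchanged, and a test with zero reliability assigns every examinee the group mean.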

The Criterion-Referenced Reliability of a Single Score. Report 76-01.

Livingston, Samuel A. – 1976

A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)

Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement

Reliability of Tests Used to Make Pass/Fail Decisions: Answering the Right Questions.

Livingston, Samuel A. – 1978

The traditional reliability coefficient and standard error of measurement are not adequate measures of reliability for tests used to make pass/fail decisions. Answering the important reliability questions requires estimation of the joint distribution of true and observed scores. Lord's "Method 20" estimates this distribution without the…

Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement

Estimating the Reliability of Classifications Based on Composite Scores.

Livingston, Samuel A. – 1984

Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…

Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models

A Reply to Harris's "An Interpretation of Livingston's Reliability Coefficient for Criterion-Referenced Tests"

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1972

This article is a reply to a previous paper (see TM 500 488) interpreting Livingston's original article (see TM 500 487). (CK)

Descriptors: Criterion Referenced Tests, Error of Measurement, Norm Referenced Tests, Test Construction

The Reliability of Criterion-Referenced Measures.

Livingston, Samuel A. – 1970

The assumptions of the classical test-theory model are used to develop a theory of reliability for criterion-referenced measures which parallels that for norm-referenced measures. It is shown that the Spearman-Brown formula holds for criterion-referenced measures and that the criterion-referenced reliability coefficient can be used to correct…

Descriptors: Correlation, Criterion Referenced Tests, Measurement Instruments, Norm Referenced Tests
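
For reference, the Spearman-Brown formula mentioned in this abstract predicts the reliability of a test lengthened by a given factor. A minimal sketch of the classical formula follows; the function and parameter names are illustrative, and the report's own notation is not shown in the abstract:

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability when a test is lengthened by `length_factor`.

    Classical form: k * r / (1 + (k - 1) * r), where r is the current
    reliability and k is the ratio of new length to old length.
    """
    if length_factor <= 0:
        raise ValueError("length_factor must be positive")
    return length_factor * reliability / (1.0 + (length_factor - 1.0) * reliability)


# Doubling a test whose reliability is 0.5.
doubled = spearman_brown(0.5, 2.0)
```

A length factor of 1 returns the original reliability, and factors below 1 model a shortened test.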

Assessing the Reliability of Tests Used to Make Pass/Fail Decisions.

Peer reviewed

Livingston, Samuel A.; Wingersky, Marilyn A. – Journal of Educational Measurement, 1979

Procedures are described for studying the reliability of decisions based on specific passing scores with tests made up of discrete items and designed to measure continuous rather than categorical traits. These procedures are based on the estimation of the joint distribution of true scores and observed scores. (CTM)

Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement

Criterion-Referenced Applications of Classical Test Theory

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1972

A reliability coefficient for criterion-referenced tests is developed from the assumptions of classical test theory. The coefficient is based on deviations of scores from the criterion score, rather than from the mean. (Author/CK)

Descriptors: Criterion Referenced Tests, Error of Measurement, Mathematical Applications, Norm Referenced Tests
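
The coefficient described in this abstract is usually written as K² = (r·s² + (m − C)²) / (s² + (m − C)²), where r is the conventional reliability, s² the observed-score variance, m the mean, and C the criterion (cut) score. A minimal sketch of that commonly cited form follows; the function and parameter names are illustrative, not taken from the article:

```python
def livingston_k2(reliability, variance, mean, cut_score):
    """Criterion-referenced reliability in the commonly cited K-squared form.

    Deviations are taken from the cut score rather than from the mean, so
    the squared distance between mean and cut score is added to both the
    true-score variance (numerator) and the observed variance (denominator).
    """
    if variance <= 0:
        raise ValueError("variance must be positive")
    shift = (mean - cut_score) ** 2
    return (reliability * variance + shift) / (variance + shift)


# Reliability 0.8, variance 25, mean 70, cut score 65.
k2 = livingston_k2(0.8, 25.0, 70.0, 65.0)
```

When the cut score equals the mean, the added term is zero and the coefficient reduces to the conventional reliability; it rises toward 1 as the cut score moves away from the mean.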

Issues in Testing for Competency.

Ebel, Robert L.; Livingston, Samuel A. – NCME Measurement in Education, 1981

This issue of Measurement in Education is presented in the form of a dialogue between Dr. Robert L. Ebel, Distinguished Professor of Educational Measurement at Michigan State University, and Dr. Samuel A. Livingston, Program Research Scientist at the Educational Testing Service. Alternative views on some aspects of the use of tests in assessing…

Descriptors: Competence, Criterion Referenced Tests, Multiple Choice Tests, Norm Referenced Tests
