ERIC - Search Results

Descriptor

Mathematical Models	8
Statistical Analysis	8
Test Length	8
Test Reliability	6
Equated Scores	3
Comparative Analysis	2
Criterion Referenced Tests	2
Cutting Scores	2
Elementary Secondary Education	2
Error of Measurement	2
Item Analysis	2
Mastery Tests	2
Measurement Techniques	2
Sampling	2
Statistical Data	2
Test Interpretation	2
Test Validity	2
Ability Identification	1
Achievement Tests	1
Adaptive Testing	1
Cognitive Measurement	1
College Entrance Examinations	1
Computer Assisted Testing	1
Computer Programs	1
Correlation	1
More ▼

Source

Journal of Educational…	1
Psychometrika	1

Author

Budescu, David	1
Cliff, Norman	1
Feldt, Leonard S.	1
Gilmer, Jerry S.	1
Hutten, Leah R.	1
Kristof, Walter	1
Livingston, Samuel A.	1
Steinheiser, Frederick H., Jr.	1
Subkoviak, Michael J.	1

Publication Type

Reports - Research	6
Speeches/Meeting Papers	2
Information Analyses	1
Journal Articles	1
Tests/Questionnaires	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Stanford Binet Intelligence…

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Efficiency of Linear Equating as a Function of the Length of the Anchor Test.

Peer reviewed

Budescu, David – Journal of Educational Measurement, 1985

An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)

Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Estimating the Reliability of Classifications Based on Composite Scores.

Download full text

Livingston, Samuel A. – 1984

Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…

Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models

On the Theory of a Set of Tests Which Differ Only in Length

Peer reviewed

Kristof, Walter – Psychometrika, 1971

Descriptors: Cognitive Measurement, Error of Measurement, Mathematical Models, Psychological Testing

The Standard Errors of the Feldt-Gilmer Congeneric Reliability Coefficients: Iowa Testing Programs Occasional Papers. Number 31.

PDF pending restoration

Gilmer, Jerry S.; Feldt, Leonard S. – 1982

The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…

Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models

Evaluation of Criterion-Referenced Reliability Coefficients. Final Report.

Download full text

Subkoviak, Michael J. – 1977

Four different procedures were used for estimating the proportion of persons who would be classified consistently as either passing both of two parallel tests or failing both. These four methods were applied at each of four different mastery level scores for each of three different length tests. Data were based on 50 replications of each procedure…

Descriptors: Criterion Referenced Tests, Cutting Scores, Data Analysis, Data Collection

Criterion-Referenced Testing: A Critical Analysis of Selected Models. Technical Paper 306. Final Report

Download full text

Steinheiser, Frederick H., Jr.; And Others – 1978

Alternative mathematical models for scoring and decision making with criterion referenced tests are described, especially as they concern appropriate test length and methods of establishing statistically valid cutting scores. Several of these approaches are reviewed and compared on formal-analytic and empirical grounds: (1) Block's approach to…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Cutting Scores, Decision Making

Evaluations of Implied Orders as a Basis for Tailored Testing Using Simulations. Technical Report No. 4.

Cliff, Norman; And Others – 1977

TAILOR is a computer program that uses the implied orders concept as the basis for computerized adaptive testing. The basic characteristics of TAILOR, which does not involve pretesting, are reviewed here and two studies of it are reported. One is a Monte Carlo simulation based on the four-parameter Birnbaum model and the other uses a matrix of…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Difficulty Level

A Comparison of the Fit of Empirical Data to Two Latent Trait Models. Report No. 92.

Hutten, Leah R. – 1979

Goodness of fit of raw test score data were compared, using two latent trait models: the Rasch model and the Birnbaum three-parameter logistic model. Data were taken from various achievement tests and the Scholastic Aptitude Test (Verbal). A minimum sample size of 1,000 was required, and the minimum test length was 40 items. Results indicated that…

Descriptors: Ability Identification, Achievement Tests, College Entrance Examinations, Comparative Analysis