Wyse, Adam E. – Educational and Psychological Measurement, 2011
Standard setting is a method used to set cut scores on large-scale assessments. One of the most popular standard setting methods is the Bookmark method. In the Bookmark method, panelists are asked to envision a response probability (RP) criterion and move through a booklet of ordered items based on an RP criterion. This study investigates whether…
Descriptors: Testing Programs, Standard Setting (Scoring), Cutting Scores, Probability
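The core mechanic described in the abstract above can be sketched in a few lines. This is an illustrative toy, not the study's procedure: item parameters, the 2PL response model, and the borderline ability value are all assumptions chosen for the example; only the RP67 convention and the "first item below the RP criterion" placement rule come from standard descriptions of the Bookmark method.

```python
import numpy as np

def p_correct(theta, a, b):
    """2PL IRT probability that an examinee of ability theta answers correctly."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def bookmark_page(theta_borderline, a, b, rp=0.67):
    """Index (0-based) of the first item in the ordered booklet whose success
    probability for the borderline examinee falls below the RP criterion --
    the page where the panelist would place the bookmark."""
    order = np.argsort(b)                       # ordered item booklet, easy to hard
    probs = p_correct(theta_borderline, a[order], b[order])
    below = np.where(probs < rp)[0]
    return int(below[0]) if below.size else len(b)   # bookmark after last item if none

# Hypothetical item parameters (discrimination a, difficulty b).
a = np.array([1.0, 1.2, 0.8, 1.1, 0.9])
b = np.array([-1.5, -0.5, 0.0, 0.8, 1.6])
page = bookmark_page(theta_borderline=0.5, a=a, b=b)
```

The cut score is then taken from the ability scale at the bookmarked item, which is where design choices (RP value, response model) materially change the result.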

Wilcox, Rand R. – Educational and Psychological Measurement, 1981
This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)
Descriptors: Criterion Referenced Tests, Scoring, Test Reliability

Rozeboom, William W. – Educational and Psychological Measurement, 1978
A strict equivalence presupposed by Kaiser and Michael to derive the coefficient of "domain validity" is defensible only as a biased approximation. But then, it is far from clear what psychometric significance this coefficient has in the first place. (Author)
Descriptors: Criterion Referenced Tests, Item Analysis, Item Banks, Test Validity

Lovett, Hubert T. – Educational and Psychological Measurement, 1977
The analysis of variance model for estimating reliability in norm-referenced tests is extended to criterion-referenced tests. The essential modification is that the criterion, or cut-off, score is substituted for the population mean. An example and discussion are presented. (JKS)
Descriptors: Analysis of Variance, Criterion Referenced Tests, Cutting Scores, Test Reliability
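The substitution described above, measuring deviations from the cut score rather than from the mean, is the same idea behind Livingston's criterion-referenced reliability coefficient (named in a later entry in this list). A minimal sketch of that coefficient, with toy data and an assumed norm-referenced reliability, looks like this; it is not Lovett's ANOVA derivation, only the simpler closed form:

```python
import numpy as np

def livingston_k2(scores, cut, rel_xx):
    """Livingston-style criterion-referenced reliability: the usual
    reliability formula with squared deviations taken from the cut
    score rather than the group mean."""
    scores = np.asarray(scores, dtype=float)
    var = scores.var()                    # population variance (ddof=0)
    d2 = (scores.mean() - cut) ** 2       # squared distance of mean from cut
    return (rel_xx * var + d2) / (var + d2)

# Toy scores with an assumed norm-referenced reliability of 0.80.
scores = [12, 15, 18, 20, 22, 25, 27, 30]
k2 = livingston_k2(scores, cut=10, rel_xx=0.80)
```

Note the design consequence: when the group mean sits far from the cut score, the coefficient exceeds the norm-referenced reliability, reducing to it only when the cut equals the mean.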

Young, James C.; And Others – Educational and Psychological Measurement, 1970
Descriptors: Achievement Tests, Algebra, Criterion Referenced Tests, Mathematics

Huynh, Huynh – Educational and Psychological Measurement, 1990
Within the multivariate normality framework, a formula is provided for computation of the criterion-related validity of composite scores based on the highest (or lowest) of several equivalent measures. This partial composite score has more validity than each single observation, but less validity than a composite based on all observations. (SLD)
Descriptors: Concurrent Validity, Criterion Referenced Tests, Equations (Mathematics), Mathematical Models

The Effect of Violating the Assumption of Equal Item Means in Estimating the Livingston Coefficient.
Lovett, Hubert T. – Educational and Psychological Measurement, 1978
The validity of five methods of estimating the reliability of criterion-referenced tests was evaluated across nine conditions of variability among item means. The results were analyzed by analysis of variance, the Newman-Keuls test, and a nonparametric procedure. There was a tendency for all of the methods to be conservative. (Author/JKS)
Descriptors: Analysis of Variance, Criterion Referenced Tests, Item Analysis, Nonparametric Statistics

Macready, George B.; Merwin, Jack C. – Educational and Psychological Measurement, 1973
In this paper, consideration is given to the nature of the relationships among items within item forms, and how these relationships compare with an ideal case for diagnostic tests in which answering one item within an item form correctly implies answering all items in that form correctly. (Authors)
Descriptors: Criterion Referenced Tests, Diagnostic Tests, Homogeneous Grouping, Item Analysis

Raju, Nambury S. – Educational and Psychological Measurement, 1982
Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)
Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas

Roid, G. H.; Haladyna, Thomas M. – Educational and Psychological Measurement, 1978
Two techniques for writing achievement test items to accompany instructional materials are contrasted: writing items from statements of instructional objectives, and writing items from semi-automated rules for transforming instructional statements. Both systems resulted in about the same number of faulty items. (Author/JKS)
Descriptors: Achievement Tests, Comparative Analysis, Criterion Referenced Tests, Difficulty Level

Hambleton, Ronald K. – Educational and Psychological Measurement, 1987
This paper presents an algorithm for determining the number of items to measure each objective in a criterion-referenced test when testing time is fixed and when the objectives vary in their levels of importance, reliability, and validity. Results of four special applications of the algorithm are presented. (BS)
Descriptors: Algorithms, Behavioral Objectives, Criterion Referenced Tests, Test Construction

Martin, John D.; Rudolph, Linda – Educational and Psychological Measurement, 1972
The SIT correlates highly enough with ACT scores to be considered a valid instrument for predicting acceptance and success in college. (Authors)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Grade Point Average, Intelligence Tests

Wilcox, Rand R. – Educational and Psychological Measurement, 1981
The paper considers the problem of selecting the t best of k normal populations and simultaneously determining whether the selected populations have a mean larger than a known standard. Illustrations are given for selecting the t best of k examinees when the binomial error model applies. (Author)
Descriptors: Competitive Selection, Criterion Referenced Tests, Decision Making, Mathematical Models

Spineti, John P.; Hambleton, Ronald K. – Educational and Psychological Measurement, 1977
The effectiveness of various tailored testing strategies for use in objective based instructional programs was investigated. The three factors of a tailored testing strategy under study with various hypothetical distributions of abilities across two learning hierarchies were test length, mastery cutting score, and starting point. (Author/JKS)
Descriptors: Adaptive Testing, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores

Nolan, James S.; Jacobson, James – Educational and Psychological Measurement, 1972
In general, Achievement Test scores appeared to be more valid predictors of grades in English and Mathematics courses than were scores on a scholastic aptitude or general intelligence test. (Authors)
Descriptors: Achievement Tests, College Freshmen, Criterion Referenced Tests, High School Freshmen