ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	5

Descriptor

Criterion Referenced Tests	29
Scoring Formulas	29
Measurement Techniques	9
Norm Referenced Tests	9
Test Interpretation	9
Test Reliability	9
Test Validity	9
Scoring	7
Cutting Scores	6
Statistical Analysis	6
Testing	6
Achievement Tests	5
Elementary Secondary Education	5
Guessing (Tests)	5
Test Construction	5
Correlation	4
Evaluation Criteria	4
Item Analysis	4
Multiple Choice Tests	4
Scores	4
Test Theory	4
True Scores	4
Comparative Analysis	3
Confidence Testing	3
Educational Testing	3
More ▼

Source

Evaluation in Education:…	2
Assessment in Education:…	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational Testing Service	1
Educational and Psychological…	1
Journal of Educational…	1
Journal of Experimental…	1
Journal of Special Education	1
Language Assessment Quarterly	1
Review of Educational Research	1
Spectrum	1
Studies in Educational…	1
More ▼

Publication Type

Reports - Research	13
Journal Articles	9
Speeches/Meeting Papers	4
Reports - Evaluative	3
Tests/Questionnaires	3
Information Analyses	2
Reports - Descriptive	2
Collected Works - General	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Opinion Papers	1
More ▼

Education Level

Higher Education	3
Adult Education	1
Elementary Secondary Education	1
High Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Australia	1
Japan	1
Kansas	1

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	2
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 29 results Save | Export

Meta-Analysis of Criterion Validity for Curriculum-Based Measurement in Written Language

Peer reviewed

Direct link

Romig, John Elwood; Therrien, William J.; Lloyd, John W. – Journal of Special Education, 2017

We used meta-analysis to examine the criterion validity of four scoring procedures used in curriculum-based measurement of written language. A total of 22 articles representing 21 studies (N = 21) met the inclusion criteria. Results indicated that two scoring procedures, correct word sequences and correct minus incorrect sequences, have acceptable…

Descriptors: Meta Analysis, Curriculum Based Assessment, Written Language, Scoring Formulas

Guessing and the Rasch Model

Peer reviewed

Direct link

Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016

Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…

Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests

Establishing and Applying Performance Standards for Curriculum-Based Examinations

Peer reviewed

Direct link

Bennett, John; Tognolini, Jim; Pickering, Samantha – Assessment in Education: Principles, Policy & Practice, 2012

This paper describes how a state education system in Australia introduced standards-referenced assessments into its large-scale, high-stakes, curriculum-based examinations in a way that enables comparison of performance across time even though the examinations are different each year. It describes the multi-stage modified Angoff standard-setting…

Descriptors: Feedback (Response), Tests, Foreign Countries, Cutting Scores

Aligning Scales of Certification Tests. Research Report. ETS RR-10-07

Download full text

Dorans, Neil J.; Liang, Longjuan; Puhan, Gautam – Educational Testing Service, 2010

Scores are the most visible and widely used products of a testing program. The choice of score scale has implications for test specifications, equating, and test reliability and validity, as well as for test interpretation. At the same time, the score scale should be viewed as infrastructure likely to require repair at some point. In this report…

Descriptors: Testing Programs, Standard Setting (Scoring), Test Interpretation, Certification

Subscores and Validity. Research Report. ETS RR-08-64

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

In educational testing, subscores may be provided based on a portion of the items from a larger test. One consideration in evaluation of such subscores is their ability to predict a criterion score. Two limitations on prediction exist. The first, which is well known, is that the coefficient of determination for linear prediction of the criterion…

Descriptors: Scores, Validity, Educational Testing, Correlation

Validity Coefficients and Correlated Errors in Test Theory

Peer reviewed

Zimmerman, Donald W. – Journal of Experimental Education, 1977

Derives formulas for the validity of predictor-criterion tests that hold for all test scores constructed according to the expected-value concept of true score. These more general formulas disclose some paradoxical properties of test validity under conditions where errors are correlated and have some implications for practical testing situations…

Descriptors: Correlation, Criterion Referenced Tests, Scoring Formulas, Tables (Data)

A Consumers' Guide to Criterion-Referenced Test "Reliability".

Berk, Ronald A. – 1980

Seventeen statistics for measuring the reliability of criterion-referenced tests were critically reviewed. The review was organized into two sections: (1) a discussion of preliminary considerations to provide a foundation for choosing the appropriate category of "reliability" (threshold loss function, squared-error loss-function, or…

Descriptors: Criterion Referenced Tests, Cutting Scores, Scoring Formulas, Statistical Analysis

Setting Cutting Scores: A Minimum Information Approach.

Veldhuijzen, Niels H. – Evaluation in Education: International Progress, 1982

Setting a cutting score is a key problem in criterion-referenced measurement which is discussed within a decision theoretic approach when just one student is considered. A minimum information solution is given and compared with approaches when there is information about a group of students. Formulas illustrate the discussion. (CM)

Descriptors: Criterion Referenced Tests, Cutting Scores, Educational Testing, Measurement Techniques

The Reliability of a Criterion-Referenced Composite with the Parts of the Composite Having Different Cutting Scores.

Peer reviewed

Raju, Nambury S. – Educational and Psychological Measurement, 1982

Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)

Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas

Setting Standards for Minimum Competency Tests.

Download full text

Mehrens, William A. – 1981

Some general questions about minimum competency tests are discussed, and various methods of setting standards are reviewed with major attention devoted to those methods used for dichotomizing a continuum. Methods reviewed under the heading of Absolute Judgments of Test Content include Nedelsky's, Angoff's, Ebel's, and Jaeger's. These methods are…

Descriptors: Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education, Minimum Competency Testing

Passing Scores and Test Lengths for Domain-Referenced Measures

Peer reviewed

Millman, Jason – Review of Educational Research, 1973

Procedures for establishing standards and determining the number of items needed in criterion referenced measures were reviewed. Discussion of setting a passing score was organized around: performance of others, item content, educational consequences, psychological and financial costs, and error due to guessing and item sampling. (Author)

Descriptors: Criterion Referenced Tests, Educational Research, Literature Reviews, Measurement Techniques

Obtaining Intended Weights When Combining Students' Scores. NCME Instructional Module.

Peer reviewed

Oosterhof, Albert C. – Educational Measurement: Issues and Practice, 1987

This module describes a method for weighting various measures of student achievement, such as examinations and home assignments, in order to combine these measures into a final grade. Standard deviation methods receive extensive attention. (TJH)

Descriptors: Criterion Referenced Tests, Evaluation Criteria, Grading, Norm Referenced Tests

Binomial Test Models for Domain-Referenced Testing.

van den Brink, Wulfert – Evaluation in Education: International Progress, 1982

Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques

Behavior on Objective Tests Under Theoretically Adequate, Inadequate and Unspecified Scoring Rules.

Download full text

Jacobs, Stanley S. – 1974

Investigated were the effects of two levels of penalty for incorrect responses on two dependent variables (a measure of risk-taking or confidence, based on nonsense items, and the number of response-attempts to legitimate items) for three treatment groups in a 2x3, multi-response repeated measures, multivariate ANOVA (Analysis of Variance) design.…

Descriptors: Confidence Testing, Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests

Toward an Integration of Theory and Method for Criterion-Referenced Tests.

Download full text

Hambleton, Ronald K.; Novick, Melvin R. – 1972

In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…

Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling

Previous Page | Next Page »

Pages: 1 | 2

Jacobs, Stanley S.	2
Barta, Maryann B.	1
Bennett, John	1
Berk, Ronald A.	1
Bormuth, John R.	1
Brennan, Robert L.	1
Bruno, James E.	1
Dorans, Neil J.	1
Gould, Jewell C.	1
Gramenz, Gary W.	1
Haberman, Shelby J.	1
Haladyna, Thomas	1
Hambleton, Ronald K.	1
Holster, Trevor A.	1
Jones, Bernard G.	1
Lake, J.	1
Liang, Longjuan	1
Lloyd, John W.	1
McNeil, Judy T.	1
Mehrens, William A.	1
Millman, Jason	1
Novick, Melvin R.	1
Oosterhof, Albert C.	1
Opp, Ronald D.	1
More ▼