Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 5 |
Descriptor
Criterion Referenced Tests | 29 |
Scoring Formulas | 29 |
Measurement Techniques | 9 |
Norm Referenced Tests | 9 |
Test Interpretation | 9 |
Test Reliability | 9 |
Test Validity | 9 |
Scoring | 7 |
Cutting Scores | 6 |
Statistical Analysis | 6 |
Testing | 6 |
More ▼ |
Source
Author
Jacobs, Stanley S. | 2 |
Barta, Maryann B. | 1 |
Bennett, John | 1 |
Berk, Ronald A. | 1 |
Bormuth, John R. | 1 |
Brennan, Robert L. | 1 |
Bruno, James E. | 1 |
Dorans, Neil J. | 1 |
Gould, Jewell C. | 1 |
Gramenz, Gary W. | 1 |
Haberman, Shelby J. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 3 |
Adult Education | 1 |
Elementary Secondary Education | 1 |
High Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
California Achievement Tests | 2 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Romig, John Elwood; Therrien, William J.; Lloyd, John W. – Journal of Special Education, 2017
We used meta-analysis to examine the criterion validity of four scoring procedures used in curriculum-based measurement of written language. A total of 22 articles representing 21 studies (N = 21) met the inclusion criteria. Results indicated that two scoring procedures, correct word sequences and correct minus incorrect sequences, have acceptable…
Descriptors: Meta Analysis, Curriculum Based Assessment, Written Language, Scoring Formulas
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Bennett, John; Tognolini, Jim; Pickering, Samantha – Assessment in Education: Principles, Policy & Practice, 2012
This paper describes how a state education system in Australia introduced standards-referenced assessments into its large-scale, high-stakes, curriculum-based examinations in a way that enables comparison of performance across time even though the examinations are different each year. It describes the multi-stage modified Angoff standard-setting…
Descriptors: Feedback (Response), Tests, Foreign Countries, Cutting Scores
Dorans, Neil J.; Liang, Longjuan; Puhan, Gautam – Educational Testing Service, 2010
Scores are the most visible and widely used products of a testing program. The choice of score scale has implications for test specifications, equating, and test reliability and validity, as well as for test interpretation. At the same time, the score scale should be viewed as infrastructure likely to require repair at some point. In this report…
Descriptors: Testing Programs, Standard Setting (Scoring), Test Interpretation, Certification
Haberman, Shelby J. – ETS Research Report Series, 2008
In educational testing, subscores may be provided based on a portion of the items from a larger test. One consideration in evaluation of such subscores is their ability to predict a criterion score. Two limitations on prediction exist. The first, which is well known, is that the coefficient of determination for linear prediction of the criterion…
Descriptors: Scores, Validity, Educational Testing, Correlation

Zimmerman, Donald W. – Journal of Experimental Education, 1977
Derives formulas for the validity of predictor-criterion tests that hold for all test scores constructed according to the expected-value concept of true score. These more general formulas disclose some paradoxical properties of test validity under conditions where errors are correlated and have some implications for practical testing situations…
Descriptors: Correlation, Criterion Referenced Tests, Scoring Formulas, Tables (Data)
Berk, Ronald A. – 1980
Seventeen statistics for measuring the reliability of criterion-referenced tests were critically reviewed. The review was organized into two sections: (1) a discussion of preliminary considerations to provide a foundation for choosing the appropriate category of "reliability" (threshold loss function, squared-error loss-function, or…
Descriptors: Criterion Referenced Tests, Cutting Scores, Scoring Formulas, Statistical Analysis
Veldhuijzen, Niels H. – Evaluation in Education: International Progress, 1982
Setting a cutting score is a key problem in criterion-referenced measurement which is discussed within a decision theoretic approach when just one student is considered. A minimum information solution is given and compared with approaches when there is information about a group of students. Formulas illustrate the discussion. (CM)
Descriptors: Criterion Referenced Tests, Cutting Scores, Educational Testing, Measurement Techniques

Raju, Nambury S. – Educational and Psychological Measurement, 1982
Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)
Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas
Mehrens, William A. – 1981
Some general questions about minimum competency tests are discussed, and various methods of setting standards are reviewed with major attention devoted to those methods used for dichotomizing a continuum. Methods reviewed under the heading of Absolute Judgments of Test Content include Nedelsky's, Angoff's, Ebel's, and Jaeger's. These methods are…
Descriptors: Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education, Minimum Competency Testing

Millman, Jason – Review of Educational Research, 1973
Procedures for establishing standards and determining the number of items needed in criterion referenced measures were reviewed. Discussion of setting a passing score was organized around: performance of others, item content, educational consequences, psychological and financial costs, and error due to guessing and item sampling. (Author)
Descriptors: Criterion Referenced Tests, Educational Research, Literature Reviews, Measurement Techniques

Oosterhof, Albert C. – Educational Measurement: Issues and Practice, 1987
This module describes a method for weighting various measures of student achievement, such as examinations and home assignments, in order to combine these measures into a final grade. Standard deviation methods receive extensive attention. (TJH)
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Grading, Norm Referenced Tests
van den Brink, Wulfert – Evaluation in Education: International Progress, 1982
Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques
Jacobs, Stanley S. – 1974
Investigated were the effects of two levels of penalty for incorrect responses on two dependent variables (a measure of risk-taking or confidence, based on nonsense items, and the number of response-attempts to legitimate items) for three treatment groups in a 2x3, multi-response repeated measures, multivariate ANOVA (Analysis of Variance) design.…
Descriptors: Confidence Testing, Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests
Hambleton, Ronald K.; Novick, Melvin R. – 1972
In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…
Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling
Previous Page | Next Page ยป
Pages: 1 | 2