Showing 1 to 15 of 23 results
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Bontempo, Brian D.; Marks, Casimer M.; Karabatsos, George – 1998
Using meta-analysis, this research takes a look at studies included in a meta-analysis by R. Jaeger (1989) that compared the cut score set by one standard setting method with that set by another. This meta-analysis looks beyond Jaeger's studies to select 10 from the research literature. Each compared at least two types of standard setting method.…
Descriptors: Comparative Analysis, Cutting Scores, Effect Size, Meta Analysis
Sigmon, Gary L.; And Others – 1983
In recent years educators have been utilizing judgmental methods, such as the ones advocated by Ebel and Angoff, to set minimum competency standards on test items. This study was designed to investigate the reliability and validity of these two procedures in setting minimum levels of performance on 175 vocational evaluator competency statements.…
Descriptors: Comparative Analysis, Evaluation Methods, Evaluators, Minimum Competencies
Peer reviewed
Kane, Michael T. – Journal of Educational Measurement, 1987
The use of item response theory models for analyzing the results of judgmental standard setting studies (the Angoff technique) for establishing minimum pass levels is discussed. A comparison of three methods indicates the traditional approach may not be best. A procedure based on generalizability theory is suggested. (GDC)
Descriptors: Comparative Analysis, Cutting Scores, Generalizability Theory, Latent Trait Theory
Peer reviewed
Mills, Craig N. – Journal of Educational Measurement, 1983
This study compares the results obtained using the Angoff, borderline group, and contrasting groups methods of determining performance standards. Congruent results were obtained from the Angoff and contrasting groups methods for several test forms. Borderline group standards were not similar to standards obtained with other methods. (Author/PN)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Cutting Scores, Standard Setting (Scoring)
Peer reviewed
Andrew, Barbara J.; Hecht, James T. – Educational and Psychological Measurement, 1976
Results suggest that different groups of judges do set similar examination standards when using the same procedure, and that the average of individual judgments does not differ significantly from group consensus judgments. Significant differences were found, however, between the standards set by the two procedures employed. (RC)
Descriptors: Comparative Analysis, Cutting Scores, Multiple Choice Tests, Pass Fail Grading
Cizek, Gregory J.; Fitzgerald, Shawn M. – 1996
A group-process approach to standard setting was compared to an independent approach for a medical specialty certification examination. Both approaches used the Angoff (1971) standard-setting method. In the group-process method, reviewers discussed items and their ratings during the rating process; in the independent condition, reviewers provided…
Descriptors: Comparative Analysis, Cost Effectiveness, Group Dynamics, Judges
Peer reviewed
Harasym, P. H. – Educational and Psychological Measurement, 1981
To determine if a given standard-setting procedure will yield consistent evaluation outcomes, one of three parallel certifying examinations was administered to three classes of second year medical students. The results indicated that the standard-setting procedure is a significant factor in the determination of the evaluation outcome. (Author/BW)
Descriptors: Comparative Analysis, Cutting Scores, Foreign Countries, Graduate Medical Students
Buckendahl, Chad W.; Smith, Russ W.; Impara, James C.; Plake, Barbara S. – 2000
This paper presents a comparison of two commonly used methods, Angoff (W. Angoff, 1971) and Bookmark (D. Lewis, H. Mitzel, and D. Green, 1996), for setting cut scores on selected response tests. This comparison is presented through an application to a grade 7 mathematics assessment in a suburban Midwestern school district. Training and operational…
Descriptors: Comparative Analysis, Cutting Scores, Junior High School Students, Junior High Schools
Peer reviewed
Norcini, John J.; Shea, Judy A. – Applied Measurement in Education, 1997
The major forms of evidence that support a standard's credibility are reviewed, and what can be done over time and for different forms of an examination to enhance its comparability in a credentialing setting is outlined. Pass-fail decisions must be consistent to ensure a standard's credibility. (SLD)
Descriptors: Certification, Comparative Analysis, Credentials, Credibility
Peer reviewed
Henry, Gary T.; And Others – Evaluation Review, 1992
A statistical technique is presented for developing performance standards based on benchmark groups. The benchmark groups are selected using a multivariate technique that relies on a squared Euclidean distance method. For each observation unit (a school district in the example), a unique comparison group is selected. (SLD)
Descriptors: Accountability, Benchmarking, Comparative Analysis, Control Groups
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Peer reviewed
Livingston, Samuel A.; Zieky, Michael J. – Applied Measurement in Education, 1989
The borderline group standard-setting method (BGSM), Nedelsky method (NM), and Angoff method (AM) were compared, using reading scores for 1,948 and mathematics scores for 2,191 sixth through ninth graders. The NM and AM were inconsistent with the BGSM. Passing scores were higher where students were more able. (SLD)
Descriptors: Comparative Analysis, Cutting Scores, Elementary Secondary Education, Intermediate Grades