Showing 316 to 330 of 505 results
Peer reviewed
Thompson, Bruce, Ed. – Journal of Experimental Education, 1994
Five authors representing diverse perspectives comment on the revised "Program Evaluation Standards" approved by the American National Standards Institute (ANSI). Standards are considered in light of their development; measurement issues; program evaluation; evaluation in the local education agency; and the context of evaluation…
Descriptors: Context Effect, Evaluation Methods, Evaluation Utilization, Guides
Peer reviewed
Putnam, Sarah E.; And Others – Applied Measurement in Education, 1995
Development of a multistage dominant profile method for setting standards on complex performance assessments is detailed. The method grew from experiences with a judgmental policy-capturing procedure and an extended Angoff method. The design of an early adolescence English language arts assessment illustrates the complexity of decisions panelists…
Descriptors: Adolescents, Decision Making, Elementary Secondary Education, Evaluation Methods
Peer reviewed
Henry, Gary T.; And Others – Evaluation Review, 1992
A statistical technique is presented for developing performance standards based on benchmark groups. The benchmark groups are selected using a multivariate technique that relies on a squared Euclidean distance method. For each observation unit (a school district in the example), a unique comparison group is selected. (SLD)
Descriptors: Accountability, Benchmarking, Comparative Analysis, Control Groups
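The selection step summarized in the Henry et al. abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the z-score standardization, the group size `k`, the function names, and the district figures are all assumptions made for illustration. It shows only the general idea of picking, for each unit, the comparison units with the smallest squared Euclidean distance.

```python
def standardize(rows):
    """Z-score each column so that no single indicator dominates the distance.
    (Standardizing first is an assumption, not something the abstract states.)"""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    sds = []
    for c, m in zip(cols, means):
        sd = (sum((x - m) ** 2 for x in c) / len(c)) ** 0.5
        sds.append(sd if sd > 0 else 1.0)  # guard against constant columns
    return [[(x - m) / s for x, m, s in zip(row, means, sds)] for row in rows]


def squared_euclidean(a, b):
    """Sum of squared differences between two indicator vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def benchmark_group(units, target, k):
    """Return the k units closest to `target`, excluding the target itself."""
    names = list(units)
    z = dict(zip(names, standardize([units[n] for n in names])))
    others = [n for n in names if n != target]
    others.sort(key=lambda n: squared_euclidean(z[target], z[n]))
    return others[:k]


# Hypothetical districts: [poverty rate, enrollment]
districts = {
    "A": [0.10, 500.0],
    "B": [0.12, 520.0],
    "C": [0.45, 4000.0],
    "D": [0.50, 4200.0],
    "E": [0.11, 480.0],
}

print(benchmark_group(districts, "A", 2))
```

Because the group is rebuilt for every target, each district gets its own comparison set, which matches the abstract's statement that a unique comparison group is selected per observation unit.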
Peer reviewed
Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991
Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)
Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback
Peer reviewed
Fehrmann, Melinda L.; And Others – Educational and Psychological Measurement, 1991
Two frame-of-reference rater training approaches were compared for effects on reliability and accuracy of cutoff scores generated by 21 raters using Angoff methods on tests taken by 155 undergraduates. Both approaches result in higher interrater reliability and more accuracy than does a non-frame-of-reference method. (SLD)
Descriptors: Cutting Scores, Evaluators, Generalizability Theory, Higher Education
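For readers unfamiliar with how Angoff-style cutoff scores are generated, the following is a minimal sketch of the basic computation as it is commonly described: each rater estimates, per item, the probability that a minimally competent examinee answers correctly, and the cutoff is the average of the raters' summed estimates. The ratings, the function name, and the use of the standard deviation as a rough spread measure are assumptions for illustration; the sketch does not reproduce the rater training conditions or the reliability analyses of the Fehrmann et al. study.

```python
from statistics import mean, stdev


def angoff_cut_score(ratings):
    """ratings[j][i] is rater j's estimated probability that a minimally
    competent examinee answers item i correctly.

    Returns (cut score, standard deviation of the raters' individual cuts).
    """
    rater_cuts = [sum(r) for r in ratings]  # one implied cutoff per rater
    return mean(rater_cuts), stdev(rater_cuts)


# Hypothetical ratings: 3 raters x 4 items
ratings = [
    [0.6, 0.7, 0.5, 0.8],
    [0.5, 0.7, 0.6, 0.9],
    [0.7, 0.6, 0.4, 0.8],
]

cut, spread = angoff_cut_score(ratings)
print(f"cut score: {cut:.2f} of {len(ratings[0])} items, rater SD: {spread:.2f}")
```

A smaller spread across raters indicates closer agreement on the cutoff, though it is only a crude stand-in for the reliability indices a study like this would report.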
Peer reviewed
Swanson, David B.; And Others – Academic Medicine, 1990
This study is the National Board of Medical Examiners' exploration of content-based techniques (standard-setting techniques in which pass/fail decisions are based upon the performance of examinees in relation to test content). Two content-based techniques (Angoff and Ebel) and three methods of evaluating examinee performance were studied. (MLW)
Descriptors: Content Validity, Evaluation Methods, Higher Education, Medical Education
Peer reviewed
Plake, Barbara S.; Hambleton, Ronald K. – Educational Assessment, 2000
Applied the analytical judgment standard setting method to 90 papers from the 1996 Grade 8 National Assessment of Educational Progress science assessment. Compared sorting versus direct classification, long and short versions of the classification scale, and effects of discussion on cutscores. Results from 17 Georgia teachers and 8 Michigan…
Descriptors: Classification, Cutting Scores, Junior High School Students, Junior High Schools
Peer reviewed
Impara, James C.; Plake, Barbara S. – Journal of Educational Measurement, 1998
Sixth-grade teachers (n=26) estimated item performance for their students (724 total students) on a 50-item district-wide science test. Teachers were more accurate in estimating performance of the total group than of the borderline group, but in neither case was their accuracy high. Estimating proportion-correct values using the Angoff standard…
Descriptors: Difficulty Level, Elementary School Teachers, Grade 6, Intermediate Grades
Peer reviewed
Abbott, Marilyn L. – Alberta Journal of Educational Research, 2006
The purpose of this article is to promote an increased awareness of the processes for setting cut-scores for complex performance assessments by (a) describing the Analytic Judgment Method (AJM) for setting cut-scores, and (b) critically evaluating the technical adequacy and practicability of the AJM by focusing on one investigation where the AJM…
Descriptors: Interrater Reliability, Cutting Scores, Performance Based Assessment, Standard Setting (Scoring)
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Reckase, Mark D. – 1994
Comparative results are presented for procedures recently appearing in literature related to standard setting on the National Assessment of Educational Progress--the paper selection method and the contrasting group method. For this comparison, a probabilistic model with normal distribution of performance and a six-point scale were assumed. The…
Descriptors: Comparative Analysis, Criteria, Educational Assessment, Elementary Secondary Education
Plake, Barbara S.; Hambleton, Ronald K. – 1998
This paper reports on a standard-setting method designed for complex performance assessments with multiple performance categories. The method studied, the Analytical Judgment Method, involves panelists' making analytical classification decisions for each of the test's components individually. It also allows for discussion and reconsideration of…
Descriptors: Classification, Data Analysis, Grade 8, Junior High School Students
Hambleton, Ronald K.; Plake, Barbara S. – 1994
The number of performance-based assessments is increasing rapidly, but to date there is no established procedure for setting standards on these assessments. This paper describes several extensions to the Angoff procedure to accommodate the characteristics of a performance-based assessment and presents the results of research in applying this…
Descriptors: Educational Assessment, Evaluation Methods, Interrater Reliability, Performance Based Assessment
Chang, Lei; And Others – 1994
The present study examines the influence of judges' item-related knowledge on setting standards for competency tests. Seventeen judges from different professions took a 122-item teacher-certification test in economics while setting competency standards for the test using the Angoff procedure. Judges tended to set higher standards for items they…
Descriptors: Economics, Evaluators, Experience, Interrater Reliability
Sykes, Robert C.; Fitzpatrick, Anne R. – 1990
The results of classifying test items on the basis of their Mantel-Haenszel (MH) alpha estimates were compared to the results of classifying these items using an item response theory (IRT) based procedure involving the comparison of item difficulties in the interest of identifying the alpha value that maximized the decision concordance between the…
Descriptors: Classification, Cutting Scores, Difficulty Level, Ethnic Groups