Showing 286 to 300 of 505 results
Peer reviewed
Rowley, Glenn L. – Journal of Educational Measurement, 1982
Survey research establishes that beardedness correlates with many desired outcomes and supports that a minimal level of beardedness be set as a prerequisite for high school graduation. Research problems of concept clarification, domain-definition, instrument development, and standard setting methods are discussed. The political considerations of…
Descriptors: Administrative Principles, Criterion Referenced Tests, Cutting Scores, Dress Codes
Peer reviewed
Behuniak, Peter, Jr.; And Others – Educational and Psychological Measurement, 1982
This study examined how local content specialists performed when applying the Angoff and Nedelsky standard setting procedures to objective-referenced instruments in reading and mathematics. Results revealed several differences between the standard setting procedures in terms of both level and consistency of the cut scores generated. (Author/BW)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Cutting Scores, Interrater Reliability
Peer reviewed
Green, Bert F. – Educational Measurement: Issues and Practice, 1995
If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)
Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores
Peer reviewed
Fisher, William P., Jr., Ed.; Wright, Benjamin D., Ed. – International Journal of Educational Research, 1994
This special issue demonstrates the symmetry and rigor of probabilistic conjoint measurement in practical applications in education and other human sciences. Following the opening chapter introducing the theory, other chapters focus on conjoint measurement in testing and student evaluation, standard setting, and the study of behavior and attitude.…
Descriptors: Attitude Measures, Behavior, Educational Assessment, Educational Research
Peer reviewed
Norcini, John; And Others – Applied Psychological Measurement, 1991
Effects of numbers of experts (NOEs) and common items (CIs) on the scaling of cutting scores from expert judgments were studied for 11,917 physicians taking 2 forms of a medical specialty examination. Increasing NOEs and CIs reduced error; beyond 5 experts and 25 CIs, error differences were small. (SLD)
Descriptors: Comparative Testing, Cutting Scores, Equated Scores, Estimation (Mathematics)
Peer reviewed
Geisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991
Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)
Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability
Peer reviewed
Rothman, Arthur I.; And Others – Evaluation and the Health Professions, 1996
Results of the fall 1993 administration of part two of the Medical Council of Canada's Evaluating Examination for 744 candidates provided evidence of the consistency of the pass/fail and cutting score definitions for the objective-structured clinical examination stations used across examiners. These results support the validity of this…
Descriptors: Cutting Scores, Definitions, Foreign Countries, Medical Education
Peer reviewed
Chang, Lei; And Others – Applied Measurement in Education, 1996
The influence of judges' knowledge on standard setting for competency tests was studied with 17 judges who took an economics teacher certification test while setting competency standards using the Angoff procedure. Judges tended to set higher standards for items they answered correctly and lower standards for items they answered incorrectly. (SLD)
Descriptors: Competence, Difficulty Level, Economics, Judges
Peer reviewed
Cohen, Allan S.; Kane, Michael T.; Crooks, Terence J. – Applied Measurement in Education, 1999
Describes an examinee-centered method for setting multiple cutscores on a test involving both objective and extended-response items. Judges evaluate a representative sample of examinee performance using a rating scale that is defined in terms of performance standards, and these ratings are linked to examinees' test scores to generate a functional…
Descriptors: Academic Standards, Achievement Tests, Constructed Response, Cutting Scores
Peer reviewed
Taylor, Catherine – American Educational Research Journal, 1994
Reviews two models for assessment, the measurement model and the standards model, their underlying assumptions about learners, and the resulting implications for performance-based test development. Discusses the current testing debate, defines terms such as "authentic assessment" and "performance-based assessment," and…
Descriptors: Academic Standards, Educational Assessment, Educational Change, Elementary Secondary Education
Green, Bert F. – 1995
Setting performance standards is an area that different constituencies see quite differently. The choices of elements for a particular standard depend to a large extent on the purposes the standard is intended to serve. Standards can be used in certification, as predictors, as descriptors, and as motivators. While performance standards indicate…
Descriptors: Certification, Course Content, Cutting Scores, Elementary Secondary Education
Johanson, George A.; Rich, Charles E. – 1991
Assigning letter grades in a consistent manner to tests in large classes across semesters is problematic if absolute grading standards are used. It may be unreasonable to implement the usual standard-setting approaches recommended for large-scale criterion-referenced testing due to both time constraints and a desire to have criteria that appear…
Descriptors: Class Size, College Students, Criterion Referenced Tests, Difficulty Level
De Champlain, Andre F.; Margolis, Melissa J.; Ross, Linette P.; Macmillan, Mary K.; Klass, Daniel J. – 1998
The purpose of the present investigation was to address several critical issues relating to setting a performance standard on a nationally administered standardized patient examination (SPX). The specific goals of the study were to: (1) compare pass/fail rates from this exercise to those of past studies undertaken with the same examination; (2)…
Descriptors: Clinical Experience, Higher Education, Interrater Reliability, Medical Education
Bay, Luz; Nering, Michael L. – 1998
The use of person-fit methods to determine the extent to which a panelist's ratings fit the item response theory (IRT) models used in the National Assessment of Educational Progress (NAEP) is demonstrated. Person-fit methods are statistical methods that allow the identification of nonfitting response vectors. To determine whether panelists'…
Descriptors: Academic Achievement, Geography, Goodness of Fit, High School Seniors
Busch, John Christian – 1990
The work of the ethicist Charles Curran and the problem-solving strategy of the mixed consequentialist ethical model are applied to a traditional social science measurement problem--that of how to adjust a recommended standard in order to be fair to the test-taker and society. The focus is on criterion-referenced teacher certification tests.…
Descriptors: Criterion Referenced Tests, Ethics, Licensing Examinations (Professions), Models