Descriptor
| Difficulty Level | 9 |
| Minimum Competency Testing | 9 |
| Standard Setting (Scoring) | 9 |
| Interrater Reliability | 6 |
| Test Items | 6 |
| Cutting Scores | 5 |
| Evaluators | 4 |
| Higher Education | 4 |
| Scoring | 4 |
| Judges | 3 |
| Multiple Choice Tests | 3 |
| More ▼ | |
Author
| Melican, Gerald J. | 2 |
| Chang, Lei | 1 |
| DeMauro, Gerald E. | 1 |
| Garrido, Mariquita | 1 |
| Jaeger, Richard M. | 1 |
| Melican, Gerald | 1 |
| Mills, Craig N. | 1 |
| Payne, David A. | 1 |
| Plake, Barbara S. | 1 |
| Reid, Jerry B. | 1 |
| Thomas, Nancy | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 6 |
| Journal Articles | 4 |
| Speeches/Meeting Papers | 4 |
| Reports - Evaluative | 3 |
Education Level
Audience
| Researchers | 2 |
Location
| New Jersey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Melican, Gerald; Thomas, Nancy – 1984
Setting standards for the purpose of certification is frequently performed using judgmental techniques such as the Angoff method. This study was performed to identify types of items that judges find hard to rate accurately, that is, types of items on which examinees perform differently than predicted by the judges. Once identified these item types…
Descriptors: Certification, Cutting Scores, Difficulty Level, Minimum Competency Testing
Peer reviewedChang, Lei; And Others – Applied Measurement in Education, 1996
The influence of judges' knowledge on standard setting for competency tests was studied with 17 judges who took an economics teacher certification test while setting competency standards using the Angoff procedure. Judges tended to set higher standards for items they answered correctly and lower standards for items they answered incorrectly. (SLD)
Descriptors: Competence, Difficulty Level, Economics, Judges
Peer reviewedMills, Craig N.; And Others – Educational Measurement: Issues and Practice, 1991
An approach is presented to the definition of minimal competence for judges to use in standard setting. Panelists in standard setting must receive training to ensure that differences in rating result from differences in perceptions of item difficulty, not in differences of opinion about the definition of minimal competence. (SLD)
Descriptors: Cutting Scores, Decision Making, Definitions, Difficulty Level
Peer reviewedReid, Jerry B. – Educational Measurement: Issues and Practice, 1991
Training judges to generate item ratings in standard setting once the reference group has been defined is discussed. It is proposed that sensitivity to the factors that determine difficulty can be improved through training. Three criteria for determining when training is sufficient are offered. (SLD)
Descriptors: Computer Assisted Instruction, Difficulty Level, Evaluators, Interrater Reliability
DeMauro, Gerald E. – 1995
Studies of the Angoff method of standard setting suggest that judges agree in their estimates of the relative difficulties of test questions for minimally competent examinees and that each judge's estimates correlate well with the observed item difficulties for examinees whose total test scores are near the judge's personal standard (G. E.…
Descriptors: Ability, Competence, Construct Validity, Difficulty Level
Melican, Gerald J.; And Others – 1987
The effects of feedback about the ratings of other judges on subsequent ratings using the Nedelsky method and the ability of judges to retain or eliminate options in a manner consistent with the judgments of minimally competent examinees were studied using data from a basic algebra examination administered to 227 college students in 1987. The…
Descriptors: Certification, College Students, Cutting Scores, Difficulty Level
Peer reviewedPlake, Barbara S.; Melican, Gerald J. – Educational and Psychological Measurement, 1989
The impact of overall test length and difficulty on the expert judgments of item performance by the Nedelsky method were studied. Five university-level instructors predicting the performance of minimally competent candidates on a mathematics examination were fairly consistent in their assessments regardless of length or difficulty of the test.…
Descriptors: Difficulty Level, Estimation (Mathematics), Evaluators, Higher Education
Garrido, Mariquita; Payne, David A. – 1987
Minimum competency cut-off scores on a statistics exam were estimated under four conditions: the Angoff judging method with item data (n=20), and without data available (n=19); and the Modified Angoff method with (n=19), and without (n=19) item data available to judges. The Angoff method required free response percentage estimates (0-100) percent,…
Descriptors: Academic Standards, Comparative Analysis, Criterion Referenced Tests, Cutting Scores
Jaeger, Richard M. – 1989
Criteria for the selection of judges (evaluators) for setting item-based standards involved in tests for which cutting scores must be established are investigated. Focus is on cases in which test standards are based on specialists' judgments concerning the difficulty of test items in tests used to determine who will be awarded a diploma, admitted…
Descriptors: College Entrance Examinations, Cutting Scores, Difficulty Level, Estimation (Mathematics)


