ERIC - Search Results

Descriptor

Evaluators	7
Interrater Reliability	7
Minimum Competency Testing	7
Standard Setting (Scoring)	7
Difficulty Level	4
Minimum Competencies	4
Scoring	4
Test Interpretation	3
Computer Assisted Instruction	2
Cutting Scores	2
Estimation (Mathematics)	2
Feedback	2
Higher Education	2
Licensing Examinations…	2
Selection	2
Standardized Tests	2
Teacher Certification	2
Test Construction	2
Test Items	2
College Entrance Examinations	1
Decision Making	1
Definitions	1
Economics	1
Evaluation Criteria	1
Examiners	1

Source

Educational Measurement:…	3
Educational and Psychological…	1

Author

Plake, Barbara S.	2
Chang, Lei	1
Friedman, Charles B.	1
Ho, Kevin T.	1
Jaeger, Richard M.	1
Melican, Gerald J.	1
Mills, Craig N.	1
Reid, Jerry B.	1

Publication Type

Journal Articles	4
Reports - Research	4
Reports - Evaluative	3
Speeches/Meeting Papers	3

Showing all 7 results

Interjudge Consensus and Intrajudge Consistency: Is It Possible To Have Both in Standard Setting?

Friedman, Charles B.; Ho, Kevin T. – 1990

Eleven judges representing 11 different geographic regions in the United States participated in a standard-setting session designed to determine the possibility of obtaining interjudge consensus and intrajudge consistency simultaneously. Each judge had experience in the field for which standards were being set. The judges rated 65 multiple-choice…

Descriptors: Evaluators, Feedback, Interrater Reliability, Licensing Examinations (Professions)

Factors Influencing Intrajudge Consistency during Standard-Setting.

Peer reviewed

Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991

Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)

Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback

Does a Standard Reflect Minimal Competency of Examinees or Judge Competency?

Chang, Lei; And Others – 1994

The present study examines the influence of judges' item-related knowledge on setting standards for competency tests. Seventeen judges from different professions took a 122-item teacher-certification test in economics while setting competency standards for the test using the Angoff procedure. Judges tended to set higher standards for items they…

Descriptors: Economics, Evaluators, Experience, Interrater Reliability

Defining Minimal Competence.

Peer reviewed

Mills, Craig N.; And Others – Educational Measurement: Issues and Practice, 1991

An approach to defining minimal competence for judges to use in standard setting is presented. Panelists in standard setting must receive training to ensure that differences in rating result from differences in perceptions of item difficulty, not from differences of opinion about the definition of minimal competence. (SLD)

Descriptors: Cutting Scores, Decision Making, Definitions, Difficulty Level

Training Judges to Generate Standard-Setting Data.

Peer reviewed

Reid, Jerry B. – Educational Measurement: Issues and Practice, 1991

Training judges to generate item ratings in standard setting once the reference group has been defined is discussed. It is proposed that sensitivity to the factors that determine difficulty can be improved through training. Three criteria for determining when training is sufficient are offered. (SLD)

Descriptors: Computer Assisted Instruction, Difficulty Level, Evaluators, Interrater Reliability

Effects of Item Context on Intrajudge Consistency of Expert Judgments via the Nedelsky Standard Setting Method.

Peer reviewed

Plake, Barbara S.; Melican, Gerald J. – Educational and Psychological Measurement, 1989

The impact of overall test length and difficulty on expert judgments of item performance made with the Nedelsky method was studied. Five university-level instructors predicting the performance of minimally competent candidates on a mathematics examination were fairly consistent in their assessments regardless of length or difficulty of the test.…

Descriptors: Difficulty Level, Estimation (Mathematics), Evaluators, Higher Education

Selection of Judges for Standard Setting: What Kinds? How Many?

Jaeger, Richard M. – 1989

Criteria for the selection of judges (evaluators) for setting item-based standards involved in tests for which cutting scores must be established are investigated. Focus is on cases in which test standards are based on specialists' judgments concerning the difficulty of test items in tests used to determine who will be awarded a diploma, admitted…

Descriptors: College Entrance Examinations, Cutting Scores, Difficulty Level, Estimation (Mathematics)