
Britton, Bruce K.; And Others – Journal of Educational Psychology, 1991
The accuracy of 210 college undergraduates' judgments of the relative learnability of original (Army training manuals) and rewritten (war or cultural histories) versions of 20 pairs of texts of known difficulty was studied. Students were 95 percent accurate in their judgments of learnability. Implications for college textbook selection are…
Descriptors: Difficulty Level, Evaluative Thinking, Evaluators, Guides

Lunz, Mary E.; Stahl, John A. – 1990
Three examinations administered to medical students were analyzed to determine differences among severities of judges' assessments and among grading periods. The examinations included essay, clinical, and oral forms of the tests. Twelve judges graded the three essays for 32 examinees during a 4-day grading session, which was divided into eight…
Descriptors: Clinical Diagnosis, Comparative Testing, Difficulty Level, Essay Tests

Poggio, John P.; And Others – Educational and Psychological Measurement, 1987
College faculty served as judges to rate the instructional validity of items on the National Teacher Examinations Core Battery. The ratings were examined in relation to actual test performance, as well as panelists' ratings of item difficulty and relevance. (Author/GDC)
Descriptors: Beginning Teachers, Content Validity, Difficulty Level, Education Majors

Plake, Barbara S.; Melican, Gerald J. – Educational and Psychological Measurement, 1989
The impact of overall test length and difficulty on expert judgments of item performance made with the Nedelsky method was studied. Five university-level instructors predicting the performance of minimally competent candidates on a mathematics examination were fairly consistent in their assessments regardless of length or difficulty of the test.…
Descriptors: Difficulty Level, Estimation (Mathematics), Evaluators, Higher Education

Lunz, Mary E.; And Others – 1989
A method for understanding and controlling the multiple facets of an oral examination (OE) or other judge-intermediated examination is presented and illustrated. This study focused on determining the extent to which the facets model (FM) analysis constructs meaningful variables for each facet of an OE involving protocols, examiners, and…
Descriptors: Computer Software, Difficulty Level, Evaluators, Examiners

Wheeler, Patricia – 1991
The appropriateness of the Angoff method (W. H. Angoff, 1971) for setting standards on tests was studied. Evaluators (judges) from California school districts and teacher training institutions reviewed 15 NTE (National Teacher Examinations) Program Specialty Area Tests published by the Educational Testing Service for their appropriateness in…
Descriptors: Art Education, Biology, Difficulty Level, Elementary Secondary Education

Jaeger, Richard M. – 1989
Criteria for selecting the judges (evaluators) who set item-based standards on tests for which cutting scores must be established are investigated. Focus is on cases in which test standards are based on specialists' judgments concerning the difficulty of test items in tests used to determine who will be awarded a diploma, admitted…
Descriptors: College Entrance Examinations, Cutting Scores, Difficulty Level, Estimation (Mathematics)