ERIC - Search Results

Showing all 7 results

Accuracy of Learnability Judgments for Instructional Texts.

Peer reviewed

Britton, Bruce K.; And Others – Journal of Educational Psychology, 1991

The accuracy of 210 college undergraduates' judgments of the relative learnability of original (Army training manuals) and rewritten (war or cultural histories) versions of 20 pairs of texts of known difficulty was studied. Students were 95 percent accurate in their judgments of learnability. Implications for college textbook selection are…

Descriptors: Difficulty Level, Evaluative Thinking, Evaluators, Guides

Severity of Grading across Time Periods.

Lunz, Mary E.; Stahl, John A. – 1990

Three examinations administered to medical students were analyzed to determine differences among severities of judges' assessments and among grading periods. The examinations included essay, clinical, and oral forms of the tests. Twelve judges graded the three essays for 32 examinees during a 4-day grading session, which was divided into eight…

Descriptors: Clinical Diagnosis, Comparative Testing, Difficulty Level, Essay Tests

Adequacy of Retrospective Judgments to Establish Instructional Validity.

Peer reviewed

Poggio, John P.; And Others – Educational and Psychological Measurement, 1987

College faculty served as judges to rate the instructional validity of items on the National Teacher Examinations Core Battery. The ratings were examined in relation to actual test performance, as well as panelists' ratings of item difficulty and relevance. (Author/GDC)

Descriptors: Beginning Teachers, Content Validity, Difficulty Level, Education Majors

Effects of Item Context on Intrajudge Consistency of Expert Judgments via the Nedelsky Standard Setting Method.

Peer reviewed

Plake, Barbara S.; Melican, Gerald J. – Educational and Psychological Measurement, 1989

The impact of overall test length and difficulty on expert judgments of item performance made by the Nedelsky method was studied. Five university-level instructors predicting the performance of minimally competent candidates on a mathematics examination were fairly consistent in their assessments regardless of length or difficulty of the test.…

Descriptors: Difficulty Level, Estimation (Mathematics), Evaluators, Higher Education

Variation among Examiners and Protocols on Oral Examinations.

Lunz, Mary E.; And Others – 1989

A method for understanding and controlling the multiple facets of an oral examination (OE) or other judge-intermediated examination is presented and illustrated. This study focused on determining the extent to which the facets model (FM) analysis constructs meaningful variables for each facet of an OE involving protocols, examiners, and…

Descriptors: Computer Software, Difficulty Level, Evaluators, Examiners

The Relationship between Modified Angoff Knowledge Estimation Judgments and Item Difficulty Values for Seven NTE Specialty Area Tests.

Wheeler, Patricia – 1991

The appropriateness of the Angoff method (W. H. Angoff, 1971) for setting standards on tests was studied. Evaluators (judges) from California school districts and teacher training institutions reviewed 15 NTE (National Teacher Examinations) Program Specialty Area Tests published by the Educational Testing Service for their appropriateness in…

Descriptors: Art Education, Biology, Difficulty Level, Elementary Secondary Education

Selection of Judges for Standard Setting: What Kinds? How Many?

Jaeger, Richard M. – 1989

Criteria for the selection of judges (evaluators) for setting item-based standards involved in tests for which cutting scores must be established are investigated. Focus is on cases in which test standards are based on specialists' judgments concerning the difficulty of test items in tests used to determine who will be awarded a diploma, admitted…

Descriptors: College Entrance Examinations, Cutting Scores, Difficulty Level, Estimation (Mathematics)