Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 3 |
Descriptor
Foreign Countries | 5 |
Generalizability Theory | 5 |
Academic Achievement | 2 |
Educational Assessment | 2 |
Evaluation Methods | 2 |
Physicians | 2 |
Scoring | 2 |
Student Evaluation | 2 |
Test Construction | 2 |
Test Items | 2 |
Computation | 1 |
Author
Bell, John F. | 1 |
Bimpeh, Yaw | 1 |
Bruce, David A. | 1 |
Chis, Liliana | 1 |
Clauser, Brian E. | 1 |
Eva, Kevin W. | 1 |
Gipps, Caroline V. | 1 |
Harik, Polina | 1 |
Harrison, Liz | 1 |
Johnson, Sandra | 1 |
Margolis, Melissa J. | 1 |
Publication Type
Journal Articles | 4 |
Reports - Evaluative | 3 |
Reports - Research | 2 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Location
United Kingdom | 5 |
Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020
Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…
Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries
Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009
Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…
Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring
Murphy, Douglas J.; Bruce, David A.; Mercer, Stewart W.; Eva, Kevin W. – Advances in Health Sciences Education, 2009
To investigate the reliability and feasibility of six potential workplace-based assessment methods in general practice training: criterion audit, multi-source feedback from clinical and non-clinical colleagues, patient feedback (the CARE Measure), referral letters, significant event analysis, and video analysis of consultations. Performance of GP…
Descriptors: Reliability, Graduate Medical Education, Family Practice (Medicine), Vocational Evaluation

Johnson, Sandra; Bell, John F. – Journal of Educational Measurement, 1985
The assessment framework underlying a science performance monitoring program is process-oriented and intended to appeal to generalizability theory for a suitable estimation paradigm. Preliminary applications are described. Results suggest that computerized question-banking, domain-sampling of questions, and generalizability theory together provide…
Descriptors: Academic Achievement, Computer Assisted Testing, Educational Assessment, Foreign Countries
Gipps, Caroline V. – 1994
The teacher assessment that is the subject of this paper is an essentially informal activity. The teacher assesses the student by posing questions, observing activities, and evaluating work in a planned or ad hoc way. The information obtained may be partial or fragmented, but repeating such assessments over time will allow the buildup of a solid…
Descriptors: Academic Achievement, Educational Assessment, Elementary Secondary Education, Evaluation Methods