ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	3

Descriptor

Foreign Countries	5
Generalizability Theory	5
Academic Achievement	2
Educational Assessment	2
Evaluation Methods	2
Physicians	2
Scoring	2
Student Evaluation	2
Test Construction	2
Test Items	2
Computation	1
Computer Assisted Testing	1
Credentials	1
Cutting Scores	1
Difficulty Level	1
Elementary Secondary Education	1
Family Practice (Medicine)	1
Feedback	1
Feedback (Response)	1
Formative Evaluation	1
Graduate Medical Education	1
Graduate Students	1
Group Discussion	1
High Stakes Tests	1
Interrater Reliability	1
More ▼

Source

Applied Measurement in…	2
Advances in Health Sciences…	1
Journal of Educational…	1

Author

Bell, John F.	1
Bimpeh, Yaw	1
Bruce, David A.	1
Chis, Liliana	1
Clauser, Brian E.	1
Eva, Kevin W.	1
Gipps, Caroline V.	1
Harik, Polina	1
Harrison, Liz	1
Johnson, Sandra	1
Margolis, Melissa J.	1
McManus, I. C.	1
Mercer, Stewart W.	1
Mollon, Jennifer	1
Murphy, Douglas J.	1
Pointer, William	1
Smith, Ben Alexander	1
Williams, Simon	1
More ▼

Publication Type

Journal Articles	4
Reports - Evaluative	3
Reports - Research	2
Speeches/Meeting Papers	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

United Kingdom

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Evaluating Human Scoring Using Generalizability Theory

Peer reviewed

Direct link

Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020

Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…

Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries

An Empirical Examination of the Impact of Group Discussion and Examinee Performance Information on Judgments Made in the Angoff Standard-Setting Procedure

Peer reviewed

Direct link

Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009

Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…

Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring

The Reliability of Workplace-Based Assessment in Postgraduate Medical Education and Training: A National Evaluation in General Practice in the United Kingdom

Peer reviewed

Direct link

Murphy, Douglas J.; Bruce, David A.; Mercer, Stewart W.; Eva, Kevin W. – Advances in Health Sciences Education, 2009

To investigate the reliability and feasibility of six potential workplace-based assessment methods in general practice training: criterion audit, multi-source feedback from clinical and non-clinical colleagues, patient feedback (the CARE Measure), referral letters, significant event analysis, and video analysis of consultations. Performance of GP…

Descriptors: Reliability, Graduate Medical Education, Family Practice (Medicine), Vocational Evaluation

Evaluating and Predicting Survey Efficiency Using Generalizability Theory.

Peer reviewed

Johnson, Sandra; Bell, John F. – Journal of Educational Measurement, 1985

The assessment framework underlying a science performance monitoring program is process-oriented and intended to appeal to generalizability theory for a suitable estimation paradigm. Preliminary applications are described. Results suggest that computerized question-banking, domain-sampling of questions, and generalizablity theory together provide…

Descriptors: Academic Achievement, Computer Assisted Testing, Educational Assessment, Foreign Countries

Quality Assurance in Teachers' Assessment.

Download full text

Gipps, Caroline V. – 1994

The teacher assessment that is the subject of this paper is an essentially informal activity. The teacher assesses the student by posing questions, observing activities, and evaluating work in a planned or ad hoc way. The information obtained may be partial or fragmented, but repeating such assessments over time will allow the buildup of a solid…

Descriptors: Academic Achievement, Educational Assessment, Elementary Secondary Education, Evaluation Methods