Clauser, Jerome C.; Margolis, Melissa J.; Clauser, Brian E. – Journal of Educational Measurement, 2014
Evidence that standard-setting results are stable across panels or occasions is an important part of the validity argument for an established cut score. Unfortunately, because convening multiple panels of content experts is costly, standards are often based on the recommendations of a single panel of judges. This approach implicitly assumes that…
Descriptors: Standard Setting (Scoring), Generalizability Theory, Replication (Evaluation), Cutting Scores
Keller, Lisa A.; Clauser, Brian E.; Swanson, David B. – Advances in Health Sciences Education, 2010
In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates…
Descriptors: Generalizability Theory, Test Reliability, Performance Based Assessment, Error of Measurement
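The task-specificity point in this abstract can be made concrete with a minimal generalizability-theory sketch. For a persons × tasks crossed design with one observation per cell, variance components are estimated from the ANOVA mean squares, and the generalizability coefficient shows how reliability depends on the number of tasks sampled. This is a generic textbook illustration, not the analysis from the article; the function names are ours.

```python
import numpy as np

def g_study(scores):
    """Estimate variance components for a persons x tasks crossed design
    (one observation per cell) from expected mean squares."""
    n_p, n_t = scores.shape
    grand = scores.mean()
    person_means = scores.mean(axis=1)
    task_means = scores.mean(axis=0)

    ss_p = n_t * ((person_means - grand) ** 2).sum()
    ss_t = n_p * ((task_means - grand) ** 2).sum()
    ss_tot = ((scores - grand) ** 2).sum()
    ss_pt = ss_tot - ss_p - ss_t            # residual: p x t interaction + error

    ms_p = ss_p / (n_p - 1)
    ms_t = ss_t / (n_t - 1)
    ms_pt = ss_pt / ((n_p - 1) * (n_t - 1))

    var_pt = max(ms_pt, 0.0)                # confounded interaction/error
    var_p = max((ms_p - ms_pt) / n_t, 0.0)  # person (universe-score) variance
    var_t = max((ms_t - ms_pt) / n_p, 0.0)  # task difficulty variance
    return var_p, var_t, var_pt

def g_coefficient(var_p, var_pt, n_tasks):
    """Relative generalizability coefficient for a design with n_tasks tasks:
    universe-score variance over itself plus averaged relative error."""
    return var_p / (var_p + var_pt / n_tasks)
```

Because the error term is divided by the number of tasks, the coefficient rises as more tasks are sampled, which is why task specificity depresses reliability when only a few tasks are used.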
Raymond, Mark R.; Harik, Polina; Clauser, Brian E. – Applied Psychological Measurement, 2011
Prior research indicates that the overall reliability of performance ratings can be improved by using ordinary least squares (OLS) regression to adjust for rater effects. The present investigation extends previous work by evaluating the impact of OLS adjustment on standard errors of measurement ("SEM") at specific score levels. In…
Descriptors: Performance Based Assessment, Licensing Examinations (Professions), Least Squares Statistics, Item Response Theory
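As a rough illustration of the OLS adjustment idea this abstract describes, rater stringency can be estimated by regressing ratings on examinee and rater indicators and subtracting the centered rater effects. This is a generic two-way fixed-effects sketch, not the authors' exact model; the function name and dummy-coding choices are ours.

```python
import numpy as np

def ols_rater_adjust(examinee, rater, rating):
    """Estimate rater stringency effects by OLS and return adjusted ratings.

    Sketch model: rating = examinee effect + rater effect + error, with
    dummy coding for both factors and the first rater as the reference."""
    examinee = np.asarray(examinee)
    rater = np.asarray(rater)
    rating = np.asarray(rating, dtype=float)
    ex_ids = np.unique(examinee)
    r_ids = np.unique(rater)

    # Design matrix: one column per examinee, one per rater except the
    # first (reference rater), keeping the system identifiable.
    X = np.zeros((len(rating), len(ex_ids) + len(r_ids) - 1))
    for row, (e, r) in enumerate(zip(examinee, rater)):
        X[row, np.where(ex_ids == e)[0][0]] = 1.0
        j = np.where(r_ids == r)[0][0]
        if j > 0:
            X[row, len(ex_ids) + j - 1] = 1.0

    beta, *_ = np.linalg.lstsq(X, rating, rcond=None)
    rater_eff = np.concatenate([[0.0], beta[len(ex_ids):]])
    rater_eff -= rater_eff.mean()   # center so the adjustment is relative
    adjusted = rating - rater_eff[np.searchsorted(r_ids, rater)]
    return adjusted, dict(zip(r_ids, rater_eff))
```

For example, with two examinees each rated by a lenient and a harsh rater, the harsh rater's ratings are shifted up and the lenient rater's down, so adjusted scores reflect only examinee differences.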
Raymond, Mark R.; Clauser, Brian E.; Furman, Gail E. – Advances in Health Sciences Education, 2010
The use of standardized patients to assess communication skills is now an essential part of assessing a physician's readiness for practice. To improve the reliability of communication scores, it has become increasingly common in recent years to use statistical models to adjust ratings provided by standardized patients. This study employed ordinary…
Descriptors: Generalizability Theory, Physicians, Patients, Least Squares Statistics
Harik, Polina; Clauser, Brian E.; Grabovsky, Irina; Nungester, Ronald J.; Swanson, Dave; Nandakumar, Ratna – Journal of Educational Measurement, 2009
The present study examined the long-term usefulness of estimated parameters used to adjust the scores from a performance assessment to account for differences in rater stringency. Ratings from four components of the USMLE® Step 2 Clinical Skills Examination data were analyzed. A generalizability-theory framework was used to examine the extent to…
Descriptors: Generalizability Theory, Performance Based Assessment, Performance Tests, Clinical Experience
Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009
Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…
Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring
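One simple internal criterion of the kind this abstract alludes to is the standard error of the cut score computed from the spread of judge-level means. The sketch below is our own minimal version, treating judges as random and items as fixed, which is only one of several defensible G-theory designs for an Angoff study.

```python
import numpy as np

def angoff_cut_se(ratings):
    """ratings: judges x items matrix of Angoff item estimates.

    Returns (cut_score, se). The cut score is the mean over judges and
    items; the SE treats judges as random and items as fixed, so error
    variance is the variance of judge means divided by the judge count."""
    ratings = np.asarray(ratings, dtype=float)
    n_judges = ratings.shape[0]
    judge_means = ratings.mean(axis=1)
    cut = judge_means.mean()
    se = np.sqrt(judge_means.var(ddof=1) / n_judges)
    return cut, se
```

A small SE relative to the score scale supports the claim that the recommended cut score would be stable over replications with comparable panels.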
Clauser, Brian E.; Harik, Polina; Margolis, Melissa J. – Journal of Educational Measurement, 2006
Although multivariate generalizability theory was developed more than 30 years ago, little published research utilizing this framework exists and most of what does exist examines tests built from tables of specifications. In this context, it is assumed that the universe scores from levels of the fixed multivariate facet will be correlated, but the…
Descriptors: Multivariate Analysis, Job Skills, Correlation, Test Items