ERIC - Search Results

Showing all 5 results

Indices of Subscore Utility for Individuals and Subgroups Based on Multivariate Generalizability Theory

Peer reviewed

Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020

Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…

Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability

Measurement Precision for Repeat Examinees on a Standardized Patient Examination

Peer reviewed

Raymond, Mark R.; Swygert, Kimberly A.; Kahraman, Nilufer – Advances in Health Sciences Education, 2012

Examinees who initially fail and later repeat an SP-based clinical skills exam typically exhibit large score gains on their second attempt, suggesting the possibility that examinees were not well measured on one of those attempts. This study evaluates score precision for examinees who repeated an SP-based clinical skills test administered as part…

Descriptors: Evidence, Generalizability Theory, Error of Measurement, Clinical Experience

The Impact of Statistically Adjusting for Rater Effects on Conditional Standard Errors of Performance Ratings

Peer reviewed

Raymond, Mark R.; Harik, Polina; Clauser, Brian E. – Applied Psychological Measurement, 2011

Prior research indicates that the overall reliability of performance ratings can be improved by using ordinary least squares (OLS) regression to adjust for rater effects. The present investigation extends previous work by evaluating the impact of OLS adjustment on standard errors of measurement ("SEM") at specific score levels. In…

Descriptors: Performance Based Assessment, Licensing Examinations (Professions), Least Squares Statistics, Item Response Theory

The Impact of Statistical Adjustment on Conditional Standard Errors of Measurement in the Assessment of Physician Communication Skills

Peer reviewed

Raymond, Mark R.; Clauser, Brian E.; Furman, Gail E. – Advances in Health Sciences Education, 2010

The use of standardized patients to assess communication skills is now an essential part of assessing a physician's readiness for practice. To improve the reliability of communication scores, it has become increasingly common in recent years to use statistical models to adjust ratings provided by standardized patients. This study employed ordinary…

Descriptors: Generalizability Theory, Physicians, Patients, Least Squares Statistics

Same-Form Retest Effects on Credentialing Examinations

Peer reviewed

Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009

Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…

Descriptors: Test Results, Test Items, Testing, Aptitude Tests