Showing all 7 results
Peer reviewed
Roberts, William L.; McKinley, Danette W.; Boulet, John R. – Advances in Health Sciences Education, 2010
Due to the high-stakes nature of medical exams, it is prudent for test agencies to critically evaluate test data and control for potential threats to validity. For the typical multiple-station performance assessments used in medicine, it may take time for examinees to become comfortable with the test format and administrative protocol. Since each…
Descriptors: Student Evaluation, Pretests Posttests, Licensing Examinations (Professions), Scores
Peer reviewed
Judd, Wallace – Practical Assessment, Research & Evaluation, 2009
Over the past twenty years in performance testing, a specific item type with distinguishing characteristics has arisen time and time again. It has been invented independently by dozens of test development teams, and yet this item type is not recognized in the research literature. This article is an invitation to investigate the item type, evaluate…
Descriptors: Test Items, Test Format, Evaluation, Item Analysis
Schulz, E. Matthew; Wang, Lin – 2001
In this study, items were drawn from a full-length test of 30 items in order to construct shorter tests for the purpose of making accurate pass/fail classifications with regard to a specific criterion point on the latent ability metric. A three-parameter Item Response Theory (IRT) framework was used. The criterion point on the latent ability…
Descriptors: Ability, Classification, Item Response Theory, Pass Fail Grading
PDF pending restoration
Faggen, Jane; And Others – 1995
The objective of this study was to determine the degree to which recommendations for passing scores, calculated on the basis of a traditional standard-setting methodology, might be affected by the mode (paper versus computer-screen prints) in which test items were presented to standard setting panelists. Results were based on the judgments of 31…
Descriptors: Computer Assisted Testing, Cutting Scores, Difficulty Level, Evaluators
Peer reviewed
Woodruff, David J.; Sawyer, Richard L. – Applied Psychological Measurement, 1989
Two methods--non-distributional and normal--are derived for estimating measures of pass-fail reliability. Both are based on the Spearman-Brown formula and require only a single test administration. Results from a simulation (n=20,000 examinees) and a licensure examination (n=4,828 examinees) illustrate these methods. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Licensing Examinations (Professions), Measures (Individuals)
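For reference, the Spearman-Brown formula the abstract mentions predicts the reliability of a test lengthened (or shortened) by a factor k from the reliability of the original test. The values below are illustrative, not the study's data:

```python
def spearman_brown(rho, k):
    """Spearman-Brown prophecy formula: predicted reliability of a test
    whose length is changed by factor k, given original reliability rho."""
    return k * rho / (1.0 + (k - 1.0) * rho)

# Stepping a half-test correlation of 0.60 up to full length (k = 2):
full_length_reliability = spearman_brown(0.60, 2)  # -> 0.75
```

A single administration suffices because the half-test correlation can be computed by splitting one test form in two, then stepping the result up with the formula.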
Sykes, Robert C.; And Others – 1992
A part-form methodology was used to study the effect of varying degrees of multidimensionality on the consistency of pass/fail classification decisions obtained from simulated unidimensional item response theory (IRT)-based licensure examinations. A control on the degree of form multidimensionality permitted an assessment throughout the range of…
Descriptors: Classification, Comparative Testing, Computer Simulation, Decision Making
Peer reviewed
Bergstrom, Betty A.; Lunz, Mary E. – Evaluation and the Health Professions, 1992
For 645 medical technology students, pass/fail decisions obtained from a computerized adaptive test implementing a 90 percent confidence stopping rule carried a higher level of confidence than decisions from paper-and-pencil tests of comparable length. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Confidence Testing
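A 90 percent confidence stopping rule of the kind this abstract describes is commonly implemented by ending the adaptive test once the confidence interval around the current ability estimate no longer contains the pass/fail cut score. A minimal sketch under that assumption (the function name, cut score, and values are hypothetical):

```python
def can_stop(theta_hat, se, cut, z=1.645):
    """Stop testing once the two-sided 90% confidence interval around the
    ability estimate theta_hat (standard error se) excludes the cut score."""
    return abs(theta_hat - cut) >= z * se

# Estimate far from the cut relative to its standard error: stop.
can_stop(1.0, se=0.25, cut=0.5)   # True
# Estimate still too close to the cut: keep administering items.
can_stop(0.6, se=0.25, cut=0.5)   # False
```

Because the rule stops as soon as the classification is statistically settled, examinees near the cut score receive more items while clear passes and fails finish early, which is why such a test can match a fixed-length test's length with greater decision confidence.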