ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	3

Descriptor

Performance Based Assessment	7
Test Validity	4
Construct Validity	2
Educational Assessment	2
Error of Measurement	2
Generalizability Theory	2
Measurement Techniques	2
Scores	2
Test Items	2
Academic Achievement	1
Achievement Tests	1
Cognitive Processes	1
Cutting Scores	1
Decision Making	1
Difficulty Level	1
Educational Testing	1
Efficiency	1
Elementary Education	1
Elementary School Students	1
Evaluators	1
Expertise	1
Intermediate Grades	1
Interrater Reliability	1
Item Analysis	1
Licensing Examinations…	1
More ▼

Source

Journal of Educational…

Author

Carolin Hahnel	1
Ercikan, Kadriye	1
Fitzpatrick, Anne R.	1
Frank Goldhammer	1
Ito, Kyoko	1
Johannes Naumann	1
Kahraman, Nilufer	1
Kane, Michael T.	1
Lane, Suzanne	1
Paul De Boeck	1
Peabody, Michael R.	1
Raymond, Mark R.	1
Shavelson, Richard J.	1
Swygert, Kimberly A.	1
Sykes, Robert C.	1
Ulf Kroehne	1
Wind, Stefanie A.	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	3
Reports - Evaluative	2
Book/Product Reviews	1
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

United States Medical…

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Does Timed Testing Affect the Interpretation of Efficiency Scores?--A GLMM Analysis of Reading Components

Peer reviewed

Direct link

Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024

The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…

Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation

Exploring the Influence of Judge Proficiency on Standard-Setting Judgments

Peer reviewed

Direct link

Peabody, Michael R.; Wind, Stefanie A. – Journal of Educational Measurement, 2019

Setting performance standards is a judgmental process involving human opinions and values as well as technical and empirical considerations. Although all cut score decisions are by nature somewhat arbitrary, they should not be capricious. Judges selected for standard-setting panels should have the proper qualifications to make the judgments asked…

Descriptors: Standard Setting, Decision Making, Performance Based Assessment, Evaluators

Psychometric Equivalence of Ratings for Repeat Examinees on a Performance Assessment for Physician Licensure

Peer reviewed

Direct link

Raymond, Mark R.; Swygert, Kimberly A.; Kahraman, Nilufer – Journal of Educational Measurement, 2012

Although a few studies report sizable score gains for examinees who repeat performance-based assessments, research has not yet addressed the reliability and validity of inferences based on ratings of repeat examinees on such tests. This study analyzed scores for 8,457 single-take examinees and 4,030 repeat examinees who completed a 6-hour clinical…

Descriptors: Physicians, Licensing Examinations (Professions), Performance Based Assessment, Repetition

Current Concerns in Validity Theory.

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 2001

Provides a brief historical review of construct validity and discusses the current state of validity theory, emphasizing the role of arguments in validation. Examines the application of an argument-based approach with regard to the distinction between performance-based and theory-based interpretations and the role of consequences in validation.…

Descriptors: Construct Validity, Educational Testing, Performance Based Assessment, Theories

Technical Issues in Large-Scale Performance Assessment [Book Review].

Peer reviewed

Sykes, Robert C.; Ito, Kyoko; Fitzpatrick, Anne R.; Ercikan, Kadriye – Journal of Educational Measurement, 1997

The five chapters of this report provide resources that deal with the validity, generalizability, comparability, performance standards, and fairness, equity, and bias of performance assessments. The book is written for experienced educational measurement practitioners, although an extensive familiarity with performance assessment is not required.…

Descriptors: Educational Assessment, Measurement Techniques, Performance Based Assessment, Standards

Sampling Variability of Performance Assessments.

Peer reviewed

Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1993

Evidence is presented on the generalizability and convergent validity of performance assessments using data from six studies of student achievement that sampled a wide range of measurement facets and methods. Results at individual and school levels indicate that task-sampling variability is the major source of measurement error. (SLD)

Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Generalizability Theory

Generalizability and Validity of a Mathematics Performance Assessment.

Peer reviewed

Lane, Suzanne; And Others – Journal of Educational Measurement, 1996

Evidence from test results of 3,604 sixth and seventh graders is provided for the generalizability and validity of the Quantitative Understanding: Amplifying Student Achievement and Reasoning (QUASAR) Cognitive Assessment Instrument, which is designed to measure program outcomes and growth in mathematics. (SLD)

Descriptors: Achievement Tests, Cognitive Processes, Elementary Education, Elementary School Students