Schmidgall, Jonathan – Applied Measurement in Education, 2017
This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…
Descriptors: Scores, Reliability, Validity, Generalizability Theory

Mehrens, William A. – Applied Measurement in Education, 1997
This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)
Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991
Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)
Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques