Descriptor
Performance Based Assessment | 5 |
Test Reliability | 5 |
Educational Assessment | 3 |
Test Use | 3 |
Test Validity | 3 |
Licensing Examinations… | 2 |
Mathematics Tests | 2 |
Scores | 2 |
Scoring | 2 |
Test Construction | 2 |
Academic Achievement | 1 |
More ▼ |
Source
Applied Measurement in… | 5 |
Author
Barnes, Laura L. B. | 1 |
Dunbar, Stephen B. | 1 |
Hambleton, Ronald K. | 1 |
Klein, Stephen P. | 1 |
McBee, Maridyth M. | 1 |
Millman, Jason | 1 |
Slater, Sharon C. | 1 |
Publication Type
Journal Articles | 5 |
Reports - Evaluative | 4 |
Speeches/Meeting Papers | 2 |
Reports - Research | 1 |
Education Level
Audience
Location
Vermont | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995
Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

McBee, Maridyth M.; Barnes, Laura L. B. – Applied Measurement in Education, 1998
The temporal stability and intertask consistency of an eighth-grade mathematics performance assessment and how task similarity affects the ability to generalize results of the assessments were studied with results from 101 eighth graders. Results support the suggestion that large-scale performance assessments be used with considerable caution…
Descriptors: Academic Achievement, Grade 8, Junior High School Students, Junior High Schools

Hambleton, Ronald K.; Slater, Sharon C. – Applied Measurement in Education, 1997
A brief history of developments in the assessment of the reliability of credentialing examinations is presented, and some new results are outlined that highlight the interactions among scoring, standard setting, and the reliability and validity of pass-fail decisions. Decision consistency is an important concept in evaluating credentialing…
Descriptors: Certification, Credentials, Decision Making, Interaction

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991
Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)
Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques

Millman, Jason – Applied Measurement in Education, 1991
Alternatives to multiple-choice tests for teacher licensing examinations are described, and their advantages are cited. Concerns are expressed in the areas of cost and practicality, reliability, corruptibility, and validity. A suggestion for reducing costs using multiple-choice responses calibrated to constructed-response tasks is proposed. (SLD)
Descriptors: Beginning Teachers, Constructed Response, Cost Effectiveness, Educational Assessment