Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 2 |
Descriptor
Performance Based Assessment | 8 |
Scores | 8 |
Educational Assessment | 3 |
Test Interpretation | 3 |
Decision Making | 2 |
Scoring | 2 |
State Programs | 2 |
Test Construction | 2 |
Test Reliability | 2 |
Test Use | 2 |
Testing Programs | 2 |
More ▼ |
Source
Applied Measurement in… | 8 |
Author
Publication Type
Journal Articles | 8 |
Reports - Evaluative | 5 |
Reports - Research | 3 |
Book/Product Reviews | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 1 |
Audience
Location
California (Los Angeles) | 1 |
Netherlands | 1 |
Vermont | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Schmidgall, Jonathan – Applied Measurement in Education, 2017
This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…
Descriptors: Scores, Reliability, Validity, Generalizability Theory
Kuhlemeier, Hans; Hemker, Bas; van den Bergh, Huub – Applied Measurement in Education, 2013
In recent years many countries have introduced authentic performance-based assessments in their national exam systems. Teachers' ratings of their own candidates' performances may suffer from errors of leniency and range restriction. The goal of this study was to examine the impact of manipulating the descriptiveness, balancedness, and polarity of…
Descriptors: Performance Based Assessment, Rating Scales, Scores, High Stakes Tests

Goldberg, Gail Lynn; Roswell, Barbara Sherr – Applied Measurement in Education, 2001
To determine the factors that contribute to or compromise the effectiveness of multiscored items, this study combined analysis of statewide score data from the 1996 Maryland School Performance Assessment Program tests with systematic analyses of 60 activities providing measures of writing, language usage, or both, and one or more content areas.…
Descriptors: Performance Based Assessment, Scores, State Programs, Testing Programs

Wolfe, Edward W.; Gitomer, Drew H. – Applied Measurement in Education, 2001
Attempted to improve the measurement quality of a complex performance assessment through principled assessment design using the example of the National Board for Professional Teaching Standards Early Childhood/Generalist examination. All indexes examined improved after revisions were made. Results show the importance of attention to assessment…
Descriptors: Change, Performance Based Assessment, Psychometrics, Scores

Mehrens, William A. – Applied Measurement in Education, 1997
This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)
Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995
Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

Roid, Gale H. – Applied Measurement in Education, 1994
A study with more than 1,000 students in grades 3 and 8 explored the advantages of analytical scoring in writing assessment. Cluster analysis revealed 11 patterns of student scores, reflecting different patterns of writing strength and weakness, and replicated across 5 modes of writing. (SLD)
Descriptors: Cluster Analysis, Discourse Modes, Educational Assessment, Elementary Education

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991
Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)
Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques