Sinclair, Andrea L., Ed.; Thacker, Arthur, Ed. – Human Resources Research Organization (HumRRO), 2019
These are the appendices for the technical report, "An Investigation of the Comparability of Commission-Approved Teaching Performance Assessment Models." California's Commission on Teacher Credentialing (Commission) requires all programs of preliminary multiple and single subject teacher preparation to use a Commission-approved Teaching…
Descriptors: Performance Based Assessment, Preservice Teachers, Models, Scoring Rubrics
Sinclair, Andrea L., Ed.; Thacker, Arthur, Ed. – Human Resources Research Organization (HumRRO), 2019
California's Commission on Teacher Credentialing (Commission) requires all programs of preliminary multiple and single subject teacher preparation to use a Commission-approved Teaching Performance Assessment (TPA) as one of the program completion requirements for prospective teacher candidates. Three TPA models were approved by the Commission: (1)…
Descriptors: Preservice Teachers, Performance Based Assessment, Models, Credentials
Northwest Evaluation Association, 2015
Recently, the Smarter Balanced Assessment Consortium (Smarter Balanced) released a document that established initial performance levels and the associated threshold scale scores for the Smarter Balanced assessment. The report included estimated percentages of students expected to perform at each of the four performance levels, reported by grade…
Descriptors: Standard Setting, Standard Setting (Scoring), Pretesting, Cutting Scores
Northwest Evaluation Association, 2014
Recently, Northwest Evaluation Association (NWEA) completed a study to connect the scale of the Minnesota Comprehensive Assessments (MCA) Testing Program used for Minnesota's mathematics and reading assessments with NWEA's RIT (Rasch Unit) scale. Information from the state assessments was used in a study to establish performance-level scores on…
Descriptors: Alignment (Education), Testing Programs, State Programs, Mathematics Tests
Bennett, John; Tognolini, Jim; Pickering, Samantha – Assessment in Education: Principles, Policy & Practice, 2012
This paper describes how a state education system in Australia introduced standards-referenced assessments into its large-scale, high-stakes, curriculum-based examinations in a way that enables comparison of performance across time even though the examinations are different each year. It describes the multi-stage modified Angoff standard-setting…
Descriptors: Feedback (Response), Tests, Foreign Countries, Cutting Scores
Klenowski, Val; Wyatt-Smith, Claire – Australian Educational Researcher, 2010
While externally moderated standards-based assessment has been practised in Queensland senior schooling for more than three decades, there has been no such practice in the middle years. With the introduction of standards at state and national levels in these years, teacher judgement as developed in moderation practices is now vital. This paper…
Descriptors: Student Evaluation, Educational Change, Foreign Countries, Standard Setting (Scoring)
Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008
Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…
Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies
DeMauro, Gerald E. – 2003
An analysis was made of the cognitive processes that support the judgments made in standard setting activities. These processes were conceived as having two components: forming the domain needed to pass the test and identifying the criterion level of performance to pass the test. In fact, these processes are interactive, and were separated for the…
Descriptors: Cognitive Processes, Judges, Matrices, Performance Based Assessment
Abbott, Marilyn L. – Alberta Journal of Educational Research, 2006
The purpose of this article is to promote an increased awareness of the processes for setting cut-scores for complex performance assessments by (a) describing the Analytic Judgment Method (AJM) for setting cut-scores, and (b) critically evaluating the technical adequacy and practicability of the AJM by focusing on one investigation where the AJM…
Descriptors: Interrater Reliability, Cutting Scores, Performance Based Assessment, Standard Setting (Scoring)
Lunz, Mary E. – 1997
This paper explains the multifacet technology for analyzing performance examinations and the fair average method of setting criterion standards. The multidimensional nature of performance examinations requires that multiple and often different facet elements of a candidate's examination form be accounted for in the analysis. After this is…
Descriptors: Ability, Computer Assisted Testing, Criteria, Educational Technology
Verhelst, N. D.; Kaftandjieva, F. – 1999
A new method is proposed to set multiple standards in performance tests. The method combines three sources of information coming from three different data collections. The first is an empirical definition of mastery of an item; the second consists of parameter estimates of the items in an Item Response Theory (IRT) model, and the third source is a…
Descriptors: Cutting Scores, Data Collection, Foreign Countries, Item Response Theory

Plake, Barbara S.; And Others – Educational and Psychological Measurement, 1997
The dominant profile judgment method, designed for use with profiles of polytomous scores on exercises in a performance-based assessment, is presented as a standard-setting method. The approach guides standard-setting panelists in articulating their standard-setting policies and allows for complex policy statements. (SLD)
Descriptors: Educational Policy, Field Tests, Performance Based Assessment, Policy Formation

Berk, Ronald A. – Applied Measurement in Education, 1995
A brief summary of standard setting knowledge is presented, derived from about 20 methods that utilize a judgmental review process, the approach most relevant to the standard-setting strategies proposed in this special issue. Criteria for judging effectiveness and critiques of the methods discussed in the issue are offered. (SLD)
Descriptors: Criteria, Decision Making, Educational History, Evaluation Methods
Plake, Barbara S.; Impara, James C. – 1996
This study investigated the intrajudge consistency of Angoff-based item performance estimates. The examination used was a certification examination in an emergency medicine specialty. Ten expert panelists rated the same 24 items twice during an operational standard setting study. Results indicate that the panelists were highly consistent, in terms…
Descriptors: Cutting Scores, Interrater Reliability, Licensing Examinations (Professions), Performance Based Assessment

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1995
A newly developed performance standard-setting procedure, termed iterative judgmental policy capturing (JPC), is applicable to assessments composed of distinct multidimensional exercises. The procedure is described, and results are reported from the application of JPC in a study involving a panel of 20 teachers and 6 performance exercises. (SLD)
Descriptors: Decision Making, Educational Assessment, Licensing Examinations (Professions), Multidimensional Scaling