Parsi, Ace; Darling-Hammond, Linda – National Association of State Boards of Education, 2015
Employers, postsecondary institutions, and civic leaders are urging greater focus on 21st century skills essential for college, career, and civic success: problem solving, interpersonal skills, and collaboration, among others. In response to these demands, states across the country are working to realign policies--on learning standards,…
Descriptors: Educational Assessment, State Policy, Performance Based Assessment, Sustainability
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research
Wolfe, Edward W.; Kao, Chi-Wen – 1996
This paper reports the results of an analysis of the relationship between scorer behaviors and score variability. Thirty-six essay scorers were interviewed and asked to perform a think-aloud task as they scored 24 essays. Each comment made by a scorer was coded according to its content focus (i.e. appearance, assignment, mechanics, communication,…
Descriptors: Content Analysis, Educational Assessment, Essays, Evaluation Methods

Sykes, Robert C.; Ito, Kyoko; Fitzpatrick, Anne R.; Ercikan, Kadriye – Journal of Educational Measurement, 1997
The five chapters of this report provide resources that deal with the validity, generalizability, comparability, performance standards, and fairness, equity, and bias of performance assessments. The book is written for experienced educational measurement practitioners, although an extensive familiarity with performance assessment is not required.…
Descriptors: Educational Assessment, Measurement Techniques, Performance Based Assessment, Standards
Reckase, Mark D. – 1997
This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…
Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability

Bullock, Cheryl Davis; DeStefano, Lizanne – Educational Evaluation and Policy Analysis, 1998
The usefulness of the 1992 Trial State Assessment (TSA) in reading was studied using interviews with 26 state directors of assessment. Perceptions of TSA credibility, the orientation of test components, and current use of TSA results were examined. The directors suggested involving more teachers in assessment, modifying descriptors, disseminating results more quickly, and…
Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, Performance Based Assessment

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995
Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

McBee, Maridyth M.; Barnes, Laura L. B. – Applied Measurement in Education, 1998
The temporal stability and intertask consistency of an eighth-grade mathematics performance assessment and how task similarity affects the ability to generalize results of the assessments were studied with results from 101 eighth graders. Results support the suggestion that large-scale performance assessments be used with considerable caution…
Descriptors: Academic Achievement, Grade 8, Junior High School Students, Junior High Schools
Kaufman, Alan S.; And Others – 1994
The reliability and validity of three short forms of the Wechsler Intelligence Scale for Children III (WISC-III) were compared. Each of the short forms was a tetrad composed of two verbal and two performance subtests. The first tetrad was selected based primarily on practical considerations, particularly its brevity to administer and score. The…
Descriptors: Adolescents, Age Differences, Children, Clinical Diagnosis
Lyman, Howard B. – 1998
The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…
Descriptors: Educational Testing, Norm Referenced Tests, Performance Based Assessment, Psychometrics
Flaitz, Jim; Perdomo, Toni – 1994
A cursory examination of current measurement texts used in teacher education reveals a treatment of such topics as test reliability, item analysis, and test interpretation based largely upon classical test theory. In the meantime, the landscape of the classroom has been significantly impacted by a growing emphasis on more authentic assessment…
Descriptors: Educational Assessment, Educational Innovation, Educational Trends, Item Analysis
Perrone, Vito – 1991
This ERIC Digest was adapted from the Association for Childhood Education International's (ACEI) 1991 position paper on standardized testing. Since the publication of "A Nation at Risk" in 1983, standardized testing programs have expanded greatly. Tests may be of pencil-and-paper or performance-oriented varieties. The purposes of tests…
Descriptors: Academic Achievement, Accountability, Elementary Education, Elementary School Students

Stoskopf, Carleen H.; And Others – Evaluation Review, 1992
Data are presented that demonstrate the reliability and construct validity of a 27-item behaviorally anchored rating scale (BARS) used to rate the performance of 757 nursing assistants in South Carolina. Results support the reliability and construct validity of the BARS and the usefulness of the BARS approach for evaluation. (SLD)
Descriptors: Construct Validity, Evaluation Methods, Long Term Care, Measurement Techniques

Shavelson, R. J., Ed. – International Journal of Educational Research, 1994
The seven chapters of this issue focus on technical qualities of performance assessments and report the findings of systematic empirical inquiries. They add to the body of research that shows that performance assessment, although time consuming and expensive, merits the serious attention of educators. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, International Studies

Klein, Stephen P. – Journal of Personnel Evaluation in Education, 1998
Discusses how recent court decisions and the move toward performance assessment may affect the adverse impact, reliability, validity, and pass-fail standards of teacher-certification tests. Recommendations are made for tests that combine multiple-choice items with open-ended tasks. (SLD)
Descriptors: Constructed Response, Court Litigation, Elementary Secondary Education, Multiple Choice Tests