Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 9 |
Descriptor
Performance Based Assessment | 21 |
Scores | 21 |
Test Interpretation | 21 |
Educational Assessment | 9 |
Standardized Tests | 6 |
Test Construction | 6 |
Test Validity | 6 |
Construct Validity | 5 |
Student Evaluation | 5 |
Validity | 5 |
Evaluation Methods | 4 |
More ▼ |
Source
Author
Messick, Samuel | 4 |
Archbald, Doug A. | 1 |
Benjamin, Roger | 1 |
Carolin Hahnel | 1 |
Choi, Ick Kyu | 1 |
Clemens, Nathan H. | 1 |
Davis, John L. | 1 |
DiVesta, Francis J. | 1 |
Dunbar, Stephen B. | 1 |
Frank Goldhammer | 1 |
Gschwendtner, Tobias | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 4 |
Elementary Education | 2 |
High Schools | 2 |
Postsecondary Education | 2 |
Secondary Education | 2 |
Elementary Secondary Education | 1 |
Grade 5 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
Grade 9 | 1 |
Junior High Schools | 1 |
More ▼ |
Audience
Practitioners | 2 |
Community | 1 |
Teachers | 1 |
Location
California | 1 |
California (Los Angeles) | 1 |
Canada | 1 |
Italy | 1 |
Massachusetts | 1 |
Michigan | 1 |
United States | 1 |
Vermont | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Gates MacGinitie Reading Tests | 1 |
General Educational… | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024
The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…
Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation
Hartmann, Stefan; Güzel, Emre; Gschwendtner, Tobias – Empirical Research in Vocational Education and Training, 2023
We investigated the ecological validity of performance measures from a computer-based assessment tool that utilises scripted video vignettes. The intended purpose of this tool is to assess the maintenance and repair skills of automotive technician apprentices, complementing traditional hands-on assessment formats from the German journeymen's…
Descriptors: Performance Based Assessment, Computer Assisted Testing, Auto Mechanics, Job Skills
Schmidgall, Jonathan – Applied Measurement in Education, 2017
This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…
Descriptors: Scores, Reliability, Validity, Generalizability Theory
Clemens, Nathan H.; Davis, John L.; Simmons, Leslie E.; Oslund, Eric L.; Simmons, Deborah C. – Journal of Psychoeducational Assessment, 2015
Standardized measures are often used as an index of students' reading comprehension and scores have important implications, particularly for students who perform below expectations. This study examined secondary-level students' patterns of responding and the prevalence and impact of non-attempted items on a timed, group-administered,…
Descriptors: Secondary School Students, Performance Based Assessment, Multiple Choice Tests, Reading Comprehension
Wolf, Raffaela; Zahner, Doris; Kostoris, Fiorella; Benjamin, Roger – Council for Aid to Education, 2014
The measurement of higher-order competencies within a tertiary education system across countries presents methodological challenges due to differences in educational systems, socio-economic factors, and perceptions as to which constructs should be assessed (Blömeke, Zlatkin-Troitschanskaia, Kuhn, & Fege, 2013). According to Hart Research…
Descriptors: Case Studies, International Assessment, Performance Based Assessment, Critical Thinking
Choi, Ick Kyu – ProQuest LLC, 2013
At the University of California, Los Angeles, the Test of Oral Proficiency (TOP), an internally developed oral proficiency test, is administered to international teaching assistant (ITA) candidates to ensure an appropriate level of academic oral English proficiency. Test taker performances are rated live by two raters according to four subscales.…
Descriptors: Screening Tests, Profiles, Oral Language, English
Spalding, Audrey – Mackinac Center for Public Policy, 2014
The 2014 Michigan Public High School Context and Performance Report Card is the Mackinac Center's second effort to measure high school performance. The first high school assessment was published in 2012, followed by the Center's 2013 elementary and middle school report card, which used a similar methodology to evaluate school performance. The…
Descriptors: Academic Achievement, Achievement Rating, Comparative Analysis, Comparative Testing
Noble, Tracy; Suarez, Catherine; Rosebery, Ann; O'Connor, Mary Catherine; Warren, Beth; Hudicourt-Barnes, Josiane – Journal of Research in Science Teaching, 2012
Education policy in the U.S. in the last two decades has emphasized large-scale assessment of students, with growing consequences for schools, teachers, and students. Given the high stakes of such tests, it is important to understand the relationships between students' answers to test items and their knowledge and skills in the tested content…
Descriptors: Testing, Science Tests, Second Language Learning, Measures (Individuals)
Shermis, Mark D.; DiVesta, Francis J. – Rowman & Littlefield Publishers, Inc., 2011
"Classroom Assessment in Action" clarifies the multi-faceted roles of measurement and assessment and their applications in a classroom setting. Comprehensive in scope, Shermis and Di Vesta explain basic measurement concepts and show students how to interpret the results of standardized tests. From these basic concepts, the authors then…
Descriptors: Student Evaluation, Standardized Tests, Scores, Measurement

Mehrens, William A. – Applied Measurement in Education, 1997
This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)
Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment

Messick, Samuel – Educational Measurement: Issues and Practice, 1995
Six distinguishable aspects of construct validity are discussed as they apply to performance assessment, emphasizing content, substantive, structural, generalizability, external, and consequential aspects. Taken together, these aspects provide a way to address validity questions in score interpretation and use. (SLD)
Descriptors: Construct Validity, Content Validity, Educational Assessment, Generalization
Messick, Samuel – 1994
The construct validity of content standards is addressed in terms of their representative coverage of a construct domain and their alignment with the students' cognitive level of developing expertise in the subject matter. The construct validity of performance standards is addressed in terms of the extent to which they reflect increasing levels of…
Descriptors: Construct Validity, Educational Assessment, Inferences, Knowledge Level
Messick, Samuel – 1994
In contrast to multiple choice, alternative modes of assessment afford varying degrees of openness in the allowable responses. Prominent among the alternatives is the assessment of performance, sometimes in its own right where the issue is the quality of the particular performance per se, but more often as a vehicle for the assessment of…
Descriptors: Alternative Assessment, Construct Validity, Educational Assessment, Inferences
Messick, Samuel – 1996
The concept of "washback," especially prominent in the field of applied linguistics, refers to the extent to which a test influences teachers and learners to do things they would not otherwise necessarily do. Some writers invoke the notion of washback validity, holding that a test's validity should be gauged by the degree to which it has…
Descriptors: Applied Linguistics, Construct Validity, Criteria, Language Tests
Lyman, Howard B. – 1998
The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…
Descriptors: Educational Testing, Norm Referenced Tests, Performance Based Assessment, Psychometrics
Previous Page | Next Page »
Pages: 1 | 2