Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 12 |
Descriptor
Educational Assessment | 38 |
Test Use | 38 |
Validity | 38 |
Elementary Secondary Education | 17 |
Evaluation Methods | 15 |
Performance Based Assessment | 14 |
Student Evaluation | 14 |
Reliability | 12 |
Test Construction | 10 |
Educational Change | 9 |
Academic Achievement | 8 |
More ▼ |
Source
Author
Baker, Eva L. | 2 |
Dings, Jonathan | 2 |
Herman, Joan L. | 2 |
Adams, Elizabeth | 1 |
Bennett, Jessica G. | 1 |
Bracey, Gerald W. | 1 |
Braden, Jeffery P. | 1 |
Burling, Kelly S. | 1 |
Chase, Clinton I. | 1 |
Crehan, Kevin D. | 1 |
Crocker, Linda | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 2 |
Secondary Education | 2 |
Grade 9 | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Clinical Evaluation of… | 1 |
Comprehensive Tests of Basic… | 1 |
Expressive One Word Picture… | 1 |
National Assessment of… | 1 |
Peabody Picture Vocabulary… | 1 |
What Works Clearinghouse Rating
Lederman, Josh – Applied Measurement in Education, 2023
Given its centrality to assessment, until the concept of validity includes concern for racial justice, such matters will be seen as residing outside the "real" work of validation, rendering them powerless to count against the apparent scientific merit of the test. As the definition of validity has evolved, however, it holds great…
Descriptors: Educational Assessment, Validity, Social Justice, Race
Della-Piana, Gabriel M.; Gardner, Michael K.; Mayne, Zachary M. – Journal of Research Practice, 2018
The authors describe challenges of following professional standards for educational achievement testing due to the complexity of gathering appropriate evidence to support demanding test interpretation and use. Validity evidence has been found to be low for some individual testing standards, leading to the possibility of faulty or impoverished test…
Descriptors: Achievement Tests, Standards, Educational Assessment, Testing
Ketterlin-Geller, Leanne R.; Perry, Lindsey; Adams, Elizabeth – Applied Measurement in Education, 2019
Despite the call for an argument-based approach to validity over 25 years ago, few examples exist in the published literature. One possible explanation for this outcome is that the complexity of the argument-based approach makes implementation difficult. To counter this claim, we propose that the Assessment Triangle can serve as the overarching…
Descriptors: Validity, Educational Assessment, Models, Screening Tests
Bennett, Jessica G.; Gardner, Ralph, III; Rizzi, Gleides Lopes – American Annals of the Deaf, 2013
Strong correlations exist between signed and/or spoken English and the literacy skills of deaf and hard of hearing students. Assessments that are both valid and reliable are key for researchers and practitioners investigating the signed and/or spoken English skills of signing populations. The authors conducted a literature review to explore which…
Descriptors: Deafness, Hearing Impairments, Sign Language, Language Skills
Pant, Hans A.; Rupp, Andre A.; Tiffin-Richards, Simon P.; Koller, Olaf – Studies in Educational Evaluation, 2009
Standard-setting procedures are a key component within many large-scale educational assessment systems. They are consensual approaches in which committees of experts set cut-scores on continuous proficiency scales, which facilitate communication of proficiency distributions of students to a wide variety of stakeholders. This communicative function…
Descriptors: Test Use, Educational Assessment, Validity, Standard Setting
Koch, Martha J.; DeLuca, Christopher – Assessment in Education: Principles, Policy & Practice, 2012
In this article we rethink validation within the complex contexts of high-stakes assessment. We begin by considering the utility of existing models for validation and argue that these models tend to overlook some of the complexities inherent to assessment use, including the multiple interpretations of assessment purposes and the potential…
Descriptors: Foreign Countries, Test Use, Case Studies, Educational Assessment
Stenlund, Tova – Assessment & Evaluation in Higher Education, 2010
The process of giving official acknowledgment to formal, informal and non-formal prior learning is commonly labelled as assessment, accreditation or recognition of prior learning (APL), representing a practice that is expanding in higher education in many countries. This paper focuses specifically on the assessment part of APL, which undoubtedly…
Descriptors: Higher Education, Validity, Prior Learning, Program Effectiveness
Braden, Jeffery P.; Shaw, Steven R. – Assessment for Effective Intervention, 2009
The intervention validity of cognitive assessment batteries is considered within an historical context to identify what the evidence supports (knowns), what cannot be known (unknowables), and what is not yet known (unknowns). Two ways cognitive batteries could inform intervention are identified: a disordinal (i.e., aptitude-treatment interaction)…
Descriptors: Intervention, Validity, Cognitive Tests, Cognitive Measurement
Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009
Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…
Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment
Sireci, Stephen G.; Han, Kyung T.; Wells, Craig S. – Educational Assessment, 2008
In the United States, when English language learners (ELLs) are tested, they are usually tested in English and their limited English proficiency is a potential cause of construct-irrelevant variance. When such irrelevancies affect test scores, inaccurate interpretations of ELLs' knowledge, skills, and abilities may occur. In this article, we…
Descriptors: Test Use, Educational Assessment, Psychological Testing, Validity
Shepard, Lorrie A. – Educational Measurement: Issues and Practice, 2009
In many school districts, the pressure to raise test scores has created overnight celebrity status for formative assessment. Its powers to raise student achievement have been touted, however, without attending to the research on which these claims were based. Sociocultural learning theory provides theoretical grounding for understanding how…
Descriptors: Learning Theories, Validity, Student Evaluation, Evaluation Methods
Baker, Eva L.; Linn Robert L. – 2002
This report analyzes the validity issues that arise in the context of educational accountability systems. The report addresses validity from three interlocking perspectives. The first explores the theory of action underlying accountability provisions. It considers problems emerging from the distance between aspirations for accountability in…
Descriptors: Accountability, Educational Assessment, Educational Change, Educational Testing
Ferrara, Steven; And Others – 1995
A study was conducted to begin a process of validating hypothesized causes of local item dependence (LID) in large-scale performance assessments. Data for the study are item level scores from 26 science tasks from the 1993 edition of the Maryland School Performance Assessment Program. Causes of high LID were hypothesized from studies by Ferrara et…
Descriptors: Educational Assessment, Hands on Science, Performance Based Assessment, Prediction
Messick, Samuel – 1994
The traditional concept of validity divides it into three separate types; content, criterion, and construct validities. This view is fragmented and incomplete, failing to take into account evidence of the value implications of score meaning as a basis for action and of the social consequences of score use. The new unified concept of validity…
Descriptors: Construct Validity, Criteria, Educational Assessment, Hypothesis Testing