Publication Date
| In 2026 | 0 |
| Since 2025 | 17 |
| Since 2022 (last 5 years) | 74 |
| Since 2017 (last 10 years) | 189 |
| Since 2007 (last 20 years) | 384 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 274 |
| Researchers | 122 |
| Teachers | 102 |
| Administrators | 63 |
| Counselors | 28 |
| Parents | 21 |
| Policymakers | 21 |
| Students | 15 |
| Community | 8 |
Location
| Canada | 45 |
| Australia | 33 |
| California | 33 |
| United Kingdom | 23 |
| United States | 20 |
| Pennsylvania | 18 |
| United Kingdom (England) | 17 |
| New York | 15 |
| Japan | 14 |
| Michigan | 14 |
| New Jersey | 12 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Brennan, Robert L. – Journal of Educational Measurement, 2013
Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Jerrim, John – Assessment in Education: Principles, Policy & Practice, 2016
The Programme for International Assessment (PISA) is an important cross-national study of 15-year olds academic achievement. Although it has traditionally been conducted using paper-and-pencil tests, the vast majority of countries will use computer-based assessment from 2015. In this paper, we consider how cross-country comparisons of children's…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Oller, John W., Jr. – Language Testing, 2012
Kane's argument-based framework is summarized and examined. He implicitly appeals to the backgrounded concepts of fairness and justice. From there it is a short distance to grounding the whole system in the mundane notion of truth. In fact, valid argument systems must depend on representations that are "true" by virtue of agreement with purported…
Descriptors: Scores, Validity, Test Interpretation, Cutting Scores
Ho, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2013
In his thoughtful focus article, Haertel (this issue) pushes testing experts to broaden the scope of their validation efforts and to invite scholars from other disciplines to join them. He credits existing validation frameworks for helping the measurement community to identify incomplete or nonexistent validity arguments. However, he notes his…
Descriptors: Educational Testing, Scores, Test Use, Test Validity
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
The author is deeply gratified by the commentators' thoughtful responses and finds almost nothing to disagree with in any of them. Each offers additional insights prompting further reflection. In drawing out just a few common themes, this brief rejoinder omits many important ideas from the individual contributions. As stated in his title, the…
Descriptors: Educational Testing, Educational Improvement, Test Interpretation, Test Use
Tienken, Christopher H. – Kappa Delta Pi Record, 2015
The ubiquitous use of standardized test results to make varied judgments about educators, students, and schools within the public school system raises concerns of validity. If the test results have not been validated for making multiple determinations, then the decisions made about educators, students, schools, and school districts based on the…
Descriptors: Standardized Tests, Test Use, Test Results, Test Interpretation
Schleicher, Andreas – OECD Publishing, 2019
The OECD Programme for International Student Assessment (PISA) examines what students know in reading, mathematics and science, and what they can do with what they know. It provides the most comprehensive and rigorous international assessment of student learning outcomes to date. Results from PISA indicate the quality and equity of learning…
Descriptors: Test Results, Test Interpretation, Achievement Tests, Foreign Countries
Bruni, Teryn P. – Journal of Psychoeducational Assessment, 2014
This article reviews the Social Responsiveness Scale-Second Edition (SRS-2), a 65-item rating scale measuring deficits in social behavior associated with Autism Spectrum Disorder (ASD), as outlined by the "Diagnostic and Statistical Manual of Mental Disorders" (4th ed., text rev.; "DSM-IV-TR"; American Psychiatric Association,…
Descriptors: Behavior Rating Scales, Social Behavior, Autism, Pervasive Developmental Disorders
Chapelle, Carol A. – Language Testing, 2012
According to Kane (2006), the argument-based framework is quite simple and involves two steps. First, specify the proposed interpretations and uses of the scores in some detail. Second, evaluate the overall plausibility of the proposed interpretations and uses. Based on experience gained in developing that validity argument, Chapelle, Enright, and…
Descriptors: Validity, Language Tests, Test Interpretation, Test Use
Tracey, Terence J. G. – Journal of Vocational Behavior, 2012
The presence of the general factor in interest and self-efficacy assessment and its meaning are reviewed. The general factor is found in all interest and self-efficacy assessment and has been viewed as (a) a nuisance factor with little effect on assessment, (b) a variable having substantive meaning and thus worthy of including in interpretation,…
Descriptors: Interest Inventories, Self Efficacy, Test Interpretation, Evaluation Problems
Skaggs, Gary – Measurement: Interdisciplinary Research and Perspectives, 2013
The construct map is a particularly good way to approach instrument development, and this author states that he was delighted to read Adam Wyse's thoughts about how to use construct maps for standard setting. For a number of popular standard-setting methods, Wyse shows how typical feedback to panelists fits within a construct map framework.…
Descriptors: Standard Setting (Scoring), Maps, Test Construction, Measurement
Hall, Anna H.; Tannebaum, Rory P. – Journal of Psychoeducational Assessment, 2013
The first edition of the Gray Oral Reading Tests (GORT, 1963) was written by Dr. William S. Gray, a founding member and the first president of the International Reading Association. The GORT was designed to measure oral reading abilities (i.e., Rate, Accuracy, Fluency, and Comprehension) of students in Grades 2 through 12 due to the noteworthy…
Descriptors: Oral Reading, Reading Tests, Children, Testing
Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017
Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias
Cheng, Liying; Sun, Youyi – Language Assessment Quarterly, 2015
This article draws on Kane's (2006) argument-based validation framework to synthesize evidence derived from a large-scale, mixed-method explanatory study on the impact of the Ontario Secondary School Literacy Test (OSSLT) on second language (L2) students. The purpose of the OSSLT is to ensure that students have acquired the essential reading and…
Descriptors: Foreign Countries, Secondary School Students, Literacy, Reading Tests
Irby, Sarah M.; Floyd, Randy G. – Canadian Journal of School Psychology, 2013
The Wechsler Abbreviated Scale of Intelligence, Second Edition (WASI-II; Wechsler, 2011) is a brief intelligence test designed for individuals aged 6 through 90 years. It is a revision of the Wechsler Abbreviated Scale of Intelligence (WASI; Wechsler, 1999). During revision, there were three goals: enhancing the link between the Wechsler…
Descriptors: Test Reviews, Intelligence Tests, Psychometrics, Item Analysis

Peer reviewed
Direct link
