Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 5 |
Descriptor
Evaluation | 6 |
Scores | 6 |
Test Use | 6 |
Validity | 5 |
Testing | 4 |
Tests | 4 |
Inferences | 3 |
Accountability | 2 |
Construct Validity | 2 |
Language Tests | 2 |
Reliability | 2 |
More ▼ |
Author
Bachman, Lyle F. | 1 |
Kane, Michael | 1 |
Kolen, Michael J. | 1 |
Lane, Suzanne | 1 |
Lee, Won-Chan | 1 |
McNamara, Tim | 1 |
Zand Scholten, Annemarie | 1 |
Publication Type
Journal Articles | 6 |
Opinion Papers | 3 |
Reports - Descriptive | 3 |
Education Level
Audience
Location
Australia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
ACTFL Oral Proficiency… | 1 |
What Works Clearinghouse Rating
Zand Scholten, Annemarie – Measurement: Interdisciplinary Research and Perspectives, 2012
This paper presents the author's critique to Paul E. Newton's article titled "Clarifying the consensus definition of validity." In his article, Newton not only clarifies but also redefines the consensus definition of validity. In this redefinition he omits the term "construct" and introduces the term "measurement." Both omission and introduction…
Descriptors: Validity, Definitions, Evaluation, Test Use
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…
Descriptors: Validity, Construct Validity, Tests, Testing
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Bachman, Lyle F. – Language Assessment Quarterly, 2005
The fields of language testing and educational and psychological measurement have not, as yet, developed a set of principles and procedures for linking test scores and score-based inferences to test use and the consequences of test use. Although Messick (1989) discusses test use and consequences, his framework provides virtually no guidance on how…
Descriptors: Test Use, Testing, Language Tests, Validity
McNamara, Tim – Language Assessment Quarterly, 2006
The thought of Samuel Messick has influenced language testing in 2 main ways: in proposing a new understanding of how inferences made based on tests must be challenged, and in drawing attention to the consequences of test use. The former has had a powerful impact on language-testing research, most notably in Bachman's work on validity and the…
Descriptors: Test Use, Testing, Language Tests, Validity