NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…
Descriptors: Validity, Construct Validity, Tests, Testing
Kane, Michael; Case, Susan – 2003
The scores on two distinct tests (e.g., essay and objective) are often combined into a composite score, which is used to make decisions. The validity of the observed composite can sometimes be evaluated relative to a separate criterion. In cases where no criterion is available, the observed composite has generally been evaluated in terms of its…
Descriptors: Reliability, Simulation, Validity, Weighted Scores
Kane, Michael – 2000
Validity is concerned with the clarification and justification of the intended interpretations and uses of observed scores. It has not been easy to formulate a general methodology set of principles for validation, but progress has been made, especially as the field has moved from relatively limited criterion-related models to sophisticated…
Descriptors: Scores, Test Interpretation, Test Results, Theories
Kane, Michael – 1999
The relationship between generalizability and validity is explained, making four important points. The first is that generalizability coefficients provide upper bounds on validity. The second point is that generalization is one step in most interpretive arguments, and therefore, generalizability is a necessary condition for the validity of these…
Descriptors: Error of Measurement, Generalizability Theory, Test Interpretation, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael; Case, Susan M. – Applied Measurement in Education, 2004
The scores on 2 distinct tests (e.g., essay and objective) are often combined to create a composite score, which is used to make decisions. The validity of the observed composite can sometimes be evaluated relative to an external criterion. However, in cases where no criterion is available, the observed composite has generally been evaluated in…
Descriptors: Validity, Weighted Scores, Reliability, Student Evaluation
Peer reviewed Peer reviewed
Kane, Michael; Crooks, Terence; Cohen, Allan – Educational Measurement: Issues and Practice, 1999
Analyzes the three major inferences involved in interpretation of performance assessments: (1) scoring of the observed performances; (2) generalization to a domain of assessment performances like those included in the assessment; and (3) extrapolation to the large performance domain of interest. Suggests ways to improve the validity of performance…
Descriptors: Performance Based Assessment, Performance Factors, Scoring, Test Interpretation
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2007
According to Schilling, Blunk, and Hill, the set of papers presented in this journal issue had two main purposes: (1) to use an argument-based approach to evaluate the validity of the tests of mathematical knowledge for teaching (MKT), and (2) to critically assess the author's version of an argument-based approach to validation (Kane, 2001, 2004).…
Descriptors: Mathematics Education, Validity, Knowledge Base for Teaching, Knowledge Level
Peer reviewed Peer reviewed
Kane, Michael – Educational Measurement: Issues and Practice, 2002
Makes the point that the interpretations and use of high-stakes test scores rely on policy assumptions about what should be taught and the content standards and performance standards that should be applied. The assumptions built into an assessment need to be subjected to scrutiny and criticism if a strong case is to be made for the validity of the…
Descriptors: Educational Policy, Elementary Secondary Education, High Stakes Tests, Scores
Peer reviewed Peer reviewed
Kane, Michael – Applied Measurement in Education, 1996
This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)
Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques
Kane, Michael – 2001
L. Cronbach has made the point that for validity arguments to be convincing to diverse audiences, they need to be based on assumptions that are credible to these audiences. The interpretations and uses of high stakes test scores rely on a number of policy assumptions about what should be taught in schools, and more specifically, about the content…
Descriptors: Educational Policy, Elementary Secondary Education, Graduation Requirements, High Stakes Tests
Peer reviewed Peer reviewed
Kane, Michael – Review of Educational Research, 1994
A framework is provided for examining the validity of performance standards for high-stakes achievement tests. Assumptions that need to be evaluated are that the passing score corresponds to specified performance standards and that the specified standard is reasonable given the purpose of the decision. (SLD)
Descriptors: Achievement Tests, Criteria, Cutting Scores, Decision Making
Kane, Michael – 1993
A general framework is provided for examining the validity of performance standards for high-stakes achievement tests. The emphasis is on conceptual issues and broadly defined methodological questions, the types of validity evidence that can be collected, and the advantages and limitations of different types of evidence. Validation consists of a…
Descriptors: Academic Standards, Achievement Tests, Cutting Scores, Data Collection