NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Journal of Educational Measurement, 2013
Kane distinguishes between two kinds of argument: the interpretation/use argument and the validity argument. This commentary considers whether there really are two kinds of argument, two arguments, or just one. It concludes that there is just one argument: the validity argument. (Contains 2 figures and 5 notes.)
Descriptors: Validity, Test Interpretation, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Haertel, Edward – Journal of Educational Measurement, 2013
In validating uses of testing, it is helpful to distinguish those that rely directly on the information provided by scores or score distributions ("direct" uses and consequences) versus those that instead capitalize on the motivational effects of testing, or use testing and test reporting to shape public opinion ("indirect" uses and consequences).…
Descriptors: Validity, Testing, Test Results, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Sireci, Stephen G. – Journal of Educational Measurement, 2013
Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…
Descriptors: Validity, Theories, Test Interpretation, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Borsboom, Denny; Markus, Keith A. – Journal of Educational Measurement, 2013
According to Kane (this issue), "the validity of a proposed interpretation or use depends on how well the evidence supports" the claims being made. Because truth and evidence are distinct, this means that the validity of a test score interpretation could be high even though the interpretation is false. As an illustration, we discuss the case of…
Descriptors: Evidence, Ethics, Validity, Theories
Peer reviewed Peer reviewed
Direct linkDirect link
Brennan, Robert L. – Journal of Educational Measurement, 2013
Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael T. – Journal of Educational Measurement, 2013
This response to the comments contains three main sections, each addressing a subset of the comments. In the first section, I will respond to the comments by Brennan, Haertel, and Moss. All of these comments suggest ways in which my presentation could be extended or improved; I generally agree with their suggestions, so my response to their…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Moss, Pamela A. – Journal of Educational Measurement, 2013
Studies of data use illuminate ways in which education professionals have used test scores and other evidence relevant to students' learning--in action in their own contexts of work--to make decisions about their practice. These studies raise instructive challenges for a validity theory that focuses on intended interpretations and uses of test…
Descriptors: Validity, Test Use, Test Interpretation, Scores
Peer reviewed Peer reviewed
Secolsky, Charles – Journal of Educational Measurement, 1987
For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)
Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items
Peer reviewed Peer reviewed
Tallmadge, G. Kasten – Journal of Educational Measurement, 1985
Support for the validity of the equipercentile assumption is presented in contrast with the conclusion of Powers, Slaughter, and Helmick (EJ 289 091). Observed "gains" from pre- to posttests are better attributed to stakeholder bias, posttests that match curriculum content too closely, or a combination of these factors. (Author/DWH)
Descriptors: Data Interpretation, Evaluation Methods, Norm Referenced Tests, Predictive Measurement
Peer reviewed Peer reviewed
Haertel, Edward; Calfee, Robert – Journal of Educational Measurement, 1983
The history of the relation between achievement tests and curriculum programs is reviewed, and it is concluded that content specialists are best qualified as sources of curricular goals to specify content, kinds of attainment, and standards. The specification of instructional intents is also considered from the perspective of modern cognition…
Descriptors: Achievement Tests, Cognitive Processes, Curriculum, Instructional Development
Peer reviewed Peer reviewed
Airasian, Peter W. – Journal of Educational Measurement, 1985
The Stanford Achievement Test Forms E and F were judged to be one of the best achievement batteries for assessing basic skills taught in grades one through nine. The test publisher provides several booklets in addition to the administration manual. These include the Norms Booklet, Handbook of Instructional Strategies, and Guide to Classroom…
Descriptors: Academic Achievement, Achievement Rating, Achievement Tests, Elementary Secondary Education
Peer reviewed Peer reviewed
Hall, Alfred E. – Journal of Educational Measurement, 1985
The 12 subtests of the Ball Aptitude Battery (BAB) listed in the administration manual were described. The reviewer believes this aptitude battery, designed for use with high school students and adults in job selection and placement, needs major improvements. It is suggested that the BAB be used solely for research purposes. (DWH)
Descriptors: Adults, Aptitude Tests, High Schools, Norm Referenced Tests
Peer reviewed Peer reviewed
Messick, Samuel – Journal of Educational Measurement, 1984
Comprehensive assessment in context focuses on the processes and structures involved in subject matter competence as moderated in performance by personal and environmental influences. This article addresses in detail both the nature of developing competence and its measurement in terms of context-dependent task performance. (Author/EGS)
Descriptors: Academic Achievement, Achievement Tests, Cognitive Ability, Cognitive Development
Peer reviewed Peer reviewed
Wainer, Howard – Journal of Educational Measurement, 1993
Focusing on educational measurement that suggests an action and has an outcome, 16 problem areas are defined and grouped into the following classes: (1) validity; (2) issues of statistical adjustment; (3) data insufficiencies; (4) other issues related to standardized testing and constructed responses; and (5) technical issues of psychometrics.…
Descriptors: Comparative Analysis, Computer Uses in Education, Constructed Response, Educational Assessment