Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 11 |
Descriptor
Test Interpretation | 30 |
Test Validity | 18 |
Validity | 11 |
Scores | 10 |
Test Reliability | 8 |
Test Use | 7 |
Item Analysis | 6 |
Test Construction | 6 |
Test Results | 6 |
Testing Problems | 6 |
Achievement Tests | 5 |
More ▼ |
Source
Journal of Educational… | 30 |
Author
Kane, Michael T. | 2 |
Prediger, Dale J. | 2 |
Ayrer, James E. | 1 |
Borsboom, Denny | 1 |
Brennan, Robert L. | 1 |
Briggs, Derek C. | 1 |
Carolin Hahnel | 1 |
Diamond, James J. | 1 |
Dorans, Neil J. | 1 |
Frank Goldhammer | 1 |
Hanna, Gerald S. | 1 |
More ▼ |
Publication Type
Journal Articles | 20 |
Reports - Research | 10 |
Opinion Papers | 7 |
Reports - Evaluative | 3 |
Education Level
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
ACT Interest Inventory | 1 |
Differential Aptitude Test | 1 |
Iowa Tests of Basic Skills | 1 |
Lexile Scale of Reading | 1 |
Metropolitan Achievement Tests | 1 |
Peabody Picture Vocabulary… | 1 |
Sequential Tests of… | 1 |
What Works Clearinghouse Rating
Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024
The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…
Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Newton, Paul E. – Journal of Educational Measurement, 2013
Kane distinguishes between two kinds of argument: the interpretation/use argument and the validity argument. This commentary considers whether there really are two kinds of argument, two arguments, or just one. It concludes that there is just one argument: the validity argument. (Contains 2 figures and 5 notes.)
Descriptors: Validity, Test Interpretation, Test Use
Sireci, Stephen G. – Journal of Educational Measurement, 2013
Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…
Descriptors: Validity, Theories, Test Interpretation, Test Use
Borsboom, Denny; Markus, Keith A. – Journal of Educational Measurement, 2013
According to Kane (this issue), "the validity of a proposed interpretation or use depends on how well the evidence supports" the claims being made. Because truth and evidence are distinct, this means that the validity of a test score interpretation could be high even though the interpretation is false. As an illustration, we discuss the case of…
Descriptors: Evidence, Ethics, Validity, Theories
Dorans, Neil J.; Middleton, Kyndra – Journal of Educational Measurement, 2012
The interpretability of score comparisons depends on the design and execution of a sound data collection plan and the establishment of linkings between these scores. When comparisons are made between scores from two or more assessments that are built to different specifications and are administered to different populations under different…
Descriptors: Tests, Equated Scores, Test Interpretation, Validity
Brennan, Robert L. – Journal of Educational Measurement, 2013
Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Briggs, Derek C. – Journal of Educational Measurement, 2013
A vertical score scale is needed to measure growth across multiple tests in terms of absolute changes in magnitude. Since the warrant for subsequent growth interpretations depends upon the assumption that the scale has interval properties, the validation of a vertical scale would seem to require methods for distinguishing interval scales from…
Descriptors: Measurement, Scaling, Validity, Test Interpretation
Kane, Michael T. – Journal of Educational Measurement, 2013
This response to the comments contains three main sections, each addressing a subset of the comments. In the first section, I will respond to the comments by Brennan, Haertel, and Moss. All of these comments suggest ways in which my presentation could be extended or improved; I generally agree with their suggestions, so my response to their…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Moss, Pamela A. – Journal of Educational Measurement, 2013
Studies of data use illuminate ways in which education professionals have used test scores and other evidence relevant to students' learning--in action in their own contexts of work--to make decisions about their practice. These studies raise instructive challenges for a validity theory that focuses on intended interpretations and uses of test…
Descriptors: Validity, Test Use, Test Interpretation, Scores

Millman, Jason; Popham, W. James – Journal of Educational Measurement, 1974
The use of the regression equation derived from the Anglo-American sample to predict grades of Mexican-American students resulted in overprediction. An examination of the standardized regression weights revealed a significant difference in the weight given to the Scholastic Aptitude Test Mathematics Score. (Author/BB)
Descriptors: Criterion Referenced Tests, Item Analysis, Predictive Validity, Scores

Reschly, Daniel J.; Sabers, Darrell L. – Journal of Educational Measurement, 1979
Test bias, assumed as equal regression lines between two different tests for different populations was investigated to predict Metropolitan Achievement Tests from the Wechsler Intelligence Scale for Children--Revised. Subjects were 1,040 children in grades 1, 3, 5, 7, and 9: Anglo American, Black, Mexican American, and Native American Papago. (JKS)
Descriptors: Academic Achievement, Elementary Education, Intelligence Tests, Minority Group Children

Prediger, Dale J. – Journal of Educational Measurement, 1971
A computer-based system for converting test data into locally-validated counseling information was developed and field tested with potential vocational school students. Two data information conversion procedures were used: similarity (centour) scores based on discriminant analyses and success estimates based on experience tables. Illustrations of…
Descriptors: Career Counseling, Computer Oriented Programs, Test Interpretation, Test Results

Ayrer, James E.; McNamara, Thomas C. – Journal of Educational Measurement, 1973
Out-of-level'' testing is the assigning of pupils to levels of a standardized test on the basis of previous test scores rather than their present grade assignment. Test results of 1500 children were reviewed to see if their performance supported the rationale behind the practice. (Author/CB)
Descriptors: Achievement Rating, Elementary School Students, Standardized Tests, Test Interpretation
Previous Page | Next Page ยป
Pages: 1 | 2