ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	11

Descriptor

Test Interpretation	30
Test Validity	18
Validity	11
Scores	10
Test Reliability	8
Test Use	7
Item Analysis	6
Test Construction	6
Test Results	6
Testing Problems	6
Achievement Tests	5
Tests	4
Theories	4
Academic Achievement	3
Evidence	3
Generalization	3
Inferences	3
Mathematical Models	3
Measurement	3
Measurement Techniques	3
Sampling	3
Scoring	3
Test Items	3
Testing	3
Aptitude Tests	2

Source

Journal of Educational…

Publication Type

Journal Articles	20
Reports - Research	10
Opinion Papers	7
Reports - Evaluative	3

Audience

Researchers

Assessments and Surveys

ACT Interest Inventory	1
Differential Aptitude Test	1
Iowa Tests of Basic Skills	1
Lexile Scale of Reading	1
Metropolitan Achievement Tests	1
Peabody Picture Vocabulary…	1
Sequential Tests of…	1

Showing 1 to 15 of 30 results

Does Timed Testing Affect the Interpretation of Efficiency Scores?--A GLMM Analysis of Reading Components

Peer reviewed

Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024

The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…

Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Validating the Interpretations and Uses of Test Scores

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 2013

To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

Descriptors: Test Interpretation, Validity, Scores, Test Use

Two Kinds of Argument?

Peer reviewed

Newton, Paul E. – Journal of Educational Measurement, 2013

Kane distinguishes between two kinds of argument: the interpretation/use argument and the validity argument. This commentary considers whether there really are two kinds of argument, two arguments, or just one. It concludes that there is just one argument: the validity argument. (Contains 2 figures and 5 notes.)

Descriptors: Validity, Test Interpretation, Test Use

Agreeing on Validity Arguments

Peer reviewed

Sireci, Stephen G. – Journal of Educational Measurement, 2013

Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…

Descriptors: Validity, Theories, Test Interpretation, Test Use

Truth and Evidence in Validity Theory

Peer reviewed

Borsboom, Denny; Markus, Keith A. – Journal of Educational Measurement, 2013

According to Kane (this issue), "the validity of a proposed interpretation or use depends on how well the evidence supports" the claims being made. Because truth and evidence are distinct, this means that the validity of a test score interpretation could be high even though the interpretation is false. As an illustration, we discuss the case of…

Descriptors: Evidence, Ethics, Validity, Theories

Addressing the Extreme Assumptions of Presumed Linkings

Peer reviewed

Dorans, Neil J.; Middleton, Kyndra – Journal of Educational Measurement, 2012

The interpretability of score comparisons depends on the design and execution of a sound data collection plan and the establishment of linkings between these scores. When comparisons are made between scores from two or more assessments that are built to different specifications and are administered to different populations under different…

Descriptors: Tests, Equated Scores, Test Interpretation, Validity

Commentary on "Validating the Interpretations and Uses of Test Scores"

Peer reviewed

Brennan, Robert L. – Journal of Educational Measurement, 2013

Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

Descriptors: Validity, Test Interpretation, Test Use, Scores

Measuring Growth with Vertical Scales

Peer reviewed

Briggs, Derek C. – Journal of Educational Measurement, 2013

A vertical score scale is needed to measure growth across multiple tests in terms of absolute changes in magnitude. Since the warrant for subsequent growth interpretations depends upon the assumption that the scale has interval properties, the validation of a vertical scale would seem to require methods for distinguishing interval scales from…

Descriptors: Measurement, Scaling, Validity, Test Interpretation

Validation as a Pragmatic, Scientific Activity

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 2013

This response to the comments contains three main sections, each addressing a subset of the comments. In the first section, I will respond to the comments by Brennan, Haertel, and Moss. All of these comments suggest ways in which my presentation could be extended or improved; I generally agree with their suggestions, so my response to their…

Descriptors: Validity, Test Interpretation, Test Use, Scores

Validity in Action: Lessons from Studies of Data Use

Peer reviewed

Moss, Pamela A. – Journal of Educational Measurement, 2013

Studies of data use illuminate ways in which education professionals have used test scores and other evidence relevant to students' learning--in action in their own contexts of work--to make decisions about their practice. These studies raise instructive challenges for a validity theory that focuses on intended interpretations and uses of test…

Descriptors: Validity, Test Use, Test Interpretation, Scores

The Issue of Item and Test Variance for Criterion-Referenced Tests: A Clarification

Peer reviewed

Millman, Jason; Popham, W. James – Journal of Educational Measurement, 1974

The use of the regression equation derived from the Anglo-American sample to predict grades of Mexican-American students resulted in overprediction. An examination of the standardized regression weights revealed a significant difference in the weight given to the Scholastic Aptitude Test Mathematics Score. (Author/BB)

Descriptors: Criterion Referenced Tests, Item Analysis, Predictive Validity, Scores

Analysis of Test Bias in Four Groups with the Regression Definition

Peer reviewed

Reschly, Daniel J.; Sabers, Darrell L. – Journal of Educational Measurement, 1979

Test bias, assumed as equal regression lines between two different tests for different populations, was investigated to predict Metropolitan Achievement Tests from the Wechsler Intelligence Scale for Children--Revised. Subjects were 1,040 children in grades 1, 3, 5, 7, and 9: Anglo American, Black, Mexican American, and Native American Papago. (JKS)

Descriptors: Academic Achievement, Elementary Education, Intelligence Tests, Minority Group Children

Converting Test Data to Counseling Information: System Trial--With Feedback

Peer reviewed

Prediger, Dale J. – Journal of Educational Measurement, 1971

A computer-based system for converting test data into locally-validated counseling information was developed and field tested with potential vocational school students. Two data information conversion procedures were used: similarity (centour) scores based on discriminant analyses and success estimates based on experience tables. Illustrations of…

Descriptors: Career Counseling, Computer Oriented Programs, Test Interpretation, Test Results

Survey Testing on an Out-Of-Level Basis

Peer reviewed

Ayrer, James E.; McNamara, Thomas C. – Journal of Educational Measurement, 1973

"Out-of-level" testing is the assigning of pupils to levels of a standardized test on the basis of previous test scores rather than their present grade assignment. Test results of 1500 children were reviewed to see if their performance supported the rationale behind the practice. (Author/CB)

Descriptors: Achievement Rating, Elementary School Students, Standardized Tests, Test Interpretation
