Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024
The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…
Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Newton, Paul E. – Journal of Educational Measurement, 2013
Kane distinguishes between two kinds of argument: the interpretation/use argument and the validity argument. This commentary considers whether there really are two kinds of argument, two arguments, or just one. It concludes that there is just one argument: the validity argument. (Contains 2 figures and 5 notes.)
Descriptors: Validity, Test Interpretation, Test Use
Sireci, Stephen G. – Journal of Educational Measurement, 2013
Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…
Descriptors: Validity, Theories, Test Interpretation, Test Use
Borsboom, Denny; Markus, Keith A. – Journal of Educational Measurement, 2013
According to Kane (this issue), "the validity of a proposed interpretation or use depends on how well the evidence supports" the claims being made. Because truth and evidence are distinct, this means that the validity of a test score interpretation could be high even though the interpretation is false. As an illustration, we discuss the case of…
Descriptors: Evidence, Ethics, Validity, Theories
Dorans, Neil J.; Middleton, Kyndra – Journal of Educational Measurement, 2012
The interpretability of score comparisons depends on the design and execution of a sound data collection plan and the establishment of linkings between these scores. When comparisons are made between scores from two or more assessments that are built to different specifications and are administered to different populations under different…
Descriptors: Tests, Equated Scores, Test Interpretation, Validity
Brennan, Robert L. – Journal of Educational Measurement, 2013
Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Briggs, Derek C. – Journal of Educational Measurement, 2013
A vertical score scale is needed to measure growth across multiple tests in terms of absolute changes in magnitude. Since the warrant for subsequent growth interpretations depends upon the assumption that the scale has interval properties, the validation of a vertical scale would seem to require methods for distinguishing interval scales from…
Descriptors: Measurement, Scaling, Validity, Test Interpretation
Kane, Michael T. – Journal of Educational Measurement, 2013
This response to the comments contains three main sections, each addressing a subset of the comments. In the first section, I will respond to the comments by Brennan, Haertel, and Moss. All of these comments suggest ways in which my presentation could be extended or improved; I generally agree with their suggestions, so my response to their…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Moss, Pamela A. – Journal of Educational Measurement, 2013
Studies of data use illuminate ways in which education professionals have used test scores and other evidence relevant to students' learning--in action in their own contexts of work--to make decisions about their practice. These studies raise instructive challenges for a validity theory that focuses on intended interpretations and uses of test…
Descriptors: Validity, Test Use, Test Interpretation, Scores
Kane, Michael – Journal of Educational Measurement, 2011
Errors don't exist in our data, but they serve a vital function. Reality is complicated, but our models need to be simple in order to be manageable. We assume that attributes are invariant over some conditions of observation, and once we do that we need some way of accounting for the variability in observed scores over these conditions of…
Descriptors: Error of Measurement, Scores, Test Interpretation, Testing
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Shoemaker, David M. – Journal of Educational Measurement, 1970
Descriptors: Item Sampling, Norms, Test Interpretation
Marco, Gary L. – Journal of Educational Measurement, 1977
A method of computing percentile ranks from the logistic distribution function is described. The method was applied to three distributions of number-right scores from the Law School Admission Test, and compared to percentile ranks computed using linear interpolation. (Author/JKS)
Descriptors: Computation, Law Schools, Scores, Test Interpretation
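The two approaches Marco compares can be sketched in a few lines. This is a minimal illustration, not the study's actual procedure: the logistic location and scale parameters and the frequency table below are made-up values, and the conventional percentile-rank definition used here (percent below plus half the percent at the score) is one common convention, chosen for illustration.

```python
import math

def logistic_percentile_rank(score, loc, scale):
    """Percentile rank read directly from a logistic distribution
    function fitted to the score scale (loc and scale are assumed,
    illustrative parameters)."""
    return 100.0 / (1.0 + math.exp(-(score - loc) / scale))

def interpolated_percentile_rank(score, freq):
    """Conventional percentile rank from a frequency table of
    number-right scores: percent of examinees below the score plus
    half the percent at the score."""
    total = sum(freq.values())
    below = sum(n for s, n in freq.items() if s < score)
    at = freq.get(score, 0)
    return 100.0 * (below + 0.5 * at) / total

# Hypothetical number-right score frequencies for illustration only.
freqs = {10: 2, 20: 2, 30: 4, 40: 2}
print(logistic_percentile_rank(25, 25, 5))      # midpoint of the logistic curve
print(interpolated_percentile_rank(30, freqs))  # table-based rank at score 30
```

The smooth logistic curve yields ranks at any score point, whereas the table-based method only interpolates between observed score frequencies.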