ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	7

Descriptor

Test Use	9
Test Interpretation	7
Validity	7
Scores	5
Test Validity	4
Theories	4
Academic Achievement	3
Achievement Tests	3
Measurement Techniques	3
Norm Referenced Tests	3
Test Results	3
Beliefs	2
Cognitive Processes	2
Decision Making	2
Educational Research	2
Ethics	2
Evidence	2
Generalization	2
Inferences	2
Predictive Validity	2
Scoring	2
Test Reliability	2
Test Reviews	2
Test Theory	2
Testing	2
More ▼

Source

Journal of Educational…

Author

Haertel, Edward	2
Airasian, Peter W.	1
Borsboom, Denny	1
Brennan, Robert L.	1
Calfee, Robert	1
Hall, Alfred E.	1
Kane, Michael T.	1
Markus, Keith A.	1
Messick, Samuel	1
Moss, Pamela A.	1
Newton, Paul E.	1
Secolsky, Charles	1
Sireci, Stephen G.	1
Tallmadge, G. Kasten	1
Wainer, Howard	1
More ▼

Publication Type

Journal Articles	14
Opinion Papers	14
Information Analyses	3
Reports - Evaluative	2

Education Level

Audience

Practitioners

Location

Laws, Policies, & Programs

Assessments and Surveys

Stanford Achievement Tests

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Two Kinds of Argument?

Peer reviewed

Direct link

Newton, Paul E. – Journal of Educational Measurement, 2013

Kane distinguishes between two kinds of argument: the interpretation/use argument and the validity argument. This commentary considers whether there really are two kinds of argument, two arguments, or just one. It concludes that there is just one argument: the validity argument. (Contains 2 figures and 5 notes.)

Descriptors: Validity, Test Interpretation, Test Use

Getting the Help We Need

Peer reviewed

Direct link

Haertel, Edward – Journal of Educational Measurement, 2013

In validating uses of testing, it is helpful to distinguish those that rely directly on the information provided by scores or score distributions ("direct" uses and consequences) versus those that instead capitalize on the motivational effects of testing, or use testing and test reporting to shape public opinion ("indirect" uses and consequences).…

Descriptors: Validity, Testing, Test Results, Test Use

Agreeing on Validity Arguments

Peer reviewed

Direct link

Sireci, Stephen G. – Journal of Educational Measurement, 2013

Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…

Descriptors: Validity, Theories, Test Interpretation, Test Use

Truth and Evidence in Validity Theory

Peer reviewed

Direct link

Borsboom, Denny; Markus, Keith A. – Journal of Educational Measurement, 2013

According to Kane (this issue), "the validity of a proposed interpretation or use depends on how well the evidence supports" the claims being made. Because truth and evidence are distinct, this means that the validity of a test score interpretation could be high even though the interpretation is false. As an illustration, we discuss the case of…

Descriptors: Evidence, Ethics, Validity, Theories

Commentary on "Validating the Interpretations and Uses of Test Scores"

Peer reviewed

Direct link

Brennan, Robert L. – Journal of Educational Measurement, 2013

Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

Descriptors: Validity, Test Interpretation, Test Use, Scores

Validation as a Pragmatic, Scientific Activity

Peer reviewed

Direct link

Kane, Michael T. – Journal of Educational Measurement, 2013

This response to the comments contains three main sections, each addressing a subset of the comments. In the first section, I will respond to the comments by Brennan, Haertel, and Moss. All of these comments suggest ways in which my presentation could be extended or improved; I generally agree with their suggestions, so my response to their…

Descriptors: Validity, Test Interpretation, Test Use, Scores

Validity in Action: Lessons from Studies of Data Use

Peer reviewed

Direct link

Moss, Pamela A. – Journal of Educational Measurement, 2013

Studies of data use illuminate ways in which education professionals have used test scores and other evidence relevant to students' learning--in action in their own contexts of work--to make decisions about their practice. These studies raise instructive challenges for a validity theory that focuses on intended interpretations and uses of test…

Descriptors: Validity, Test Use, Test Interpretation, Scores

On the Direct Measurement of Face Validity: A Comment on Nevo.

Peer reviewed

Secolsky, Charles – Journal of Educational Measurement, 1987

For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)

Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items

Rumors Regarding the Death of the Equipercentile Assumption May Have Been Greatly Exaggerated.

Peer reviewed

Tallmadge, G. Kasten – Journal of Educational Measurement, 1985

Support for the validity of the equipercentile assumption is presented in contrast with the conclusion of Powers, Slaughter, and Helmick (EJ 289 091). Observed "gains" from pre- to posttests are better attributed to stakeholder bias, posttests that match curriculum content too closely, or a combination of these factors. (Author/DWH)

Descriptors: Data Interpretation, Evaluation Methods, Norm Referenced Tests, Predictive Measurement

School Achievement: Thinking About What to Test.

Peer reviewed

Haertel, Edward; Calfee, Robert – Journal of Educational Measurement, 1983

The history of the relation between achievement tests and curriculum programs is reviewed, and it is concluded that content specialists are best qualified as sources of curricular goals to specify content, kinds of attainment, and standards. The specification of instructional intents is also considered from the perspective of modern cognition…

Descriptors: Achievement Tests, Cognitive Processes, Curriculum, Instructional Development

Stanford Achievement Test Forms E and F (Test Review).

Peer reviewed

Airasian, Peter W. – Journal of Educational Measurement, 1985

The Stanford Achievement Test Forms E and F were judged to be one of the best achievement batteries for assessing basic skills taught in grades one through nine. The test publisher provides several booklets in addition to the administration manual. These include the Norms Booklet, Handbook of Instructional Strategies, and Guide to Classroom…

Descriptors: Academic Achievement, Achievement Rating, Achievement Tests, Elementary Secondary Education

The Ball Aptitude Battery (Test Review).

Peer reviewed

Hall, Alfred E. – Journal of Educational Measurement, 1985

The 12 subtests of the Ball Aptitude Battery (BAB) listed in the administration manual were described. The reviewer believes this aptitude battery, designed for use with high school students and adults in job selection and placement, needs major improvements. It is suggested that the BAB be used solely for research purposes. (DWH)

Descriptors: Adults, Aptitude Tests, High Schools, Norm Referenced Tests

The Psychology of Educational Measurement.

Peer reviewed

Messick, Samuel – Journal of Educational Measurement, 1984

Comprehensive assessment in context focuses on the processes and structures involved in subject matter competence as moderated in performance by personal and environmental influences. This article addresses in detail both the nature of developing competence and its measurement in terms of context-dependent task performance. (Author/EGS)

Descriptors: Academic Achievement, Achievement Tests, Cognitive Ability, Cognitive Development

Measurement Problems.

Peer reviewed

Wainer, Howard – Journal of Educational Measurement, 1993

Focusing on educational measurement that suggests an action and has an outcome, 16 problem areas are defined and grouped into the following classes: (1) validity; (2) issues of statistical adjustment; (3) data insufficiencies; (4) other issues related to standardized testing and constructed responses; and (5) technical issues of psychometrics.…

Descriptors: Comparative Analysis, Computer Uses in Education, Constructed Response, Educational Assessment