ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	13

Descriptor

Test Use	45
Test Construction	12
Test Validity	11
Scores	10
Test Interpretation	10
Testing Problems	10
Test Items	8
Validity	8
Achievement Tests	7
Educational Assessment	7
Elementary Secondary Education	7
Academic Achievement	6
Measurement Techniques	6
Computer Assisted Testing	5
Educational Testing	5
Testing	5
Theories	5
Educational Research	4
Evaluation Methods	4
Higher Education	4
Standardized Tests	4
Test Reliability	4
Test Results	4
Testing Programs	4
Tests	4
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	45
Reports - Research	14
Opinion Papers	13
Book/Product Reviews	10
Reports - Evaluative	7
Information Analyses	4
Reports - Descriptive	2
Guides - Non-Classroom	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Alabama High School…	1
Kaufman Assessment Battery…	1
Law School Admission Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 45 results Save | Export

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Direct link

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Confidence Intervals for Weighted Composite Scores under the Compound Binomial Error Model

Peer reviewed

Direct link

Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2018

Reporting confidence intervals with test scores helps test users make important decisions about examinees by providing information about the precision of test scores. Although a variety of estimation procedures based on the binomial error model are available for computing intervals for test scores, these procedures assume that items are randomly…

Descriptors: Weighted Scores, Error of Measurement, Test Use, Decision Making

Further Study of the Choice of Anchor Tests in Equating

Peer reviewed

Direct link

Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016

In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…

Descriptors: Scores, Correlation, Tests, Test Reliability

How Developments in Psychology and Technology Challenge Validity Argumentation

Peer reviewed

Direct link

Mislevy, Robert J. – Journal of Educational Measurement, 2016

Validity is the sine qua non of properties of educational assessment. While a theory of validity and a practical framework for validation has emerged over the past decades, most of the discussion has addressed familiar forms of assessment and psychological framings. Advances in digital technologies and in cognitive and social psychology have…

Descriptors: Test Validity, Technology, Cognitive Psychology, Social Psychology

Two Kinds of Argument?

Peer reviewed

Direct link

Newton, Paul E. – Journal of Educational Measurement, 2013

Kane distinguishes between two kinds of argument: the interpretation/use argument and the validity argument. This commentary considers whether there really are two kinds of argument, two arguments, or just one. It concludes that there is just one argument: the validity argument. (Contains 2 figures and 5 notes.)

Descriptors: Validity, Test Interpretation, Test Use

Getting the Help We Need

Peer reviewed

Direct link

Haertel, Edward – Journal of Educational Measurement, 2013

In validating uses of testing, it is helpful to distinguish those that rely directly on the information provided by scores or score distributions ("direct" uses and consequences) versus those that instead capitalize on the motivational effects of testing, or use testing and test reporting to shape public opinion ("indirect" uses and consequences).…

Descriptors: Validity, Testing, Test Results, Test Use

Agreeing on Validity Arguments

Peer reviewed

Direct link

Sireci, Stephen G. – Journal of Educational Measurement, 2013

Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…

Descriptors: Validity, Theories, Test Interpretation, Test Use

Truth and Evidence in Validity Theory

Peer reviewed

Direct link

Borsboom, Denny; Markus, Keith A. – Journal of Educational Measurement, 2013

According to Kane (this issue), "the validity of a proposed interpretation or use depends on how well the evidence supports" the claims being made. Because truth and evidence are distinct, this means that the validity of a test score interpretation could be high even though the interpretation is false. As an illustration, we discuss the case of…

Descriptors: Evidence, Ethics, Validity, Theories

Commentary on "Validating the Interpretations and Uses of Test Scores"

Peer reviewed

Direct link

Brennan, Robert L. – Journal of Educational Measurement, 2013

Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

Descriptors: Validity, Test Interpretation, Test Use, Scores

Validation as a Pragmatic, Scientific Activity

Peer reviewed

Direct link

Kane, Michael T. – Journal of Educational Measurement, 2013

This response to the comments contains three main sections, each addressing a subset of the comments. In the first section, I will respond to the comments by Brennan, Haertel, and Moss. All of these comments suggest ways in which my presentation could be extended or improved; I generally agree with their suggestions, so my response to their…

Descriptors: Validity, Test Interpretation, Test Use, Scores

Validity in Action: Lessons from Studies of Data Use

Peer reviewed

Direct link

Moss, Pamela A. – Journal of Educational Measurement, 2013

Studies of data use illuminate ways in which education professionals have used test scores and other evidence relevant to students' learning--in action in their own contexts of work--to make decisions about their practice. These studies raise instructive challenges for a validity theory that focuses on intended interpretations and uses of test…

Descriptors: Validity, Test Use, Test Interpretation, Scores

Validating the Interpretations and Uses of Test Scores

Peer reviewed

Direct link

Kane, Michael T. – Journal of Educational Measurement, 2013

To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

Descriptors: Test Interpretation, Validity, Scores, Test Use

Increasing the Reliability of Ability-Achievement Difference Scores: An Example Using the Kaufman Assessment Battery for Children.

Peer reviewed

Caruso, John C.; Witkiewitz, Katie – Journal of Educational Measurement, 2002

As an alternative to equally weighted difference scores, examined an orthogonal reliable component analysis (RCA) solution and an oblique principal components analysis (PCA) solution for the standardization sample of the Kaufman Assessment Battery for Children (KABC; A. Kaufman and N. Kaufman, 1983). Discusses the practical implications of the…

Descriptors: Ability, Academic Achievement, Children, Factor Analysis

The New Faces of Fairness.

Peer reviewed

Cole, Nancy S.; Zieky, Michael J. – Journal of Educational Measurement, 2001

Proposes additional ways for people in the measurement profession to think about the fairness of assessments and about the fairness of the uses of assessments. Suggests that measurement professionals must pay more attention to reducing group differences at the design stage of test development, to providing all examinees an opportunity to…

Descriptors: Educational Testing, Equal Education, Groups, Test Bias

Previous Page | Next Page »

Pages: 1 | 2 | 3

Fitzpatrick, Anne R.	2
Haertel, Edward	2
Kane, Michael T.	2
Lewis, Charles	2
Linn, Robert L.	2
Airasian, Peter W.	1
Amery D. Wu	1
Baker, Eva L.	1
Bennett, Randy Elliot	1
Borsboom, Denny	1
Brennan, Robert L.	1
Bridgeford, Nancy J.	1
Bridgeman, Brent	1
Cahalan, Cara	1
Calfee, Robert	1
Caruso, John C.	1
Cole, Nancy S.	1
Curtis, Mary E.	1
Ebel, Robert L.	1
Eignor, Daniel R.	1
Ercikan, Kadriye	1
Frisbie, David A.	1
Gallagher, Ann	1
Glaser, Robert	1
More ▼