ERIC - Search Results

Showing 1 to 15 of 22 results

Resources for Identifying Measurement Instruments for Social Science Research

Peer reviewed

Schumacker, Randall E.; Wind, Stefanie A.; Holmes, Lauren F. – Measurement: Interdisciplinary Research and Perspectives, 2021

A variety of resources are available from which researchers can identify measurement instruments, including peer-reviewed journal articles, collections of technical information about published instruments, and electronic databases that are sponsored by universities, testing organizations, and other groups. Although these resources are widespread,…

Descriptors: Measurement Techniques, Journal Articles, Databases, Testing

Exploring the Stability of Differential Item Functioning across Administrations and Critical Values Using the Rasch Separate Calibration "t"-Test Method

Peer reviewed

Peabody, Michael R.; Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2019

Differential Item Functioning (DIF) detection procedures provide validity evidence for proposed interpretations of test scores that can help researchers and practitioners ensure that test scores are free from potential bias, and that individual items do not create an advantage for any subgroup of examinees over another. In this study, we use the…

Descriptors: Item Response Theory, Test Items, Scores, Testing

Formative versus Reflective Measurement in Executive Functions: A Critique of Willoughby et al.

Peer reviewed

Peterson, Eric; Welsh, Marilyn C. – Measurement: Interdisciplinary Research and Perspectives, 2014

Research into executive functioning (EF) has indeed grown exponentially across the past few decades, but as the Willoughby et al. critique makes clear, there remain fundamental questions to be resolved. The crux of their argument is built upon an examination of the confirmatory factor analysis (CFA) approach to understanding executive processes.…

Descriptors: Executive Function, Measurement, Factor Analysis, Reliability

Lies, Damn Lies, and Tests

Peer reviewed

Garner, Mary – Measurement: Interdisciplinary Research and Perspectives, 2013

In "How Is Testing Supposed to Improve Schooling," Haertel describes seven broad mechanisms whereby testing is used to improve schooling (this issue). The first four are direct mechanisms, meaning that "test scores are taken as indicators of some underlying construct and on that basis scores are used to guide some decision or draw some…

Descriptors: Testing, Early Intervention, Educational Improvement, Change Strategies

Consequences of Assessment and Accountability Systems Are Integral to the Argument-Based Approach to Validity

Peer reviewed

Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012

Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…

Descriptors: Educational Opportunities, Accountability, Validity, Inferences

Involving Diverse Communities of Practice to Minimize Unintended Consequences of Test-Based Accountability Systems

Peer reviewed

Behizadeh, Nadia; Engelhard, George, Jr. – Measurement: Interdisciplinary Research and Perspectives, 2015

In his focus article, Koretz (this issue) argues that accountability has become the primary function of large-scale testing in the United States. He then points out that tests being used for accountability purposes are flawed and that the high-stakes nature of these tests creates a context that encourages score inflation. Koretz is concerned about…

Descriptors: Communities of Practice, High Stakes Tests, Testing, Test Validity

Validity Cannot Be Created, It Can Only Be Lost

Peer reviewed

Pollitt, Alastair – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton's article is valuable in many ways, especially for clarifying confusions and inconsistencies in the assessment business. Most importantly, he points out confusions that persist and where open discussion will help us understand what we say and what we mean to say. But I will focus here on the only faults I find in the article: three…

Descriptors: Validity, Evaluation, Definitions, Test Construction

Whose Consensus Is It Anyway? Scientific versus Legalistic Conceptions of Validity

Peer reviewed

Borsboom, Denny – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton provides an insightful and scholarly overview of central issues in validity theory. As he notes, many of the conceptual problems in validity theory derive from the fact that the word "validity" has two meanings. First, it indicates "whether a test measures what it purports to measure." This is a factual claim about the psychometric…

Descriptors: Validity, Psychometrics, Test Interpretation, Scores

Conceptions of Validity: The Private and the Public

Peer reviewed

Braun, Henry – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton is to be commended for addressing as challenging a topic as the clarification of the concept of validity. The impetus for this foray is Newton's judgment that, despite decades of development, the definition and elaboration of the term test validity in the 1999 "Standards" retains sufficient ambiguity to permit, if not invite, both…

Descriptors: Educational Improvement, Test Validity, Validity, Tests

From Construct Validity to Theory Validation

Peer reviewed

Haig, Brian D. – Measurement: Interdisciplinary Research and Perspectives, 2012

Lee Cronbach once expressed the view that all roads lead to construct validity. In looking to clarify the consensus definition of validity, and its place in assessment, Newton is also led to the troublesome idea of construct validity. To be sure, he addresses other validity issues, but in this commentary, I will restrict my attention to construct…

Descriptors: Validity, Educational Assessment, Construct Validity, Definitions

All Validity Is Construct Validity. Or Is It?

Peer reviewed

Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…

Descriptors: Validity, Construct Validity, Tests, Testing

Promoting Rigorous Validation Practice: An Applied Perspective

Peer reviewed

Mattern, Krista D.; Kobrin, Jennifer L.; Camara, Wayne J. – Measurement: Interdisciplinary Research and Perspectives, 2012

As researchers at a testing organization concerned with the appropriate uses and validity evidence for our assessments, we provide an applied perspective related to the issues raised in the focus article. Newton's proposal for elaborating the consensus definition of validity is offered with the intention to reduce the risks of inadequate…

Descriptors: Evidence, Validity, Tests, Testing

Externalities of Testing: Lessons from the Blizzard of 2010

Peer reviewed

Feuer, Michael J. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents lessons learned from a story of a snowy and dangerous intersection where there was no way for pedestrians to cross. The basic theme of this paper is that if political economy is preoccupied largely with the measurement of externalities, then a goal for the testing and assessment policy community should be to devise strategies…

Descriptors: Testing, Measurement, Educational Assessment, Accountability

Person Response Functions and the Definition of Units in the Social Sciences

Peer reviewed

Engelhard, George, Jr.; Perkins, Aminah F. – Measurement: Interdisciplinary Research and Perspectives, 2011

Humphry (this issue) has written a thought-provoking piece on the interpretation of item discrimination parameters as scale units in item response theory. One of the key features of his work is the description of an item response theory (IRT) model that he calls the logistic measurement function that combines aspects of two traditions in IRT that…

Descriptors: Foreign Countries, Social Sciences, Item Response Theory, Testing

Some Notes on the Reinvention of Latent Structure Models as Diagnostic Classification Models

Peer reviewed

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author points out a few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…

Descriptors: Test Items, Probability, Models, Diagnostic Tests
