ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	14

Descriptor

Test Theory	15
Psychometrics	10
Evaluation Methods	9
Definitions	8
Measurement Techniques	8
Classification	7
Test Items	7
Measurement	6
Comparative Analysis	5
Educational Assessment	5
Equated Scores	5
Models	5
Diagnostic Tests	4
Educational Testing	4
Evaluation Problems	4
Foreign Countries	4
High Stakes Tests	4
Predictive Measurement	4
Test Interpretation	4
Test Use	4
Testing	4
Testing Problems	4
Item Response Theory	3
Test Construction	3
Computer Software	2
More ▼

Source

Measurement:…

Author

Hancock, Gregory R.	2
An, Ji	1
Baird, Jo-Anne	1
Briggs, Derek C.	1
Cresswell, Mike	1
Grimm, Kevin J.	1
Jiao, Hong	1
Maydeu-Olivares, Alberto	1
Molenaar, Peter C. M.	1
Newton, Paul E.	1
Robitzsch, Alexander	1
Schumacker, Randall	1
Talbot, Robert M.	1
West, Stephen G.	1
Wilhelm, Oliver	1
van der Linden, Wim J.	1
von Davier, Alina A.	1
von Davier, Matthias	1
More ▼

Publication Type

Journal Articles	15
Opinion Papers	10
Reports - Descriptive	4
Reports - Evaluative	2
Reference Materials -…	1
Reports - Research	1

Education Level

Elementary Secondary Education

Audience

Practitioners

Location

United Kingdom	2
United Kingdom (England)	2
United States	2
Australia	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

A Closed-Form Alternative for Estimating [omega] Reliability under Unidimensionality

Peer reviewed

Direct link

Hancock, Gregory R.; An, Ji – Measurement: Interdisciplinary Research and Perspectives, 2020

As an alternative to Cronbach's [alpha] for estimating scale reliability, McDonald's [omega] has attracted increased attention within the methodological community for its less stringent measurement assumptions. Notwithstanding, [omega] is still seldom used by practitioners, likely due to its unavailability in popular software packages (e.g., SPSS)…

Descriptors: Evaluation, Alternative Assessment, Reliability, Test Reliability

Psychometric Packages in R

Peer reviewed

Direct link

Schumacker, Randall – Measurement: Interdisciplinary Research and Perspectives, 2019

The R software provides packages and functions that provide data analysis in classical true score, generalizability theory, item response theory, and Rasch measurement theories. A brief list of notable articles in each measurement theory and the first measurement journals is followed by a list of R psychometric software packages. Each psychometric…

Descriptors: Psychometrics, Computer Software, Measurement, Item Response Theory

Causal Indicator Models: Unresolved Issues of Construction and Evaluation

Peer reviewed

Direct link

West, Stephen G.; Grimm, Kevin J. – Measurement: Interdisciplinary Research and Perspectives, 2014

These authors agree with Bainter and Bollen that causal effects represents a useful measurement structure in some applications. The structure of the science of the measurement problem should determine the model; the measurement model should not determine the science. They also applaud Bainter and Bollen's important reminder that the full…

Descriptors: Causal Models, Measurement, Test Theory, Statistical Analysis

Why Should We Assess the Goodness-of-Fit of IRT Models?

Peer reviewed

Direct link

Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013

In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…

Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory

On Bias in Linear Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010

The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…

Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends

Some Notes on the Reinvention of Latent Structure Models as Diagnostic Classification Models

Peer reviewed

Direct link

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author points out few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…

Descriptors: Test Items, Probability, Models, Diagnostic Tests

Diagnostic Classification Modeling: Opportunity for Identity

Peer reviewed

Direct link

Hancock, Gregory R. – Measurement: Interdisciplinary Research and Perspectives, 2009

As Rupp and Templin (2008) stated directly, diagnostic classification methods "are confirmatory in nature." Methods, though, are neither inherently confirmatory nor exploratory. Diagnostic classification modeling, with its analytical and computational obstacles eventually yielding as a comprehensive and potent discipline emerges, will…

Descriptors: Structural Equation Models, Test Items, Models, Diagnostic Tests

Have Cognitive Diagnostic Models Delivered Their Goods? Some Substantial and Methodological Concerns

Peer reviewed

Direct link

Wilhelm, Oliver; Robitzsch, Alexander – Measurement: Interdisciplinary Research and Perspectives, 2009

The paper by Rupp and Templin (2008) is an excellent work on the characteristics and features of cognitive diagnostic models (CDM). In this article, the authors comment on some substantial and methodological aspects of this focus paper. They organize their comments by going through issues associated with the terms "cognitive,"…

Descriptors: Research Methodology, Test Items, Models, Diagnostic Tests

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Diagnostic Classification Models: Which One Should I Use?

Peer reviewed

Direct link

Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009

Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…

Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Does Theory Drive the Items or Do Items Drive the Theory?

Peer reviewed

Direct link

Talbot, Robert M.; Briggs, Derek C. – Measurement: Interdisciplinary Research and Perspectives, 2007

At the core of the argument-based approach to test validation as it has been presented by Kane (1992, 2004, 2006) is a relatively simple premise: test validity is demonstrated by linking the score that is observed from a test instrument to the use of that score for some subsequent inference. Details, however, are not so simple: How does one craft…

Descriptors: Test Validity, Inferences, Knowledge Base for Teaching, Mathematics Education

A Manifesto on Psychology as Idiographic Science: Bringing the Person Back into Scientific Psychology, This Time Forever

Peer reviewed

Direct link

Molenaar, Peter C. M. – Measurement: Interdisciplinary Research and Perspectives, 2004

Psychology is focused on variation between cases (interindividual variation). Results thus obtained are considered to be generalizable to the understanding and explanation of variation within single cases (intraindividual variation). It is indicated, however, that the direct consequences of the classical ergodic theorems for psychology and…

Descriptors: Psychology, Psychometrics, Developmental Psychology, Personality Theories