Thompson, W. Jake; Clark, Amy K. – Educational Measurement: Issues and Practice, 2024
In recent years, educators, administrators, policymakers, and measurement experts have called for assessments that support educators in making better instructional decisions. One promising measurement approach for supporting instructional decision-making is the diagnostic classification model (DCM). DCMs are flexible psychometric models that…
Descriptors: Decision Making, Instructional Improvement, Evaluation Methods, Models
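The snippet names no particular DCM, so as a hedged illustration only: the DINA model, one of the simplest DCMs, classifies examinee l by a binary attribute-mastery vector and models item j as

\[ P(X_{lj} = 1 \mid \boldsymbol{\alpha}_l) = (1 - s_j)^{\eta_{lj}} \, g_j^{1 - \eta_{lj}}, \qquad \eta_{lj} = \prod_{k=1}^{K} \alpha_{lk}^{q_{jk}}, \]

where q_{jk} indicates via the Q-matrix whether item j requires attribute k, and s_j and g_j are slip and guessing parameters. Classifying examinees on discrete attributes rather than a continuous ability is what makes DCM feedback directly actionable for instruction.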
Butterfuss, Reese; Doran, Harold – Educational Measurement: Issues and Practice, 2025
Large language models are increasingly used in educational and psychological measurement activities. Their rapidly evolving sophistication and ability to detect language semantics make them viable tools for supplementing subject matter experts in reviewing large collections of text statements, such as educational content standards. This paper…
Descriptors: Alignment (Education), Academic Standards, Content Analysis, Concept Mapping
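The snippet does not describe the paper's method; as a minimal sketch of the semantic matching such reviews typically rely on, suppose each content-standard statement has been mapped to an embedding vector by some language model. Two statements u and v can then be compared by cosine similarity,

\[ \operatorname{sim}(\mathbf{u}, \mathbf{v}) = \frac{\mathbf{u} \cdot \mathbf{v}}{\lVert \mathbf{u} \rVert \, \lVert \mathbf{v} \rVert}, \]

and pairs scoring above a chosen threshold could be flagged for confirmation by subject matter experts rather than reviewed exhaustively by hand. The embedding model and threshold here are assumptions for illustration, not details from the paper.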
Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022
In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction that only a single ability is assessed. This module discusses potential sources of multidimensionality…
Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods
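For context (not quoted from the module): a common compensatory MIRT model, the multidimensional 2PL, replaces the single ability with a vector of abilities, giving the item response function

\[ P(X_{ij} = 1 \mid \boldsymbol{\theta}_i) = \frac{\exp(\mathbf{a}_j^{\top} \boldsymbol{\theta}_i + d_j)}{1 + \exp(\mathbf{a}_j^{\top} \boldsymbol{\theta}_i + d_j)}, \]

where a_j is a vector of discrimination parameters and d_j an intercept. Equating under such a model must align the whole multidimensional ability scale across test forms, not just a single trait.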
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
Measurement specialists have demonstrated that the Cross-Classified Mixed Effects Model (CCMEM) is a flexible framework for evaluating reliability. Reliability can be estimated from the variance components of the test scores. Building on that work, this study extends the CCMEM to the evaluation of validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
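One common cross-classified formulation (a sketch, not necessarily the authors' exact specification) decomposes the score of person p on item i as

\[ Y_{pi} = \mu + u_p + v_i + e_{pi}, \qquad u_p \sim N(0, \sigma_u^2), \; v_i \sim N(0, \sigma_v^2), \; e_{pi} \sim N(0, \sigma_e^2), \]

from which a generalizability-style reliability coefficient for a k-item test can be formed, for example \( \sigma_u^2 / \bigl( \sigma_u^2 + (\sigma_v^2 + \sigma_e^2)/k \bigr) \) for absolute decisions.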
Ma, Wenchao; de la Torre, Jimmy – Educational Measurement: Issues and Practice, 2019
In this ITEMS module, we introduce the generalized deterministic inputs, noisy "and" gate (G-DINA) model, which is a general framework for specifying, estimating, and evaluating a wide variety of cognitive diagnosis models. The module contains a nontechnical introduction to diagnostic measurement, an introductory overview of the G-DINA…
Descriptors: Models, Classification, Measurement, Identification
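As commonly written with the identity link, the G-DINA model expresses the success probability for the reduced attribute vector (the K*_j attributes item j requires) as

\[ P(\boldsymbol{\alpha}^{*}_{lj}) = \delta_{j0} + \sum_{k=1}^{K^{*}_{j}} \delta_{jk} \alpha_{lk} + \sum_{k=1}^{K^{*}_{j}-1} \sum_{k' = k+1}^{K^{*}_{j}} \delta_{jkk'} \alpha_{lk} \alpha_{lk'} + \cdots + \delta_{j12\cdots K^{*}_{j}} \prod_{k=1}^{K^{*}_{j}} \alpha_{lk}, \]

where δ_j0 is an intercept, the δ_jk are attribute main effects, and the remaining δ terms are interactions; constraining these parameters recovers DINA, DINO, and other familiar diagnostic models as special cases.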
Liu, Ou Lydia – Educational Measurement: Issues and Practice, 2017
Student learning outcomes assessment has been increasingly used in U.S. higher education institutions over the last 10 years, partly fueled by the recommendation from the Spellings Commission that institutions need to demonstrate more direct evidence of student learning. To respond to the Commission's call, various accountability initiatives have…
Descriptors: College Outcomes Assessment, Accountability, Higher Education, Educational Improvement
Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016
The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…
Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models
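As a sketch of the core idea (the module's exact models are not shown in the snippet), a mixture Rasch model lets item difficulty vary across latent classes g = 1, …, G:

\[ P(X_{ij} = 1 \mid \theta_i, g) = \frac{\exp(\theta_i - b_{jg})}{1 + \exp(\theta_i - b_{jg})}. \]

DIF for item j then corresponds to b_{jg} differing across latent classes, which are estimated from the response patterns themselves rather than fixed in advance by manifest characteristics such as gender.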
Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2017
Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) that can be used to evaluate fundamental measurement properties with less strict assumptions than parametric IRT models. This instructional module provides an introduction to MSA as a probabilistic-nonparametric framework in which to explore…
Descriptors: Probability, Nonparametric Statistics, Item Response Theory, Scaling
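A central MSA statistic (stated here for context, not quoted from the module) is Loevinger's scalability coefficient for an item pair,

\[ H_{ij} = 1 - \frac{F_{ij}}{E_{ij}}, \]

where F_{ij} is the observed number of Guttman errors for items i and j and E_{ij} is the number expected under marginal independence; a common rule of thumb requires item and scale coefficients of at least 0.3 for a usable Mokken scale.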
Ames, Allison J.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2015
Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing…
Descriptors: Item Response Theory, Goodness of Fit, Models, Evaluation Methods
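One standard item-fit statistic of the kind such overviews cover (a sketch, not necessarily the authors' chosen method) is Yen's Q1, which compares observed and model-implied proportions correct across ability-ordered groups:

\[ Q_{1j} = \sum_{g=1}^{10} \frac{N_g \, (O_{gj} - E_{gj})^2}{E_{gj} (1 - E_{gj})}, \]

where O_{gj} and E_{gj} are the observed and expected proportions correct on item j in group g of size N_g; large values relative to a chi-square reference flag misfitting items.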

Benson, Jeri – Educational Measurement: Issues and Practice, 1998
Explores how a program of strong construct validation could be applied to the assessment of the construct of test anxiety, paying special attention to substantive, structural, and external aspects of construct validation. A framework is proposed to pull together various statistical methods used in construct validation research into an organized…
Descriptors: Construct Validity, Evaluation Methods, Models, Program Development

Madaus, George F. – Educational Measurement: Issues and Practice, 1992
The need for an independent mechanism that regulates, or audits, the testing enterprise is discussed along with a critique of current mechanisms for challenging a high-stakes test or its use and the need for independent auditing of the commercial test industry. Models for an auditing mechanism are reviewed. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Evaluation Methods, Higher Education
Gierl, Mark J. – Educational Measurement: Issues and Practice, 2005
In this paper I describe and illustrate the Roussos-Stout (1996) multidimensionality-based DIF analysis paradigm, with emphasis on its implication for the selection of a matching and studied subtest for DIF analyses. Standard DIF practice encourages an exploratory search for matching subtest items based on purely statistical criteria, such as a…
Descriptors: Models, Test Items, Test Bias, Statistical Analysis
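In the multidimensionality-based framing (a compact restatement, not a quote), an item may measure the intended ability θ plus a nuisance dimension η; DIF then appears in the marginal comparison

\[ P(X_j = 1 \mid \theta, G = R) \ne P(X_j = 1 \mid \theta, G = F) \]

whenever the reference and focal groups differ on η given θ, even though the item's response function given both dimensions is group-invariant. This is why the choice of matching subtest, which fixes what θ is taken to be, matters so much.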

Nitko, Anthony J. – Educational Measurement: Issues and Practice, 1995
If curriculum is to be the basis for assessment reform, assessment specialists must model the process for producing valid assessment products. Validity criteria should guide any model for the assessment development process. However, curriculum-based assessment systems should not be confused with standards-driven assessment systems. (SLD)
Descriptors: Criteria, Curriculum Based Assessment, Educational Change, Evaluation Methods

Sugrue, Brenda – Educational Measurement: Issues and Practice, 1995
The author suggests a more fragmented approach to the assessment of global ability concepts than is generally advocated, based on the assumption that decomposing a complex ability into cognitive components and tracking performance across multiple measures will yield valid and instructionally useful information. Specifications are suggested for designing…
Descriptors: Ability, Educational Assessment, Educational Theories, Evaluation Methods

Goldstein, Harvey – Educational Measurement: Issues and Practice, 1994
This article examines how psychometric models based on certain assumptions have come to be used counterproductively by many practitioners in ways that limit the kinds of conclusions that can be made. The general problem of the context's influence on performance is discussed, and some implications are drawn. (SLD)
Descriptors: Context Effect, Educational Research, Evaluation Methods, Measurement Techniques