ERIC - Search Results

Showing 1 to 15 of 19 results

Improving Instructional Decision-Making Using Diagnostic Classification Models

Peer reviewed

Thompson, W. Jake; Clark, Amy K. – Educational Measurement: Issues and Practice, 2024

In recent years, educators, administrators, policymakers, and measurement experts have called for assessments that support educators in making better instructional decisions. One promising approach to measurement to support instructional decision-making is diagnostic classification models (DCMs). DCMs are flexible psychometric models that…

Descriptors: Decision Making, Instructional Improvement, Evaluation Methods, Models

An Application of Text Embeddings to Support Alignment of Educational Content Standards

Peer reviewed

Butterfuss, Reese; Doran, Harold – Educational Measurement: Issues and Practice, 2025

Large language models are increasingly used in educational and psychological measurement activities. Their rapidly evolving sophistication and ability to detect language semantics make them viable tools to supplement subject matter experts and their reviews of large amounts of text statements, such as educational content standards. This paper…

Descriptors: Alignment (Education), Academic Standards, Content Analysis, Concept Mapping

Digital Module 29: Multidimensional Item Response Theory Equating

Peer reviewed

Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022

In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…

Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods

Development of a New Learning Progression Verification Method Based on the Hierarchical Diagnostic Classification Model: Taking Grade 5 Students' Fractional Operations as an Example

Peer reviewed

Yuan, Lu; Liu, Yanlou; Chen, Ping; Xin, Tao – Educational Measurement: Issues and Practice, 2022

Learning progressions can reflect students' continuous in-depth thinking development paths, and their establishment is an iterative process from the construction of hypothetical learning progressions to the verification of those hypotheses. Considering the limitations of the existing verification method of learning progressions based on a rule…

Descriptors: Grade 5, Mathematics Instruction, Fractions, Elementary School Students

Validation as Evaluating Desired and Undesired Effects: Insights from Cross-Classified Mixed Effects Model

Peer reviewed

Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023

Measurement specialists have demonstrated that the Cross-Classified Mixed Effects Model (CCMEM) is a flexible framework for evaluating reliability, which can be estimated from the variance components of the test scores. Building on that work, this study extends the CCMEM to the evaluation of validity evidence.…

Descriptors: Measurement, Validity, Reliability, Models

Digital Module 05: Diagnostic Measurement--The G-DINA Framework

Peer reviewed

Ma, Wenchao; de la Torre, Jimmy – Educational Measurement: Issues and Practice, 2019

In this ITEMS module, we introduce the generalized deterministic inputs, noisy "and" gate (G-DINA) model, which is a general framework for specifying, estimating, and evaluating a wide variety of cognitive diagnosis models. The module contains a nontechnical introduction to diagnostic measurement, an introductory overview of the G-DINA…

Descriptors: Models, Classification, Measurement, Identification

Digital Module 13: Monte Carlo Simulation Studies in Item Response Theory

Peer reviewed

Leventhal, Brian; Ames, Allison – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Brian Leventhal and Dr. Allison Ames provide an overview of "Monte Carlo simulation studies" (MCSS) in "item response theory" (IRT). MCSS are utilized for a variety of reasons, one of the most compelling being that they can be used when analytic solutions are impractical or nonexistent because…

Descriptors: Item Response Theory, Monte Carlo Methods, Simulation, Test Items

Toward Assessment in the Service of Learning

Peer reviewed

Gordon, Edmund W. – Educational Measurement: Issues and Practice, 2020

Drawing upon his experience, more than 60 years ago, as a psychometric support person to a very special teacher of brain-damaged children, the author of this article reflects on the productive use of educational assessments and data from them to educate: assessment in the service of learning. Findings from the Gordon Commission on the Future of…

Descriptors: Psychometrics, Student Evaluation, Special Education Teachers, Educational Assessment

Ten Years after the Spellings Commission: From Accountability to Internal Improvement

Peer reviewed

Liu, Ou Lydia – Educational Measurement: Issues and Practice, 2017

Student learning outcomes assessment has been increasingly used in U.S. higher education institutions over the last 10 years, partly fueled by the recommendation from the Spellings Commission that institutions need to demonstrate more direct evidence of student learning. To respond to the Commission's call, various accountability initiatives have…

Descriptors: College Outcomes Assessment, Accountability, Higher Education, Educational Improvement

An NCME Instructional Module on Latent DIF Analysis Using Mixture Item Response Models

Peer reviewed

Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016

The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…

Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models

An Instructional Module on Mokken Scale Analysis

Peer reviewed

Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2017

Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) that can be used to evaluate fundamental measurement properties with less strict assumptions than parametric IRT models. This instructional module provides an introduction to MSA as a probabilistic-nonparametric framework in which to explore…

Descriptors: Probability, Nonparametric Statistics, Item Response Theory, Scaling

An NCME Instructional Module on Item-Fit Statistics for Item Response Theory Models

Peer reviewed

Ames, Allison J.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2015

Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing…

Descriptors: Item Response Theory, Goodness of Fit, Models, Evaluation Methods

Developing a Strong Program of Construct Validation: A Test Anxiety Example

Peer reviewed

Benson, Jeri – Educational Measurement: Issues and Practice, 1998

Explores how a program of strong construct validation could be applied to the assessment of the construct of test anxiety, paying special attention to substantive, structural, and external aspects of construct validation. A framework is proposed to pull together various statistical methods used in construct validation research into an organized…

Descriptors: Construct Validity, Evaluation Methods, Models, Program Development

Integrated, Comprehensive Alignment as a Foundation for Measuring Student Progress

Peer reviewed

Martineau, Joseph; Paek, Pamela; Keene, John; Hirsch, Thomas – Educational Measurement: Issues and Practice, 2007

This paper describes a comprehensive model of alignment that provides a foundation for meaningful reporting of students' academic progress over time. The model includes both horizontal and vertical alignment as integral parts of the development of content standards, test blueprints, items, item pools, instruments, performance level descriptors,…

Descriptors: Academic Achievement, Student Evaluation, Cognitive Measurement, Models

An Independent Auditing Mechanism for Testing

Peer reviewed

Madaus, George F. – Educational Measurement: Issues and Practice, 1992

The need for an independent mechanism that regulates, or audits, the testing enterprise is discussed along with a critique of current mechanisms for challenging a high-stakes test or its use and the need for independent auditing of the commercial test industry. Models for an auditing mechanism are reviewed. (SLD)

Descriptors: Accountability, Elementary Secondary Education, Evaluation Methods, Higher Education
