NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 19 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
W. Jake Thompson; Amy K. Clark – Educational Measurement: Issues and Practice, 2024
In recent years, educators, administrators, policymakers, and measurement experts have called for assessments that support educators in making better instructional decisions. One promising approach to measurement to support instructional decision-making is diagnostic classification models (DCMs). DCMs are flexible psychometric models that…
Descriptors: Decision Making, Instructional Improvement, Evaluation Methods, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Reese Butterfuss; Harold Doran – Educational Measurement: Issues and Practice, 2025
Large language models are increasingly used in educational and psychological measurement activities. Their rapidly evolving sophistication and ability to detect language semantics make them viable tools to supplement subject matter experts and their reviews of large amounts of text statements, such as educational content standards. This paper…
Descriptors: Alignment (Education), Academic Standards, Content Analysis, Concept Mapping
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022
In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…
Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Yuan, Lu; Liu, Yanlou; Chen, Ping; Xin, Tao – Educational Measurement: Issues and Practice, 2022
Learning progressions can reflect students' continuous in-depth thinking development paths, and their establishment is an iterative process from the construction of hypothetical learning progressions to the verification of that hypotheses. Considering the limitations of the existing verification method of learning progressions based on a rule…
Descriptors: Grade 5, Mathematics Instruction, Fractions, Elementary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Ma, Wenchao; de la Torre, Jimmy – Educational Measurement: Issues and Practice, 2019
In this ITEMS module, we introduce the generalized deterministic inputs, noisy "and" gate (G-DINA) model, which is a general framework for specifying, estimating, and evaluating a wide variety of cognitive diagnosis models. The module contains a nontechnical introduction to diagnostic measurement, an introductory overview of the G-DINA…
Descriptors: Models, Classification, Measurement, Identification
Peer reviewed Peer reviewed
Direct linkDirect link
Leventhal, Brian; Ames, Allison – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Brian Leventhal and Dr. Allison Ames provide an overview of "Monte Carlo simulation studies" (MCSS) in "item response theory" (IRT). MCSS are utilized for a variety of reasons, one of the most compelling being that they can be used when analytic solutions are impractical or nonexistent because…
Descriptors: Item Response Theory, Monte Carlo Methods, Simulation, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Gordon, Edmund W. – Educational Measurement: Issues and Practice, 2020
Drawing upon his experience, more than 60 years ago, as a psychometric support person to a very special teacher of brain damaged children, the author of this article reflects on the productive use of educational assessments and data from them to educate - assessment in the service of learning. Findings from the Gordon Commission on the Future of…
Descriptors: Psychometrics, Student Evaluation, Special Education Teachers, Educational Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Liu, Ou Lydia – Educational Measurement: Issues and Practice, 2017
Student learning outcomes assessment has been increasingly used in U.S. higher education institutions over the last 10 years, partly fueled by the recommendation from the Spellings Commission that institutions need to demonstrate more direct evidence of student learning. To respond to the Commission's call, various accountability initiatives have…
Descriptors: College Outcomes Assessment, Accountability, Higher Education, Educational Improvement
Peer reviewed Peer reviewed
Direct linkDirect link
Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016
The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…
Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2017
Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) that can be used to evaluate fundamental measurement properties with less strict assumptions than parametric IRT models. This instructional module provides an introduction to MSA as a probabilistic-nonparametric framework in which to explore…
Descriptors: Probability, Nonparametric Statistics, Item Response Theory, Scaling
Peer reviewed Peer reviewed
Direct linkDirect link
Ames, Allison J.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2015
Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing…
Descriptors: Item Response Theory, Goodness of Fit, Models, Evaluation Methods
Peer reviewed Peer reviewed
Benson, Jeri – Educational Measurement: Issues and Practice, 1998
Explores how a program of strong construct validation could be applied to the assessment of the construct of test anxiety, paying special attention to substantive, structural, and external aspects of construct validation. A framework is proposed to pull together various statistical methods used in construct validation research into an organized…
Descriptors: Construct Validity, Evaluation Methods, Models, Program Development
Peer reviewed Peer reviewed
Direct linkDirect link
Martineau, Joseph; Paek, Pamela; Keene, John; Hirsch, Thomas – Educational Measurement: Issues and Practice, 2007
This paper describes a comprehensive model of alignment that provides a foundation for meaningful reporting of students' academic progress over time. The model includes both horizontal and vertical alignment as integral parts of the development of content standards, test blueprints, items, item pools, instruments, performance level descriptors,…
Descriptors: Academic Achievement, Student Evaluation, Cognitive Measurement, Models
Peer reviewed Peer reviewed
Madaus, George F. – Educational Measurement: Issues and Practice, 1992
The need for an independent mechanism that regulates, or audits, the testing enterprise is discussed along with a critique of current mechanisms for challenging a high-stakes test or its use and the need for independent auditing of the commercial test industry. Models for an auditing mechanism are reviewed. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Evaluation Methods, Higher Education
Previous Page | Next Page ยป
Pages: 1  |  2