Showing 1 to 15 of 42 results
Peer reviewed
Daoxuan Fu; Chunying Qin; Zhaosheng Luo; Yujun Li; Xiaofeng Yu; Ziyu Ye – Journal of Educational and Behavioral Statistics, 2025
One of the central components of cognitive diagnostic assessment is the Q-matrix, an essential loading indicator matrix that is typically constructed by subject matter experts. Nonetheless, to a large extent, the construction of the Q-matrix remains a subjective process and might lead to misspecifications. Many researchers have recognized the…
Descriptors: Q Methodology, Matrices, Diagnostic Tests, Cognitive Measurement
Peer reviewed
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Fager, Meghan L. – ProQuest LLC, 2019
Recent research in multidimensional item response theory has introduced within-item interaction effects between latent dimensions in the prediction of item responses. The objective of this study was to extend this research to bifactor models to include an interaction effect between the general and specific latent variables measured by an item.…
Descriptors: Test Items, Item Response Theory, Factor Analysis, Simulation
Xin, Xin – ProQuest LLC, 2017
The common practice for testing measurement invariance is to constrain parameters to be equal over groups, and then evaluate the model-data fit to reject or fail to reject the restrictive model. Posterior predictive checking (PPC) provides an alternative approach to evaluating model-data discrepancy. This paper explores the utility of PPC in…
Descriptors: Item Response Theory, Educational Assessment, Prediction, Models
Peer reviewed
Rupp, André A.; van Rijn, Peter W. – Measurement: Interdisciplinary Research and Perspectives, 2018
We review the GDINA and CDM packages in R for fitting cognitive diagnosis/diagnostic classification models. We first provide a summary of their core capabilities and then use both simulated and real data to compare their functionalities in practice. We found that the most relevant routines in the two packages appear to be more similar than…
Descriptors: Educational Assessment, Cognitive Measurement, Measurement, Computer Software
Bukhari, Nurliyana – ProQuest LLC, 2017
In general, newer educational assessments pose more demanding challenges than students are currently prepared to face. Two types of factors may contribute to test scores: (1) factors or dimensions that are of primary interest to the construct or test domain; and (2) factors or dimensions that are irrelevant to the construct, causing…
Descriptors: Item Response Theory, Models, Psychometrics, Computer Simulation
Peer reviewed
Oliveri, María Elena; Khan, Saad – Measurement: Interdisciplinary Research and Perspectives, 2014
María Oliveri and Saad Khan write that the article "How Task Features Impact Evidence from Assessments Embedded in Simulations and Games" provided helpful illustrations regarding the implementation of evidence-centered assessment design (Mislevy & Haertel, 2006; Mislevy, Steinberg, & Almond, 1999) with games and simulations.…
Descriptors: Task Analysis, Models, Educational Assessment, Word Problems (Mathematics)
Peer reviewed
Timms, Mike – Measurement: Interdisciplinary Research and Perspectives, 2014
In his commentary on "How Task Features Impact Evidence from Assessments Embedded in Simulations and Games" by Almond et al., Mike Timms writes that his own research has involved the use of embedded assessments using simulations in interactive learning environments, and the Evidence Centered Design (ECD) approach has provided a solid…
Descriptors: Task Analysis, Models, Educational Assessment, Simulation
Peer reviewed
Wilson, Mark; Gochyyev, Perman; Scalise, Kathleen – Journal of Educational Measurement, 2017
This article summarizes the assessment of cognitive skills through collaborative tasks, using field test results from the Assessment and Teaching of 21st Century Skills (ATC21S) project. This project, sponsored by Cisco, Intel, and Microsoft, aims to help educators around the world equip students with the skills to succeed in future career and…
Descriptors: Cognitive Ability, Thinking Skills, Evaluation Methods, Educational Assessment
Peer reviewed
Almond, Russell G.; Kim, Yoon Jeon; Velasquez, Gertrudes; Shute, Valerie J. – Measurement: Interdisciplinary Research and Perspectives, 2014
One of the key ideas of evidence-centered assessment design (ECD) is that task features can be deliberately manipulated to change the psychometric properties of items. ECD identifies a number of roles that task-feature variables can play, including determining the focus of evidence, guiding form creation, determining item difficulty and…
Descriptors: Educational Games, Simulation, Psychometrics, Educational Assessment
Crawford, Aaron – ProQuest LLC, 2014
This simulation study compared the utility of various discrepancy measures within a posterior predictive model checking (PPMC) framework for detecting different types of data-model misfit in multidimensional Bayesian network (BN) models. The investigated conditions were motivated by an applied research program utilizing an operational complex…
Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit
Peer reviewed
Koziol, Natalie A. – Applied Measurement in Education, 2016
Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…
Descriptors: Classification, Accuracy, Comparative Analysis, Models
Bulut, Okan – ProQuest LLC, 2013
The importance of subscores in educational and psychological assessments is undeniable. Subscores yield diagnostic information that can be used to determine how each examinee's abilities and skills vary across content domains. One of the most common criticisms of reporting and using subscores is their insufficient reliability.…
Descriptors: Item Response Theory, Simulation, Correlation, Reliability
Su, Yu-Lan – ProQuest LLC, 2013
This dissertation proposes two modified cognitive diagnostic models (CDMs), the deterministic, inputs, noisy, "and" gate with hierarchy (DINA-H) model and the deterministic, inputs, noisy, "or" gate with hierarchy (DINO-H) model. Both models incorporate the hierarchical structures of the cognitive skills in the model estimation…
Descriptors: Models, Diagnostic Tests, Cognitive Processes, Thinking Skills
Peer reviewed
Kang, Taehoon; Chen, Troy T. – Asia Pacific Education Review, 2011
The utility of Orlando and Thissen's (2000, 2003) S-X² fit index was extended to the model-fit analysis of the graded response model (GRM). The performance of a modified S-X² in assessing item fit of the GRM was investigated in light of empirical Type I error rates and power with a simulation study having…
Descriptors: Simulation, Item Response Theory, Models, Testing