ERIC - Search Results

Showing 1 to 15 of 24 results

Assessing Disparities in Predictive Modeling Outcomes for College Student Success: The Impact of Imputation Techniques on Model Performance and Fairness

Peer reviewed

Nazanin Nezami; Parian Haghighat; Denisa Gándara; Hadis Anahideh – Grantee Submission, 2024

The education sector has been quick to recognize the power of predictive analytics to enhance student success rates. However, there are challenges to widespread adoption, including the lack of accessibility and the potential perpetuation of inequalities. These challenges present in different stages of modeling, including data preparation, model…

Descriptors: Evaluation Methods, College Students, Success, Predictor Variables

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Opaque Prior Distributions in Bayesian Latent Variable Models

Peer reviewed
PDF on ERIC

Edgar C. Merkle; Oludare Ariyo; Sonja D. Winter; Mauricio Garnier-Villarreal – Grantee Submission, 2023

We review common situations in Bayesian latent variable models where the prior distribution that a researcher specifies differs from the prior distribution used during estimation. These situations can arise from the positive definite requirement on correlation matrices, from sign indeterminacy of factor loadings, and from order constraints on…

Descriptors: Models, Bayesian Statistics, Correlation, Evaluation Methods

Estimating the Reliability of Skill Transitions in Longitudinal Diagnostic Classification Models

Peer reviewed

Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024

Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…

Descriptors: Diagnostic Tests, Classification, Models, Psychometrics

Generalizability of Dynamic Fit Index, Equivalence Testing, and Hu & Bentler Cutoffs for Evaluating Fit in Factor Analysis

Peer reviewed

Daniel McNeish – Grantee Submission, 2023

Factor analysis is often used to model scales created to measure latent constructs, and internal structure validity evidence is commonly assessed with indices like SRMR, RMSEA, and CFI. These indices are essentially effect size measures and definitive benchmarks regarding which values connote reasonable fit have been elusive. Simulations from the…

Descriptors: Models, Testing, Indexes, Factor Analysis

Increasing Generalizability via the Principle of Minimum Description Length

Peer reviewed
PDF on ERIC

Bonifay, Wes – Grantee Submission, 2022

Traditional statistical model evaluation typically relies on goodness-of-fit testing and quantifying model complexity by counting parameters. Both of these practices may result in overfitting and have thereby contributed to the generalizability crisis. The information-theoretic principle of minimum description length addresses both of these…

Descriptors: Statistical Analysis, Models, Goodness of Fit, Evaluation Methods

Evaluating Methods for Assessing Model Fit in Diagnostic Classification Models

Peer reviewed

W. Jake Thompson – Grantee Submission, 2024

Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…

Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit

A Bayesian Latent Variable Selection Model for Nonignorable Missingness

Peer reviewed
PDF on ERIC

Du, Han; Enders, Craig; Keller, Brian; Bradbury, Thomas N.; Karney, Benjamin R. – Grantee Submission, 2022

Missing data are exceedingly common across a variety of disciplines, such as educational, social, and behavioral science areas. The missing not at random (MNAR) mechanism, in which missingness is related to unobserved data, is widespread in real data and has detrimental consequences. However, the existing MNAR-based methods have potential problems such as…

Descriptors: Bayesian Statistics, Data Analysis, Computer Simulation, Sample Size

Predictive Fit Metrics for Item Response Models

Peer reviewed
PDF on ERIC

Ben Stenhaug; Ben Domingue – Grantee Submission, 2022

The fit of an item response model is typically conceptualized as whether a given model could have generated the data. We advocate for an alternative view of fit, "predictive fit", based on the model's ability to predict new data. We derive two predictive fit metrics for item response models that assess how well an estimated item response…

Descriptors: Goodness of Fit, Item Response Theory, Prediction, Models

Automated Assessment of Comprehension Strategies from Self-Explanations Using LLMs

Peer reviewed
PDF on ERIC

Bogdan Nicula; Mihai Dascalu; Tracy Arner; Renu Balyan; Danielle S. McNamara – Grantee Submission, 2023

Text comprehension is an essential skill in today's information-rich world, and self-explanation practice helps students improve their understanding of complex texts. This study was centered on leveraging open-source Large Language Models (LLMs), specifically FLAN-T5, to automatically assess the comprehension strategies employed by readers while…

Descriptors: Reading Comprehension, Language Processing, Models, STEM Education

Ordinal Models to Analyze Strategy Sophistication: Evidence from a Learning Trajectory Efficacy Study

Peer reviewed
PDF on ERIC

T. S. Kutaka; P. Chernyavskiy; J. Sarama; D. H. Clements – Grantee Submission, 2023

Investigators often rely on the proportion of correct responses in an assessment when describing the impact of early mathematics interventions on child outcomes. Here, we propose a shift in focus to the relative sophistication of problem-solving strategies and offer methodological guidance to researchers interested in working with strategies. We…

Descriptors: Learning Trajectories, Problem Solving, Mathematics Instruction, Early Intervention

Using Lasso and Adaptive Lasso to Identify DIF in Multidimensional 2PL Models

Peer reviewed
PDF on ERIC

Chun Wang; Ruoyi Zhu; Gongjun Xu – Grantee Submission, 2022

Differential item functioning (DIF) analysis refers to procedures that evaluate whether an item's characteristics differ for different groups of persons after controlling for overall differences in performance. DIF is routinely evaluated as a screening step to ensure items behave the same across groups. Currently, the majority of DIF studies focus…

Descriptors: Models, Item Response Theory, Item Analysis, Comparative Analysis

The Operations Triad Model and Youth Mental Health Assessments: Catalyzing a Paradigm Shift in Measurement Validation

Peer reviewed
PDF on ERIC

Andres De Los Reyes; Mo Wang; Matthew D. Lerner; Bridget A. Makol; Olivia M. Fitzpatrick; John R. Weisz – Grantee Submission, 2022

Researchers strategically assess youth mental health by soliciting reports from multiple informants. Typically, these informants (e.g., parents, teachers, youth themselves) vary in the social contexts where they observe youth. Decades of research reveal that the most common data conditions produced with this approach consist of discrepancies…

Descriptors: Mental Health, Measurement Techniques, Evaluation Methods, Research

An Evaluation of Statistical Methods for Aggregate Patterns of Replication Failure

Peer reviewed
PDF on ERIC

Jacob M. Schauer; Kaitlyn G. Fitzgerald; Sarah Peko-Spicer; Mena C. R. Whalen; Rrita Zejnullahi; Larry V. Hedges – Grantee Submission, 2021

Several programs of research have sought to assess the replicability of scientific findings in different fields, including economics and psychology. These programs attempt to replicate several findings and use the results to say something about large-scale patterns of replicability in a field. However, little work has been done to understand the…

Descriptors: Statistical Analysis, Research Methodology, Evaluation Methods, Replication (Evaluation)

The Needs-to-Goals Gap: How Informant Discrepancies in Youth Mental Health Assessments Impact Service Delivery

Peer reviewed
PDF on ERIC

Andres De Los Reyes; Elizabeth Talbott; Thomas J. Power; Jeremy J. Michel; Clayton R. Cook; Sarah J. Racz; Olivia Fitzpatrick – Grantee Submission, 2021

Over 60 years of research reveal that informants who observe youth in clinically relevant contexts (e.g., home, school)--typically parents, teachers, and youth clients themselves--often hold discrepant views about that client's needs for mental health services (i.e., "informant discrepancies"). The last 10 years of research reveal that…

Descriptors: Youth, Mental Health, Evaluation Methods, Measures (Individuals)
