Publication Date
In 2025 | 0 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 16 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 24 |
Descriptor
Evaluation Methods | 24 |
Models | 24 |
Psychometrics | 7 |
Item Response Theory | 6 |
Bayesian Statistics | 5 |
Accuracy | 4 |
Data Collection | 4 |
Goodness of Fit | 4 |
Statistical Analysis | 4 |
Comparative Analysis | 3 |
Error of Measurement | 3 |
More ▼ |
Source
Grantee Submission | 24 |
Author
Aleven, Vincent | 2 |
Andres De Los Reyes | 2 |
Brunskill, Emma | 2 |
Cai, Li | 2 |
Chun Wang | 2 |
Doroudi, Shayan | 2 |
Falk, Carl F. | 2 |
Gongjun Xu | 2 |
Angeline Gacad | 1 |
Barbara McMorris | 1 |
Ben Domingue | 1 |
More ▼ |
Publication Type
Reports - Research | 21 |
Journal Articles | 4 |
Speeches/Meeting Papers | 3 |
Reports - Evaluative | 2 |
Tests/Questionnaires | 2 |
Reports - Descriptive | 1 |
Education Level
Elementary Education | 2 |
Higher Education | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Kindergarten | 1 |
Primary Education | 1 |
Audience
Location
California | 1 |
Minnesota | 1 |
Spain | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Nazanin Nezami; Parian Haghighat; Denisa Gándara; Hadis Anahideh – Grantee Submission, 2024
The education sector has been quick to recognize the power of predictive analytics to enhance student success rates. However, there are challenges to widespread adoption, including the lack of accessibility and the potential perpetuation of inequalities. These challenges present in different stages of modeling, including data preparation, model…
Descriptors: Evaluation Methods, College Students, Success, Predictor Variables
Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…
Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis
Edgar C. Merkle; Oludare Ariyo; Sonja D. Winter; Mauricio Garnier-Villarreal – Grantee Submission, 2023
We review common situations in Bayesian latent variable models where the prior distribution that a researcher specifies differs from the prior distribution used during estimation. These situations can arise from the positive definite requirement on correlation matrices, from sign indeterminacy of factor loadings, and from order constraints on…
Descriptors: Models, Bayesian Statistics, Correlation, Evaluation Methods
Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024
Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
Daniel McNeish – Grantee Submission, 2023
Factor analysis is often used to model scales created to measure latent constructs, and internal structure validity evidence is commonly assessed with indices like SRMR, RMSEA, and CFI. These indices are essentially effect size measures and definitive benchmarks regarding which values connote reasonable fit have been elusive. Simulations from the…
Descriptors: Models, Testing, Indexes, Factor Analysis
Bonifay, Wes – Grantee Submission, 2022
Traditional statistical model evaluation typically relies on goodness-of-fit testing and quantifying model complexity by counting parameters. Both of these practices may result in overfitting and have thereby contributed to the generalizability crisis. The information-theoretic principle of minimum description length addresses both of these…
Descriptors: Statistical Analysis, Models, Goodness of Fit, Evaluation Methods

W. Jake Thompson – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…
Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit
Du, Han; Enders, Craig; Keller, Brian; Bradbury, Thomas N.; Karney, Benjamin R. – Grantee Submission, 2022
Missing data are exceedingly common across a variety of disciplines, such as educational, social, and behavioral science areas. Missing not at random (MNAR) mechanism where missingness is related to unobserved data is widespread in real data and has detrimental consequence. However, the existing MNAR-based methods have potential problems such as…
Descriptors: Bayesian Statistics, Data Analysis, Computer Simulation, Sample Size
Ben Stenhaug; Ben Domingue – Grantee Submission, 2022
The fit of an item response model is typically conceptualized as whether a given model could have generated the data. We advocate for an alternative view of fit, "predictive fit", based on the model's ability to predict new data. We derive two predictive fit metrics for item response models that assess how well an estimated item response…
Descriptors: Goodness of Fit, Item Response Theory, Prediction, Models
Bogdan Nicula; Mihai Dascalu; Tracy Arner; Renu Balyan; Danielle S. McNamara – Grantee Submission, 2023
Text comprehension is an essential skill in today's information-rich world, and self-explanation practice helps students improve their understanding of complex texts. This study was centered on leveraging open-source Large Language Models (LLMs), specifically FLAN-T5, to automatically assess the comprehension strategies employed by readers while…
Descriptors: Reading Comprehension, Language Processing, Models, STEM Education
T. S. Kutaka; P. Chernyavskiy; J. Sarama; D. H. Clements – Grantee Submission, 2023
Investigators often rely on the proportion of correct responses in an assessment when describing the impact of early mathematics interventions on child outcomes. Here, we propose a shift in focus to the relative sophistication of problem-solving strategies and offer methodological guidance to researchers interested in working with strategies. We…
Descriptors: Learning Trajectories, Problem Solving, Mathematics Instruction, Early Intervention
Chun Wang; Ruoyi Zhu; Gongjun Xu – Grantee Submission, 2022
Differential item functioning (DIF) analysis refers to procedures that evaluate whether an item's characteristic differs for different groups of persons after controlling for overall differences in performance. DIF is routinely evaluated as a screening step to ensure items behavior the same across groups. Currently, the majority DIF studies focus…
Descriptors: Models, Item Response Theory, Item Analysis, Comparative Analysis
Andres De Los Reyes; Mo Wang; Matthew D. Lerner; Bridget A. Makol; Olivia M. Fitzpatrick; John R. Weisz – Grantee Submission, 2022
Researchers strategically assess youth mental health by soliciting reports from multiple informants. Typically, these informants (e.g., parents, teachers, youth themselves) vary in the social contexts where they observe youth. Decades of research reveal that the most common data conditions produced with this approach consist of discrepancies…
Descriptors: Mental Health, Measurement Techniques, Evaluation Methods, Research
Jacob M. Schauer; Kaitlyn G. Fitzgerald; Sarah Peko-Spicer; Mena C. R. Whalen; Rrita Zejnullahi; Larry V. Hedges – Grantee Submission, 2021
Several programs of research have sought to assess the replicability of scientific findings in different fields, including economics and psychology. These programs attempt to replicate several findings and use the results to say something about large-scale patterns of replicability in a field. However, little work has been done to understand the…
Descriptors: Statistical Analysis, Research Methodology, Evaluation Methods, Replication (Evaluation)
Andres De Los Reyes; Elizabeth Talbott; Thomas J. Power; Jeremy J. Michel; Clayton R. Cook; Sarah J. Racz; Olivia Fitzpatrick – Grantee Submission, 2021
Over 60 years of research reveal that informants who observe youth in clinically relevant contexts (e.g., home, school)--typically parents, teachers, and youth clients themselves--often hold discrepant views about that client's needs for mental health services (i.e., "informant discrepancies"). The last 10 years of research reveal that…
Descriptors: Youth, Mental Health, Evaluation Methods, Measures (Individuals)
Previous Page | Next Page »
Pages: 1 | 2