Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 29 |
Since 2006 (last 20 years) | 73 |
Descriptor
Models | 82 |
Scores | 82 |
Simulation | 64 |
Item Response Theory | 31 |
Comparative Analysis | 21 |
Statistical Analysis | 20 |
Computer Simulation | 19 |
Evaluation Methods | 18 |
Correlation | 15 |
Foreign Countries | 14 |
Test Items | 14 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 61 |
Reports - Research | 51 |
Reports - Evaluative | 13 |
Dissertations/Theses -… | 8 |
Reports - Descriptive | 6 |
Collected Works - Proceedings | 3 |
Speeches/Meeting Papers | 2 |
Tests/Questionnaires | 1 |
Education Level
Audience
Researchers | 3 |
Location
Pennsylvania | 3 |
Spain | 3 |
Brazil | 2 |
California | 2 |
Germany | 2 |
Greece | 2 |
Portugal | 2 |
Arkansas | 1 |
Asia | 1 |
Australia | 1 |
Canada | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Sijia Huang; Seungwon Chung; Carl F. Falk – Journal of Educational Measurement, 2024
In this study, we introduced a cross-classified multidimensional nominal response model (CC-MNRM) to account for various response styles (RS) in the presence of cross-classified data. The proposed model allows slopes to vary across items and can explore impacts of observed covariates on latent constructs. We applied a recently developed variant of…
Descriptors: Response Style (Tests), Classification, Data, Models
Emma Somer; Carl Falk; Milica Miocevic – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Factor Score Regression (FSR) is increasingly employed as an alternative to structural equation modeling (SEM) in small samples. Despite its popularity in psychology, the performance of FSR in multigroup models with small samples remains relatively unknown. The goal of this study was to examine the performance of FSR, namely Croon's correction and…
Descriptors: Scores, Structural Equation Models, Comparative Analysis, Sample Size
Sosa, Ricardo; van Dijck, Max – Creativity Research Journal, 2022
The distinction between "Big-C" and "little-c" creativity implies that the generative process of celebrated creators is of a special type or degree. Arguments for and against such a hierarchy of creativity are found in the literature, primarily built on rhetorical argumentation. The aim of this work is to examine the rationale…
Descriptors: Creativity, Computer Simulation, Models, Social Systems
Liu, Jin – Journal of Educational and Behavioral Statistics, 2022
Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the…
Descriptors: Longitudinal Studies, Individual Differences, Scores, Models
Egamaria Alacam; Craig K. Enders; Han Du; Brian T. Keller – Grantee Submission, 2023
Composite scores are an exceptionally important psychometric tool for behavioral science research applications. A prototypical example occurs with self-report data, where researchers routinely use questionnaires with multiple items that tap into different features of a target construct. Item-level missing data are endemic to composite score…
Descriptors: Regression (Statistics), Scores, Psychometrics, Test Items
Reed, Janet M. – ProQuest LLC, 2022
Research literature provides evidence that new graduate nurses are often deficient in clinical judgment (CJ). One way to increase CJ is by using simulations. However, the literature is replete with descriptions of the high anxiety that simulation triggers. It is not currently known how anxiety in simulation affects clinical judgment for…
Descriptors: Nurses, Decision Making, Anxiety, Evidence
Zoran Sevarac; Jelena Jovanovic; Vladan Devedzic; Bojan Tomic – Interactive Learning Environments, 2023
The paper proposes EXPLODE, a new model of exploratory learning environment for teaching and learning neural networks. The EXPLODE model is about pedagogically instrumenting a software development environment to transform it into an exploratory learning environment for neural networks. Such an environment is particularly aimed for students who are…
Descriptors: Models, Discovery Learning, Artificial Intelligence, Computer Simulation
Leventhal, Brian; Ames, Allison – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Brian Leventhal and Dr. Allison Ames provide an overview of "Monte Carlo simulation studies" (MCSS) in "item response theory" (IRT). MCSS are utilized for a variety of reasons, one of the most compelling being that they can be used when analytic solutions are impractical or nonexistent because…
Descriptors: Item Response Theory, Monte Carlo Methods, Simulation, Test Items
Robert-Mihai Botarleanu; Micah Watanabe; Mihai Dascalu; Scott A. Crossley; Danielle S. McNamara – International Journal of Artificial Intelligence in Education, 2024
Age of Acquisition (AoA) scores approximate the age at which a language speaker fully understands a word's semantic meaning and represent a quantitative measure of the relative difficulty of words in a language. AoA word lists exist across various languages, with English having the most complete lists that capture the largest percentage of the…
Descriptors: Multilingualism, English (Second Language), Second Language Learning, Second Language Instruction
Robert-Mihai Botarleanu; Micah Watanabe; Mihai Dascalu; Scott A. Crossley; Danielle S. McNamara – Grantee Submission, 2023
Age of Acquisition (AoA) scores approximate the age at which a language speaker fully understands a word's semantic meaning and represent a quantitative measure of the relative difficulty of words in a language. AoA word lists exist across various languages, with English having the most complete lists that capture the largest percentage of the…
Descriptors: Multilingualism, English (Second Language), Second Language Learning, Second Language Instruction
Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019
Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…
Descriptors: Item Response Theory, Models, Scores, Comparative Analysis
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
James Soland; Megan Kuhfeld – Annenberg Institute for School Reform at Brown University, 2020
Survey respondents use different response styles when they use the categories of the Likert scale differently despite having the same true score on the construct of interest. For example, respondents may be more likely to use the extremes of the response scale independent of their true score. Research already shows that differing response styles…
Descriptors: Social Emotional Learning, Scores, Likert Scales, Surveys
Dalessio, Samantha J.; Carlino, Nancy; Barnum, Mary G.; Joseph, Denise; Sovak, Melissa M. – Teaching and Learning in Communication Sciences & Disorders, 2021
Purpose: The purpose of this study was to investigate the effect of the supervision-questioning-feedback (SQF) model of supervision on critical thinking in graduate students studying speech-language pathology. The researchers hypothesized that students who were provided with the SQF model of supervision would score higher than students who…
Descriptors: Supervision, Questioning Techniques, Feedback (Response), Models