Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 13 |
Since 2016 (last 10 years) | 32 |
Since 2006 (last 20 years) | 62 |
Descriptor
Hierarchical Linear Modeling | 63 |
Item Response Theory | 63 |
Foreign Countries | 15 |
Scores | 15 |
Comparative Analysis | 14 |
Computation | 13 |
Correlation | 12 |
Simulation | 12 |
Test Bias | 12 |
Test Items | 12 |
Models | 11 |
Author
Cho, Sun-Joo | 3 |
Albano, Anthony D. | 2 |
Beretvas, S. Natasha | 2 |
Bottge, Brian A. | 2 |
Fox, Jean-Paul | 2 |
Jiao, Hong | 2 |
Kamata, Akihito | 2 |
Kara, Yusuf | 2 |
Quesen, Sarah | 2 |
Sijia Huang | 2 |
von Davier, Matthias | 2 |
Publication Type
Reports - Research | 51 |
Journal Articles | 49 |
Dissertations/Theses -… | 9 |
Reports - Descriptive | 2 |
Speeches/Meeting Papers | 2 |
Tests/Questionnaires | 2 |
Reports - Evaluative | 1 |
Location
Germany | 3 |
Netherlands | 2 |
Texas | 2 |
Canada | 1 |
Colorado | 1 |
Florida | 1 |
Greece | 1 |
Iran | 1 |
Italy | 1 |
Kentucky (Louisville) | 1 |
Malaysia | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Sijia Huang; Dubravka Svetina Valdivia – Educational and Psychological Measurement, 2024
Identifying items with differential item functioning (DIF) in an assessment is a crucial step for achieving equitable measurement. One critical issue that has not been fully addressed with existing studies is how DIF items can be detected when data are multilevel. In the present study, we introduced a Lord's Wald χ² test-based…
Descriptors: Item Analysis, Item Response Theory, Algorithms, Accuracy
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Cross-Classified Item Response Theory Modeling with an Application to Student Evaluation of Teaching
Sijia Huang; Li Cai – Journal of Educational and Behavioral Statistics, 2024
The cross-classified data structure is ubiquitous in education, psychology, and health outcome sciences. In these areas, assessment instruments that are made up of multiple items are frequently used to measure latent constructs. The presence of both the cross-classified structure and multivariate categorical outcomes leads to the so-called…
Descriptors: Classification, Data Collection, Data Analysis, Item Response Theory
Kara, Yusuf; Kamata, Akihito – Journal of Experimental Education, 2022
Within-cluster variance homogeneity is one of the key assumptions of multilevel models; however, assuming a constant (i.e. equal) within-cluster variance may not be realistic. Moreover, existent within-cluster variance heterogeneity should be regarded as a source of additional information rather than a violation of a model assumption. This study…
Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Item Response Theory, Multivariate Analysis
Carmen Köhler; Lale Khorramdel; Artur Pokropek; Johannes Hartig – Journal of Educational Measurement, 2024
For assessment scales applied to different groups (e.g., students from different states; patients in different countries), multigroup differential item functioning (MG-DIF) needs to be evaluated in order to ensure that respondents with the same trait level but from different groups have equal response probabilities on a particular item. The…
Descriptors: Measures (Individuals), Test Bias, Models, Item Response Theory
Casabianca, Jodi M. – Educational Measurement: Issues and Practice, 2021
Module Overview: In this digital ITEMS module, Dr. Jodi M. Casabianca provides a primer on the "hierarchical rater model" (HRM) framework and the recent expansions to the model for analyzing raters and ratings of constructed responses. In the first part of the module, she establishes an understanding of the nature of constructed…
Descriptors: Hierarchical Linear Modeling, Rating Scales, Error of Measurement, Item Response Theory
Gertrudes Velasquez – ProQuest LLC, 2021
This study introduces a longitudinal diagnostic classification model, called the LTA+HDCM, which is a fusion of latent transition analysis (LTA; Collins & Flaherty, 2002; Collins & Wugalter, 1992) and the hierarchical diagnostic classification model (HDCM; Templin & Bradshaw, 2014). The primary goals in this study are (1) to evaluate…
Descriptors: Learning Trajectories, Measurement, Longitudinal Studies, Research Design
Fox, Jean-Paul; Wenzel, Jeremias; Klotzke, Konrad – Journal of Educational and Behavioral Statistics, 2021
Standard item response theory (IRT) models have been extended with testlet effects to account for the nesting of items; these are well known as (Bayesian) testlet models or random effect models for testlets. The testlet modeling framework has several disadvantages. A sufficient number of testlet items are needed to estimate testlet effects, and a…
Descriptors: Bayesian Statistics, Tests, Item Response Theory, Hierarchical Linear Modeling
Wang, Yan; Kim, Eunsook; Joo, Seang-Hwane; Chun, Seokjoon; Alamri, Abeer; Lee, Philseok; Stark, Stephen – Journal of Experimental Education, 2022
Multilevel latent class analysis (MLCA) has been increasingly used to investigate unobserved population heterogeneity while taking into account data dependency. Nonparametric MLCA has gained much popularity due to the advantage of classifying both individuals and clusters into latent classes. This study demonstrated the need to relax the…
Descriptors: Nonparametric Statistics, Hierarchical Linear Modeling, Monte Carlo Methods, Simulation
Nagy, Gabriel; Ulitzsch, Esther – Educational and Psychological Measurement, 2022
Disengaged item responses pose a threat to the validity of the results provided by large-scale assessments. Several procedures for identifying disengaged responses on the basis of observed response times have been suggested, and item response theory (IRT) models for response engagement have been proposed. We outline that response time-based…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Predictor Variables, Classification
Xue Zhang; Chun Wang – Grantee Submission, 2021
Among current state-of-art estimation methods for multilevel IRT models, the two-stage divide-and-conquer strategy has practical advantages, such as clearer definition of factors, convenience for secondary data analysis, convenience for model calibration and fit evaluation, and avoidance of improper solutions. However, various studies have shown…
Descriptors: Error of Measurement, Error Correction, Item Response Theory, Comparative Analysis
Lee, Hyung Rock; Lee, Sunbok; Sung, Jaeyun – International Journal of Assessment Tools in Education, 2019
Applying single-level statistical models to multilevel data typically produces underestimated standard errors, which may result in misleading conclusions. This study examined the impact of ignoring multilevel data structure on the estimation of item parameters and their standard errors of the Rasch, two-, and three-parameter logistic models in…
Descriptors: Item Response Theory, Computation, Error of Measurement, Test Bias
Fan Pan – ProQuest LLC, 2021
This dissertation informed researchers about the performance of different level-specific and target-specific model fit indices in Multilevel Latent Growth Model (MLGM) using unbalanced design and different trajectories. As the use of MLGMs is a relatively new field, this study helped further the field by informing researchers interested in using…
Descriptors: Goodness of Fit, Item Response Theory, Growth Models, Monte Carlo Methods
Sales, Adam; Prihar, Ethan; Heffernan, Neil; Pane, John F. – International Educational Data Mining Society, 2021
This paper drills deeper into the documented effects of the Cognitive Tutor Algebra I and ASSISTments intelligent tutoring systems by estimating their effects on specific problems. We start by describing a multilevel Rasch-type model that facilitates testing for differences in the effects between problems and precise problem-specific effect…
Descriptors: Intelligent Tutoring Systems, Academic Achievement, Educational Technology, Algebra
Kara, Yusuf; Kamata, Akihito – Educational Sciences: Theory and Practice, 2017
A multilevel Rasch model using a hierarchical generalized linear model is one approach to multilevel item response theory (IRT) modeling and is referred to as a one-parameter hierarchical generalized linear logistic model (1-P HGLLM). Although it has the flexibility to model nested structure of data with covariates, the model assumes the normality…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Statistical Distributions, Computation