Publication Date
In 2025 | 3 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 16 |
Descriptor
Evaluation Methods | 19 |
Models | 19 |
Simulation | 9 |
Item Response Theory | 7 |
Comparative Analysis | 6 |
Responses | 6 |
Bayesian Statistics | 5 |
Academic Achievement | 4 |
Computation | 4 |
Correlation | 4 |
Equations (Mathematics) | 4 |
More ▼ |
Source
Journal of Educational and… | 19 |
Author
Berger, Martijn P. F. | 1 |
Cho, Sun-Joo | 1 |
Cohen, Allan S. | 1 |
Doran, Harold C. | 1 |
George, Rani | 1 |
Huang, Hung-Yu | 1 |
Hung, Su-Pin | 1 |
James O. Ramsay | 1 |
Jean-Paul Fox | 1 |
Jo, Booil | 1 |
Joakim Wallmark | 1 |
More ▼ |
Publication Type
Journal Articles | 19 |
Reports - Research | 9 |
Reports - Descriptive | 5 |
Reports - Evaluative | 5 |
Education Level
Elementary Education | 2 |
Elementary Secondary Education | 1 |
Grade 1 | 1 |
Higher Education | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Location
California | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Longitudinal Study… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024
Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…
Descriptors: Item Response Theory, Test Items, Models, Scoring
Liu, Jin – Journal of Educational and Behavioral Statistics, 2022
Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the…
Descriptors: Longitudinal Studies, Individual Differences, Scores, Models
Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models
Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025
The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…
Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences
Thissen-Roe, Anne; Thissen, David – Journal of Educational and Behavioral Statistics, 2013
Extreme response set, the tendency to prefer the lowest or highest response option when confronted with a Likert-type response scale, can lead to misfit of item response models such as the generalized partial credit model. Recently, a series of intrinsically multidimensional item response models have been hypothesized, wherein tendency toward…
Descriptors: Likert Scales, Responses, Item Response Theory, Models
Ranger, Jochen; Kuhn, Jorg-Tobias – Journal of Educational and Behavioral Statistics, 2013
It is common practice to log-transform response times before analyzing them with standard factor analytical methods. However, sometimes the log-transformation is not capable of linearizing the relation between the response times and the latent traits. Therefore, a more general approach to response time analysis is proposed in the current…
Descriptors: Item Response Theory, Simulation, Reaction Time, Least Squares Statistics
Karl, Andrew T.; Yang, Yan; Lohr, Sharon L. – Journal of Educational and Behavioral Statistics, 2013
Value-added models have been widely used to assess the contributions of individual teachers and schools to students' academic growth based on longitudinal student achievement outcomes. There is concern, however, that ignoring the presence of missing values, which are common in longitudinal studies, can bias teachers' value-added scores.…
Descriptors: Evaluation Methods, Teacher Effectiveness, Academic Achievement, Achievement Gains
Strunk, Katharine O.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2010
The literature on teachers' unions is relatively silent about the role of union strength in affecting important outcomes, due in large part to the difficulty in measuring union strength. In this article, we illustrate a method for obtaining valid, reliable, and replicable measures of union strength through the use of a Partial Independence Item…
Descriptors: Collective Bargaining, Unions, Teaching Methods, Models
Cho, Sun-Joo; Cohen, Allan S. – Journal of Educational and Behavioral Statistics, 2010
Mixture item response theory models have been suggested as a potentially useful methodology for identifying latent groups formed along secondary, possibly nuisance dimensions. In this article, we describe a multilevel mixture item response theory (IRT) model (MMixIRTM) that allows for the possibility that this nuisance dimensionality may function…
Descriptors: Simulation, Mathematics Tests, Item Response Theory, Student Behavior
Passos, Valeria Lima; Berger, Martijn P. F.; Tan, Frans E. S. – Journal of Educational and Behavioral Statistics, 2008
During the early stage of computerized adaptive testing (CAT), item selection criteria based on Fisher"s information often produce less stable latent trait estimates than the Kullback-Leibler global information criterion. Robustness against early stage instability has been reported for the D-optimality criterion in a polytomous CAT with the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Evaluation Criteria, Item Analysis

Kaplan, David; George, Rani – Journal of Educational and Behavioral Statistics, 1998
The use of ex post (historical) simulation statistics as means of evaluating latent growth models is considered, and a variety of simulation quality statistics are applied to such models. Results illustrate the importance of using these measures as adjuncts to more traditional forms of model evaluation. (SLD)
Descriptors: Evaluation Methods, Models, Research Methodology, Simulation
Jo, Booil – Journal of Educational and Behavioral Statistics, 2008
An analytical approach was employed to compare sensitivity of causal effect estimates with different assumptions on treatment noncompliance and non-response behaviors. The core of this approach is to fully clarify bias mechanisms of considered models and to connect these models based on common parameters. Focusing on intention-to-treat analysis,…
Descriptors: Evaluation Methods, Intention, Research Methodology, Causal Models
Reckase, Mark D. – Journal of Educational and Behavioral Statistics, 2004
It is understandable that parents, policy makers, educators, etc. want to know how schools are functioning. Extensive resources are expended on the educational enterprise and it is only reasonable that the impact of those resources be determined. However, determining the amount of change in students' skills and knowledge is not easy. Further,…
Descriptors: Achievement Tests, Models, Evaluation Methods, Test Results
Previous Page | Next Page ยป
Pages: 1 | 2