ERIC - Search Results

Showing 1 to 15 of 19 results

Redefining Item Response Models for Small Samples

Peer reviewed

Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025

Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…

Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Jenss-Bayley Latent Change Score Model with Individual Ratio of the Growth Acceleration in the Framework of Individual Measurement Occasions

Peer reviewed

Liu, Jin – Journal of Educational and Behavioral Statistics, 2022

Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the…

Descriptors: Longitudinal Studies, Individual Differences, Scores, Models

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

Forced-Choice Ranking Models for Raters' Ranking Data

Peer reviewed

Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022

To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…

Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences

A Two-Decision Model for Responses to Likert-Type Items

Peer reviewed

Thissen-Roe, Anne; Thissen, David – Journal of Educational and Behavioral Statistics, 2013

Extreme response set, the tendency to prefer the lowest or highest response option when confronted with a Likert-type response scale, can lead to misfit of item response models such as the generalized partial credit model. Recently, a series of intrinsically multidimensional item response models have been hypothesized, wherein tendency toward…

Descriptors: Likert Scales, Responses, Item Response Theory, Models

Analyzing Response Times in Tests with Rank Correlation Approaches

Peer reviewed

Ranger, Jochen; Kuhn, Jorg-Tobias – Journal of Educational and Behavioral Statistics, 2013

It is common practice to log-transform response times before analyzing them with standard factor analytical methods. However, sometimes the log-transformation is not capable of linearizing the relation between the response times and the latent traits. Therefore, a more general approach to response time analysis is proposed in the current…

Descriptors: Item Response Theory, Simulation, Reaction Time, Least Squares Statistics

Peer reviewed

Karl, Andrew T.; Yang, Yan; Lohr, Sharon L. – Journal of Educational and Behavioral Statistics, 2013

Value-added models have been widely used to assess the contributions of individual teachers and schools to students' academic growth based on longitudinal student achievement outcomes. There is concern, however, that ignoring the presence of missing values, which are common in longitudinal studies, can bias teachers' value-added scores.…

Descriptors: Evaluation Methods, Teacher Effectiveness, Academic Achievement, Achievement Gains

Measuring the Strength of Teachers' Unions: An Empirical Application of the Partial Independence Item Response Approach

Peer reviewed

Strunk, Katharine O.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2010

The literature on teachers' unions is relatively silent about the role of union strength in affecting important outcomes, due in large part to the difficulty in measuring union strength. In this article, we illustrate a method for obtaining valid, reliable, and replicable measures of union strength through the use of a Partial Independence Item…

Descriptors: Collective Bargaining, Unions, Teaching Methods, Models

A Multilevel Mixture IRT Model with an Application to DIF

Peer reviewed

Cho, Sun-Joo; Cohen, Allan S. – Journal of Educational and Behavioral Statistics, 2010

Mixture item response theory models have been suggested as a potentially useful methodology for identifying latent groups formed along secondary, possibly nuisance dimensions. In this article, we describe a multilevel mixture item response theory (IRT) model (MMixIRTM) that allows for the possibility that this nuisance dimensionality may function…

Descriptors: Simulation, Mathematics Tests, Item Response Theory, Student Behavior

The D-Optimality Item Selection Criterion in the Early Stage of CAT: A Study with the Graded Response Model

Peer reviewed

Passos, Valeria Lima; Berger, Martijn P. F.; Tan, Frans E. S. – Journal of Educational and Behavioral Statistics, 2008

During the early stage of computerized adaptive testing (CAT), item selection criteria based on Fisher's information often produce less stable latent trait estimates than the Kullback-Leibler global information criterion. Robustness against early stage instability has been reported for the D-optimality criterion in a polytomous CAT with the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Evaluation Criteria, Item Analysis

Evaluating Latent Variable Growth Models through Ex Post Simulation

Peer reviewed

Kaplan, David; George, Rani – Journal of Educational and Behavioral Statistics, 1998

The use of ex post (historical) simulation statistics as means of evaluating latent growth models is considered, and a variety of simulation quality statistics are applied to such models. Results illustrate the importance of using these measures as adjuncts to more traditional forms of model evaluation. (SLD)

Descriptors: Evaluation Methods, Models, Research Methodology, Simulation

The Real World is More Complicated than We Would Like

Peer reviewed

Reckase, Mark D. – Journal of Educational and Behavioral Statistics, 2004

It is understandable that parents, policy makers, educators, etc. want to know how schools are functioning. Extensive resources are expended on the educational enterprise and it is only reasonable that the impact of those resources be determined. However, determining the amount of change in students' skills and knowledge is not easy. Further,…

Descriptors: Achievement Tests, Models, Evaluation Methods, Test Results

Bias Mechanisms in Intention-to-Treat Analysis with Data Subject to Treatment Noncompliance and Missing Outcomes

Peer reviewed

Jo, Booil – Journal of Educational and Behavioral Statistics, 2008

An analytical approach was employed to compare sensitivity of causal effect estimates with different assumptions on treatment noncompliance and non-response behaviors. The core of this approach is to fully clarify bias mechanisms of considered models and to connect these models based on common parameters. Focusing on intention-to-treat analysis,…

Descriptors: Evaluation Methods, Intention, Research Methodology, Causal Models
