ERIC - Search Results

Showing 1 to 15 of 23 results

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

On Longitudinal Item Response Theory Models: A Didactic

Peer reviewed

Wang, Chun; Nydick, Steven W. – Journal of Educational and Behavioral Statistics, 2020

Recent work on measuring growth with categorical outcome variables has combined the item response theory (IRT) measurement model with the latent growth curve model and extended the assessment of growth to multidimensional IRT models and higher order IRT models. However, there is a lack of synthetic studies that clearly evaluate the strength and…

Descriptors: Item Response Theory, Longitudinal Studies, Comparative Analysis, Models

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities

Peer reviewed

von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs), facilitating the use of new item types and more effective data collection tools. This allows the implementation of more complex test designs and the collection of process and response time (RT) data. These new data types can be used to…

Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory

Peer reviewed

Flynt, Abby; Dean, Nema – Journal of Educational and Behavioral Statistics, 2016

Cluster analysis is a set of statistical methods for discovering new group/class structure when exploring data sets. This article reviews the following popular libraries/commands in the R software language for applying different types of cluster analysis: from the stats library, the kmeans, and hclust functions; the mclust library; the poLCA…

Descriptors: Multivariate Analysis, Computer Software, Comparative Analysis, Programming Languages

A Strategy for Replacing Sum Scoring

Peer reviewed

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Posterior Predictive Checks for Conditional Independence between Response Time and Accuracy

Peer reviewed

Bolsinova, Maria; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2016

Conditional independence (CI) between response time and response accuracy is a fundamental assumption of many joint models for time and accuracy used in educational measurement. In this study, posterior predictive checks (PPCs) are proposed for testing this assumption. These PPCs are based on three discrepancy measures reflecting different…

Descriptors: Reaction Time, Accuracy, Statistical Analysis, Robustness (Statistics)

Grade of Membership Response Time Model for Detecting Guessing Behaviors

Peer reviewed

Pokropek, Artur – Journal of Educational and Behavioral Statistics, 2016

A response model that is able to detect guessing behaviors and produce unbiased estimates in low-stake conditions using timing information is proposed. The model is a special case of the grade of membership model in which responses are modeled as partial members of a class that is affected by motivation and a class that responds only according to…

Descriptors: Reaction Time, Models, Guessing (Tests), Computation

Testing for Aberrant Behavior in Response Time Modeling

Peer reviewed

Marianti, Sukaesi; Fox, Jean-Paul; Avetisyan, Marianna; Veldkamp, Bernard P.; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2014

Many standardized tests are now administered via computer rather than paper-and-pencil format. In a computer-based testing environment, it is possible to record not only the test taker's response to each question (item) but also the amount of time spent by the test taker in considering and answering each item. Response times (RTs) provide…

Descriptors: Reaction Time, Response Style (Tests), Computer Assisted Testing, Bayesian Statistics

Using Data-Dependent Priors to Mitigate Small Sample Bias in Latent Growth Models: A Discussion and Illustration Using Mplus

Peer reviewed

McNeish, Daniel M. – Journal of Educational and Behavioral Statistics, 2016

Mixed-effects models (MEMs) and latent growth models (LGMs) are often considered interchangeable save the discipline-specific nomenclature. Software implementations of these models, however, are not interchangeable, particularly with small sample sizes. Restricted maximum likelihood estimation that mitigates small sample bias in MEMs has not been…

Descriptors: Models, Statistical Analysis, Hierarchical Linear Modeling, Sample Size

Covariate Adjustment Strategy Increases Power in the Randomized Controlled Trial With Discrete-Time Survival Endpoints

Peer reviewed

Safarkhani, Maryam; Moerbeek, Mirjam – Journal of Educational and Behavioral Statistics, 2013

In a randomized controlled trial, a decision needs to be made about the total number of subjects for adequate statistical power. One way to increase the power of a trial is by including a predictive covariate in the model. In this article, the effects of various covariate adjustment strategies on increasing the power are studied for discrete-time…

Descriptors: Statistical Analysis, Scientific Methodology, Research Design, Sample Size

Student, School, and Country Differences in Sustained Test-Taking Effort in the 2009 PISA Reading Assessment

Peer reviewed

Debeer, Dries; Buchholz, Janine; Hartig, Johannes; Janssen, Rianne – Journal of Educational and Behavioral Statistics, 2014

In this article, the change in examinee effort during an assessment, which we will refer to as persistence, is modeled as an effect of item position. A multilevel extension is proposed to analyze hierarchically structured data and decompose the individual differences in persistence. Data from the 2009 Program of International Student Achievement…

Descriptors: Reading Tests, International Programs, Testing Programs, Individual Differences

Alternatives for Mixed-Effects Meta-Regression Models in the Reliability Generalization Approach: A Simulation Study

Peer reviewed

López-López, José Antonio; Botella, Juan; Sánchez-Meca, Julio; Marín-Martínez, Fulgencio – Journal of Educational and Behavioral Statistics, 2013

Since heterogeneity between reliability coefficients is usually found in reliability generalization studies, moderator analyses constitute a crucial step for that meta-analytic approach. In this study, different procedures for conducting mixed-effects meta-regression analyses were compared. Specifically, four transformation methods for the…

Descriptors: Reliability, Generalization, Meta Analysis, Regression (Statistics)
