ERIC - Search Results

Publication Date

In 2025	1
Since 2024	5
Since 2021 (last 5 years)	14
Since 2016 (last 10 years)	23
Since 2006 (last 20 years)	24

Descriptor

International Assessment	24
Foreign Countries	19
Achievement Tests	18
Secondary School Students	11
Item Response Theory	10
Models	8
Bayesian Statistics	7
Mathematics Achievement	7
Mathematics Tests	7
Monte Carlo Methods	7
Science Achievement	7
Science Tests	7
Test Items	7
Comparative Analysis	6
Elementary Secondary Education	6
Statistical Analysis	6
Statistical Inference	6
Computation	5
Simulation	5
Classification	4
Markov Processes	4
Questionnaires	4
Sampling	4
Computer Assisted Testing	3
Error of Measurement	3
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	24
Reports - Research	21
Reports - Descriptive	3

Education Level

Secondary Education	14
Elementary Secondary Education	6
Elementary Education	2
Grade 8	2
Junior High Schools	2
Middle Schools	2
Grade 12	1
High Schools	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Canada	2
South Korea	2
United States	2
Australia	1
Austria	1
Belgium	1
China (Shanghai)	1
Cyprus	1
Czech Republic	1
Denmark	1
Estonia	1
Finland	1
France	1
Germany	1
Ireland	1
Italy	1
Japan	1
Netherlands	1
North Carolina	1
Poland	1
Slovakia	1
Spain	1
Sweden	1
United Kingdom (England)	1
United Kingdom (Northern…	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	15
Trends in International…	8
National Assessment of…	2

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Generalizing beyond the Test: Permutation-Based Profile Analysis for Explaining DIF Using Item Features

Peer reviewed

Direct link

Maria Bolsinova; Jesper Tijmstra; Leslie Rutkowski; David Rutkowski – Journal of Educational and Behavioral Statistics, 2024

Profile analysis is one of the main tools for studying whether differential item functioning can be related to specific features of test items. While relevant, profile analysis in its current form has two restrictions that limit its usefulness in practice: It assumes that all test items have equal discrimination parameters, and it does not test…

Descriptors: Test Items, Item Analysis, Generalizability Theory, Achievement Tests

Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups

Peer reviewed

Direct link

Sean Joo; Montserrat Valdivia; Dubravka Svetina Valdivia; Leslie Rutkowski – Journal of Educational and Behavioral Statistics, 2024

Evaluating scale comparability in international large-scale assessments depends on measurement invariance (MI). The root mean square deviation (RMSD) is a standard method for establishing MI in several programs, such as the Programme for International Student Assessment and the Programme for the International Assessment of Adult Competencies.…

Descriptors: International Assessment, Monte Carlo Methods, Statistical Studies, Error of Measurement

Predictive Performance of Bayesian Stacking in Multilevel Education Data

Peer reviewed

Direct link

Mingya Huang; David Kaplan – Journal of Educational and Behavioral Statistics, 2025

The issue of model uncertainty has been gaining interest in education and the social sciences community over the years, and the dominant methods for handling model uncertainty are based on Bayesian inference, particularly, Bayesian model averaging. However, Bayesian model averaging assumes that the true data-generating model is within the…

Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Statistical Inference, Predictor Variables

Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches

Peer reviewed

Direct link

Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022

One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…

Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis

Using Response Times for Joint Modeling of Careless Responding and Attentive Response Styles

Peer reviewed

Direct link

Esther Ulitzsch; Steffi Pohl; Lale Khorramdel; Ulf Kroehne; Matthias von Davier – Journal of Educational and Behavioral Statistics, 2024

Questionnaires are by far the most common tool for measuring noncognitive constructs in psychology and educational sciences. Response bias may pose an additional source of variation between respondents that threatens validity of conclusions drawn from questionnaire data. We present a mixture modeling approach that leverages response time data from…

Descriptors: Item Response Theory, Response Style (Tests), Questionnaires, Secondary School Students

Detecting Noneffortful Responses Based on a Residual Method Using an Iterative Purification Process

Peer reviewed

Direct link

Liu, Yue; Liu, Hongyun – Journal of Educational and Behavioral Statistics, 2021

The prevalence and serious consequences of noneffortful responses from unmotivated examinees are well-known in educational measurement. In this study, we propose to apply an iterative purification process based on a response time residual method with fixed item parameter estimates to detect noneffortful responses. The proposed method is compared…

Descriptors: Response Style (Tests), Reaction Time, Test Items, Accuracy

Expertise on Offer: Why Isn't Anyone Buying?

Peer reviewed

Direct link

Braun, Henry – Journal of Educational and Behavioral Statistics, 2023

It is a much-lamented fact that research with the potential to inform or influence education policy instead remains policy inert. There are many reasons for this frustrating state of affairs, including a lack of strategic thinking on the part of researchers on how to successfully accomplish outreach--as opposed to communication with peers…

Descriptors: Educational Policy, Educational Research, Educational Researchers, Persuasive Discourse

Estimating Heterogeneous Treatment Effects within Latent Class Multilevel Models: A Bayesian Approach

Peer reviewed

Direct link

Lyu, Weicong; Kim, Jee-Seon; Suk, Youmi – Journal of Educational and Behavioral Statistics, 2023

This article presents a latent class model for multilevel data to identify latent subgroups and estimate heterogeneous treatment effects. Unlike sequential approaches that partition data first and then estimate average treatment effects (ATEs) within classes, we employ a Bayesian procedure to jointly estimate mixing probability, selection, and…

Descriptors: Hierarchical Linear Modeling, Bayesian Statistics, Causal Models, Statistical Inference

Chance-Constrained Automated Test Assembly

Peer reviewed

Direct link

Giada Spaccapanico Proietti; Mariagiulia Matteucci; Stefania Mignani; Bernard P. Veldkamp – Journal of Educational and Behavioral Statistics, 2024

Classical automated test assembly (ATA) methods assume fixed and known coefficients for the constraints and the objective function. This hypothesis is not true for the estimates of item response theory parameters, which are crucial elements in test assembly classical models. To account for uncertainty in ATA, we propose a chance-constrained…

Descriptors: Automation, Computer Assisted Testing, Ambiguity (Context), Item Response Theory

Bayesian Analysis Methods for Two-Level Diagnosis Classification Models

Peer reviewed

Direct link

Yamaguchi, Kazuhiro – Journal of Educational and Behavioral Statistics, 2023

Understanding whether or not different types of students master various attributes can aid future learning remediation. In this study, two-level diagnostic classification models (DCMs) were developed to represent the probabilistic relationship between external latent classes and attribute mastery patterns. Furthermore, variational Bayesian (VB)…

Descriptors: Bayesian Statistics, Classification, Statistical Inference, Sampling

A Bayesian Item Response Model for Examining Item Position Effects in Complex Survey Data

Peer reviewed

Direct link

Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021

A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…

Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format

Hybridizing Machine Learning Methods and Finite Mixture Models for Estimating Heterogeneous Treatment Effects in Latent Classes

Peer reviewed

Direct link

Suk, Youmi; Kim, Jee-Seon; Kang, Hyunseung – Journal of Educational and Behavioral Statistics, 2021

There has been increasing interest in exploring heterogeneous treatment effects using machine learning (ML) methods such as causal forests, Bayesian additive regression trees, and targeted maximum likelihood estimation. However, there is little work on applying these methods to estimate treatment effects in latent classes defined by…

Descriptors: Artificial Intelligence, Statistical Analysis, Statistical Inference, Classification

Category-Level Model Selection for the Sequential G-DINA Model

Peer reviewed

Direct link

Ma, Wenchao; de la Torre, Jimmy – Journal of Educational and Behavioral Statistics, 2019

Solving a constructed-response item usually requires successfully performing a sequence of tasks. Each task could involve different attributes, and those required attributes may be "condensed" in various ways to produce the responses. The sequential generalized deterministic input noisy "and" gate model is a general cognitive…

Descriptors: Test Items, Cognitive Measurement, Models, Hypothesis Testing

Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities

Peer reviewed

Direct link

von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…

Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory

Statistical Equivalence Testing Approaches for Mantel-Haenszel DIF Analysis

Peer reviewed

Direct link

Casabianca, Jodi M.; Lewis, Charles – Journal of Educational and Behavioral Statistics, 2018

The null hypothesis test used in differential item functioning (DIF) detection tests for a subgroup difference in item-level performance--if the null hypothesis of "no DIF" is rejected, the item is flagged for DIF. Conversely, an item is kept in the test form if there is insufficient evidence of DIF. We present frequentist and empirical…

Descriptors: Test Bias, Hypothesis Testing, Bayesian Statistics, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2

Robitzsch, Alexander	4
Lüdtke, Oliver	3
Casabianca, Jodi M.	2
Grund, Simon	2
Kim, Jee-Seon	2
Leslie Rutkowski	2
Lewis, Charles	2
Suk, Youmi	2
Bernard P. Veldkamp	1
Braun, Henry	1
Chen, Haiwen	1
Clifton, James P.	1
Cobb, Patrice R.	1
David Kaplan	1
David Rutkowski	1
Depaoli, Sarah	1
Dubravka Svetina Valdivia	1
Esther Ulitzsch	1
Giada Spaccapanico Proietti	1
He, Qiwei	1
Jesper Tijmstra	1
Kang, Hyunseung	1
Kaplan, David	1
Khorramdel, Lale	1
Lale Khorramdel	1
More ▼