ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	13

Descriptor

Item Response Theory	16
Probability	16
Models	11
Simulation	8
Computation	6
Monte Carlo Methods	5
Test Items	5
Markov Processes	4
Maximum Likelihood Statistics	4
Bayesian Statistics	3
Equations (Mathematics)	3
Error of Measurement	3
Statistical Distributions	3
Adaptive Testing	2
Computer Assisted Testing	2
Correlation	2
Decision Making	2
Evaluation Research	2
Foreign Countries	2
Inferences	2
Item Analysis	2
Measurement Techniques	2
Prediction	2
Preferences	2
Rating Scales	2
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	16
Reports - Research	11
Reports - Descriptive	3
Reports - Evaluative	2

Education Level

Grade 8	1
Higher Education	1

Audience

Location

Belgium	1
Netherlands	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

An Improved Inferential Procedure to Evaluate Item Discriminations in a Conditional Maximum Likelihood Framework

Peer reviewed

Direct link

Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024

A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…

Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics

A Scaled Threshold Model for Measuring Extreme Response Style

Peer reviewed

Direct link

Lubbe, Dirk; Schuster, Christof – Journal of Educational and Behavioral Statistics, 2020

Extreme response style is the tendency of individuals to prefer the extreme categories of a rating scale irrespective of item content. It has been shown repeatedly that individual response style differences affect the reliability and validity of item responses and should, therefore, be considered carefully. To account for extreme response style…

Descriptors: Response Style (Tests), Rating Scales, Item Response Theory, Models

Forced-Choice Ranking Models for Raters' Ranking Data

Peer reviewed

Direct link

Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022

To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…

Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences

Deep Reinforcement Learning for Adaptive Learning Systems

Peer reviewed

Direct link

Li, Xiao; Xu, Hanchen; Zhang, Jinming; Chang, Hua-hua – Journal of Educational and Behavioral Statistics, 2023

The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learner's latent traits. In this article, we study an important yet less-addressed adaptive learning problem--one that assumes continuous latent traits.…

Descriptors: Learning Processes, Models, Algorithms, Individualized Instruction

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Direct link

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

The Prevalence and Implications of Slipping on Low-Stakes, Large-Scale Assessments

Peer reviewed

Direct link

Culpepper, Steven Andrew – Journal of Educational and Behavioral Statistics, 2017

In the absence of clear incentives, achievement tests may be subject to the effect of slipping where item response functions have upper asymptotes below one. Slipping reduces score precision for higher latent scores and distorts test developers' understandings of item and test information. A multidimensional four-parameter normal ogive model was…

Descriptors: Measurement, Achievement Tests, Item Response Theory, National Competency Tests

The Sequential Probability Ratio Test and Binary Item Response Models

Peer reviewed

Direct link

Nydick, Steven W. – Journal of Educational and Behavioral Statistics, 2014

The sequential probability ratio test (SPRT) is a common method for terminating item response theory (IRT)-based adaptive classification tests. To decide whether a classification test should stop, the SPRT compares a simple log-likelihood ratio, based on the classification bound separating two categories, to prespecified critical values. As has…

Descriptors: Probability, Item Response Theory, Models, Classification

Bayesian Estimation of the DINA Model with Gibbs Sampling

Peer reviewed

Direct link

Culpepper, Steven Andrew – Journal of Educational and Behavioral Statistics, 2015

A Bayesian model formulation of the deterministic inputs, noisy "and" gate (DINA) model is presented. Gibbs sampling is employed to simulate from the joint posterior distribution of item guessing and slipping parameters, subject attribute parameters, and latent class probabilities. The procedure extends concepts in Béguin and Glas,…

Descriptors: Bayesian Statistics, Models, Sampling, Computation

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Robust Estimation of Latent Ability in Item Response Models

Peer reviewed

Direct link

Schuster, Christof; Yuan, Ke-Hai – Journal of Educational and Behavioral Statistics, 2011

Because of response disturbances such as guessing, cheating, or carelessness, item response models often can only approximate the "true" individual response probabilities. As a consequence, maximum-likelihood estimates of ability will be biased. Typically, the nature and extent to which response disturbances are present is unknown, and, therefore,…

Descriptors: Computation, Item Response Theory, Probability, Maximum Likelihood Statistics

Beta Regression Finite Mixture Models of Polarization and Priming

Peer reviewed

Direct link

Smithson, Michael; Merkle, Edgar C.; Verkuilen, Jay – Journal of Educational and Behavioral Statistics, 2011

This paper describes the application of finite-mixture general linear models based on the beta distribution to modeling response styles, polarization, anchoring, and priming effects in probability judgments. These models, in turn, enhance our capacity for explicitly testing models and theories regarding the aforementioned phenomena. The mixture…

Descriptors: Priming, Research Methodology, Probability, Item Response Theory

On Using Stochastic Curtailment to Shorten the SPRT in Sequential Mastery Testing

Peer reviewed

Direct link

Finkelman, Matthew – Journal of Educational and Behavioral Statistics, 2008

Sequential mastery testing (SMT) has been researched as an efficient alternative to paper-and-pencil testing for pass/fail examinations. One popular method for determining when to cease examination in SMT is the truncated sequential probability ratio test (TSPRT). This article introduces the application of stochastic curtailment in SMT to shorten…

Descriptors: Mastery Tests, Sequential Approach, Computer Assisted Testing, Adaptive Testing

Randomized Item Response Theory Models

Peer reviewed

Direct link

Fox, Jean-Paul – Journal of Educational and Behavioral Statistics, 2005

The randomized response (RR) technique is often used to obtain answers on sensitive questions. A new method is developed to measure latent variables using the RR technique because direct questioning leads to biased results. Within the RR technique is the probability of the true response modeled by an item response theory (IRT) model. The RR…

Descriptors: Item Response Theory, Models, Probability, Markov Processes

Assessing and Explaining Differential Item Functioning Using Logistic Mixed Models

Peer reviewed

Direct link

Van den Noortgate, Wim; De Boeck, Paul – Journal of Educational and Behavioral Statistics, 2005

Although differential item functioning (DIF) theory traditionally focuses on the behavior of individual items in two (or a few) specific groups, in educational measurement contexts, it is often plausible to regard the set of items as a random sample from a broader category. This article presents logistic mixed models that can be used to model…

Descriptors: Test Bias, Item Response Theory, Educational Assessment, Mathematical Models

Previous Page | Next Page »

Pages: 1 | 2

Culpepper, Steven Andrew	2
Schuster, Christof	2
Andreas Kurz	1
Can Gürer	1
Chang, Hua-hua	1
Clemens Draxler	1
De Boeck, Paul	1
Finkelman, Matthew	1
Fox, Jean-Paul	1
Huang, Hung-Yu	1
Hung, Su-Pin	1
Jan Philipp Nolte	1
Johnson, Matthew S.	1
Junker, Brian W.	1
Li, Xiao	1
Lubbe, Dirk	1
Merkle, Edgar C.	1
Monroe, Scott	1
Nydick, Steven W.	1
Sinharay, Sandip	1
Smithson, Michael	1
Van den Noortgate, Wim	1
Verkuilen, Jay	1
Wallin, Gabriel	1
Wiberg, Marie	1
More ▼