ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	12

Source

Journal of Educational and…

Publication Type

Journal Articles	13
Reports - Research	9
Reports - Descriptive	2
Reports - Evaluative	2

Education Level

Elementary Secondary Education

Audience

Location

California	1
Indiana	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Forced-Choice Ranking Models for Raters' Ranking Data

Peer reviewed

Direct link

Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022

To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…

Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Direct link

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Direct link

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

Incorporating Covariates into Stochastic Blockmodels

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sweet, Tracy M. – Journal of Educational and Behavioral Statistics, 2015

Social networks in education commonly involve some form of grouping, such as friendship cliques or teacher departments, and blockmodels are a type of statistical social network model that accommodate these grouping or blocks by assuming different within-group tie probabilities than between-group tie probabilities. We describe a class of models,…

Descriptors: Social Networks, Statistical Analysis, Probability, Models

Ratio-of-Mediator-Probability Weighting for Causal Mediation Analysis in the Presence of Treatment-by-Mediator Interaction

Peer reviewed

Direct link

Guanglei Hong; Jonah Deutsch; Heather D. Hill – Journal of Educational and Behavioral Statistics, 2015

Conventional methods for mediation analysis generate biased results when the mediator--outcome relationship depends on the treatment condition. This article shows how the ratio-of-mediator-probability weighting (RMPW) method can be used to decompose total effects into natural direct and indirect effects in the presence of treatment-by-mediator…

Descriptors: Weighted Scores, Probability, Statistical Analysis, Interaction

The Sequential Probability Ratio Test and Binary Item Response Models

Peer reviewed

Direct link

Nydick, Steven W. – Journal of Educational and Behavioral Statistics, 2014

The sequential probability ratio test (SPRT) is a common method for terminating item response theory (IRT)-based adaptive classification tests. To decide whether a classification test should stop, the SPRT compares a simple log-likelihood ratio, based on the classification bound separating two categories, to prespecified critical values. As has…

Descriptors: Probability, Item Response Theory, Models, Classification

Bayesian Estimation of the DINA Model with Gibbs Sampling

Peer reviewed

Direct link

Culpepper, Steven Andrew – Journal of Educational and Behavioral Statistics, 2015

A Bayesian model formulation of the deterministic inputs, noisy "and" gate (DINA) model is presented. Gibbs sampling is employed to simulate from the joint posterior distribution of item guessing and slipping parameters, subject attribute parameters, and latent class probabilities. The procedure extends concepts in Béguin and Glas,…

Descriptors: Bayesian Statistics, Models, Sampling, Computation

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Robust Estimation of Latent Ability in Item Response Models

Peer reviewed

Direct link

Schuster, Christof; Yuan, Ke-Hai – Journal of Educational and Behavioral Statistics, 2011

Because of response disturbances such as guessing, cheating, or carelessness, item response models often can only approximate the "true" individual response probabilities. As a consequence, maximum-likelihood estimates of ability will be biased. Typically, the nature and extent to which response disturbances are present is unknown, and, therefore,…

Descriptors: Computation, Item Response Theory, Probability, Maximum Likelihood Statistics

Mixed and Mixture Regression Models for Continuous Bounded Responses Using the Beta Distribution

Peer reviewed

Direct link

Verkuilen, Jay; Smithson, Michael – Journal of Educational and Behavioral Statistics, 2012

Doubly bounded continuous data are common in the social and behavioral sciences. Examples include judged probabilities, confidence ratings, derived proportions such as percent time on task, and bounded scale scores. Dependent variables of this kind are often difficult to analyze using normal theory models because their distributions may be quite…

Descriptors: Responses, Regression (Statistics), Statistical Analysis, Models

Using the Kernel Method of Test Equating for Estimating the Standard Errors of Population Invariance Measures

Peer reviewed

Direct link

Moses, Tim – Journal of Educational and Behavioral Statistics, 2008

Equating functions are supposed to be population invariant, meaning that the choice of subpopulation used to compute the equating function should not matter. The extent to which equating functions are population invariant is typically assessed in terms of practical difference criteria that do not account for equating functions' sampling…

Descriptors: Equated Scores, Error of Measurement, Sampling, Evaluation Methods

On Using Stochastic Curtailment to Shorten the SPRT in Sequential Mastery Testing

Peer reviewed

Direct link

Finkelman, Matthew – Journal of Educational and Behavioral Statistics, 2008

Sequential mastery testing (SMT) has been researched as an efficient alternative to paper-and-pencil testing for pass/fail examinations. One popular method for determining when to cease examination in SMT is the truncated sequential probability ratio test (TSPRT). This article introduces the application of stochastic curtailment in SMT to shorten…

Descriptors: Mastery Tests, Sequential Approach, Computer Assisted Testing, Adaptive Testing

Using Data Augmentation and Markov Chain Monte Carlo for the Estimation of Unfolding Response Models

Peer reviewed

Direct link

Johnson, Matthew S.; Junker, Brian W. – Journal of Educational and Behavioral Statistics, 2003

Unfolding response models, a class of item response theory (IRT) models that assume a unimodal item response function (IRF), are often used for the measurement of attitudes. Verhelst and Verstralen (1993)and Andrich and Luo (1993) independently developed unfolding response models by relating the observed responses to a more common monotone IRT…

Descriptors: Markov Processes, Item Response Theory, Computation, Data Analysis

Probability	13
Simulation	13
Computation	8
Item Response Theory	8
Models	8
Monte Carlo Methods	5
Bayesian Statistics	4
Markov Processes	4
Statistical Analysis	4
Equations (Mathematics)	3
Error of Measurement	3
Maximum Likelihood Statistics	3
Sampling	3
Adaptive Testing	2
Classification	2
Computer Assisted Testing	2
Correlation	2
Data Analysis	2
Evaluation Methods	2
Goodness of Fit	2
Regression (Statistics)	2
Responses	2
Sample Size	2
Scoring	2
Statistical Distributions	2
More ▼

Chan, Wendy	1
Culpepper, Steven Andrew	1
Finkelman, Matthew	1
Guanglei Hong	1
Heather D. Hill	1
Huang, Hung-Yu	1
Hung, Su-Pin	1
Johnson, Matthew S.	1
Jonah Deutsch	1
Junker, Brian W.	1
Monroe, Scott	1
Moses, Tim	1
Nydick, Steven W.	1
Schuster, Christof	1
Sinharay, Sandip	1
Smithson, Michael	1
Sweet, Tracy M.	1
Verkuilen, Jay	1
Yuan, Ke-Hai	1
More ▼