ERIC - Search Results

Showing 1 to 15 of 22 results

A Critical View on the NEAT Equating Design: Statistical Modeling and Identifiability Problems

Peer reviewed

San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022

The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design rendering…

Descriptors: Tests, Scores, Statistical Analysis, Models

Latent Transition Cognitive Diagnosis Model with Covariates: A Three-Step Approach

Peer reviewed

Liang, Qianru; de la Torre, Jimmy; Law, Nancy – Journal of Educational and Behavioral Statistics, 2023

To expand the use of cognitive diagnosis models (CDMs) to longitudinal assessments, this study proposes a bias-corrected three-step estimation approach for latent transition CDMs with covariates by integrating a general CDM and a latent transition model. The proposed method can be used to assess changes in attribute mastery status and attribute…

Descriptors: Cognitive Measurement, Models, Statistical Bias, Computation

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

Testing the Within-State Distribution in Mixture Models for Responses and Response Times

Peer reviewed

Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021

Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…

Descriptors: Test Items, Responses, Reaction Time, Models

A Scaled Threshold Model for Measuring Extreme Response Style

Peer reviewed

Lubbe, Dirk; Schuster, Christof – Journal of Educational and Behavioral Statistics, 2020

Extreme response style is the tendency of individuals to prefer the extreme categories of a rating scale irrespective of item content. It has been shown repeatedly that individual response style differences affect the reliability and validity of item responses and should, therefore, be considered carefully. To account for extreme response style…

Descriptors: Response Style (Tests), Rating Scales, Item Response Theory, Models

Forced-Choice Ranking Models for Raters' Ranking Data

Peer reviewed

Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022

To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…

Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences

Deep Learning with TensorFlow: A Review

Peer reviewed

Pang, Bo; Nijkamp, Erik; Wu, Ying Nian – Journal of Educational and Behavioral Statistics, 2020

This review covers the core concepts and design decisions of TensorFlow. TensorFlow, originally created by researchers at Google, is the most popular of the many deep learning libraries. In the field of deep learning, neural networks have achieved tremendous success and gained wide popularity in various areas. This family of models…

Descriptors: Artificial Intelligence, Regression (Statistics), Models, Classification

Deep Reinforcement Learning for Adaptive Learning Systems

Peer reviewed

Li, Xiao; Xu, Hanchen; Zhang, Jinming; Chang, Hua-hua – Journal of Educational and Behavioral Statistics, 2023

The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learner's latent traits. In this article, we study an important yet less-addressed adaptive learning problem--one that assumes continuous latent traits.…

Descriptors: Learning Processes, Models, Algorithms, Individualized Instruction

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Incorporating Covariates into Stochastic Blockmodels

Peer reviewed

Sweet, Tracy M. – Journal of Educational and Behavioral Statistics, 2015

Social networks in education commonly involve some form of grouping, such as friendship cliques or teacher departments, and blockmodels are a type of statistical social network model that accommodates these groupings, or blocks, by assuming different within-group tie probabilities than between-group tie probabilities. We describe a class of models,…

Descriptors: Social Networks, Statistical Analysis, Probability, Models

The Sequential Probability Ratio Test and Binary Item Response Models

Peer reviewed

Nydick, Steven W. – Journal of Educational and Behavioral Statistics, 2014

The sequential probability ratio test (SPRT) is a common method for terminating item response theory (IRT)-based adaptive classification tests. To decide whether a classification test should stop, the SPRT compares a simple log-likelihood ratio, based on the classification bound separating two categories, to prespecified critical values. As has…

Descriptors: Probability, Item Response Theory, Models, Classification

Bayesian Estimation of the DINA Model with Gibbs Sampling

Peer reviewed

Culpepper, Steven Andrew – Journal of Educational and Behavioral Statistics, 2015

A Bayesian model formulation of the deterministic inputs, noisy "and" gate (DINA) model is presented. Gibbs sampling is employed to simulate from the joint posterior distribution of item guessing and slipping parameters, subject attribute parameters, and latent class probabilities. The procedure extends concepts in Béguin and Glas,…

Descriptors: Bayesian Statistics, Models, Sampling, Computation

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the χ² statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Modeling Answer Changes on Test Items

Peer reviewed

van der Linden, Wim J.; Jeon, Minjeong – Journal of Educational and Behavioral Statistics, 2012

The probability of test takers changing answers upon review of their initial choices is modeled. The primary purpose of the model is to check erasures on answer sheets recorded by an optical scanner for numbers and patterns that may be indicative of irregular behavior, such as teachers or school administrators changing answer sheets after their…

Descriptors: Probability, Models, Test Items, Educational Testing
