Showing 1 to 15 of 24 results
Peer reviewed
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
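As a rough illustration of the balancing idea in the abstract above, the sketch below estimates a propensity score from simulated covariates with logistic regression and checks covariate balance via standardized mean differences before and after inverse-probability weighting. The data, covariates, and weighting scheme are hypothetical stand-ins, not the equating procedure studied by the authors.

```python
# Minimal sketch: estimate a propensity score from observed covariates and
# check covariate balance between two nonequivalent test groups.
# All data below are simulated for illustration only.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 2000
X = rng.normal(size=(n, 3))                      # hypothetical covariates
g = rng.binomial(1, 1 / (1 + np.exp(-X[:, 0])))  # group membership depends on X

ps = LogisticRegression().fit(X, g).predict_proba(X)[:, 1]  # estimated propensity score
w = np.where(g == 1, 1 / ps, 1 / (1 - ps))                  # inverse-probability weights

def smd(x, g, w=None):
    """Standardized mean difference between groups, optionally weighted."""
    w = np.ones_like(x) if w is None else w
    m1 = np.average(x[g == 1], weights=w[g == 1])
    m0 = np.average(x[g == 0], weights=w[g == 0])
    s = np.sqrt((x[g == 1].var() + x[g == 0].var()) / 2)  # pooled SD of raw covariate
    return (m1 - m0) / s

for j in range(X.shape[1]):
    print(f"covariate {j}: SMD raw={smd(X[:, j], g):.3f}, "
          f"weighted={smd(X[:, j], g, w):.3f}")
```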
Peer reviewed
Wang, Xi; Liu, Yang – Journal of Educational and Behavioral Statistics, 2020
In continuous testing programs, some items are repeatedly used across test administrations, and statistical methods are often used to evaluate whether items become compromised due to examinees' preknowledge. In this study, we proposed a residual method to detect compromised items when a test can be partitioned into two subsets of items: secure…
Descriptors: Test Items, Information Security, Error of Measurement, Cheating
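A minimal sketch of a residual-type check in the spirit described above, assuming a Rasch model with known item difficulties: ability is estimated from the secure subset, and standardized observed-minus-expected residuals are aggregated on the remaining items. The simulated data and the particular statistic are illustrative only and need not match the authors' method.

```python
# Residual-type check for item compromise under a Rasch model with known
# difficulties. Simulated data; item 0 is artificially "compromised".
import numpy as np

rng = np.random.default_rng(1)
n_person, n_secure, n_open = 500, 20, 10
theta = rng.normal(size=n_person)
b_secure = rng.normal(size=n_secure)
b_open = rng.normal(size=n_open)

p = lambda th, b: 1 / (1 + np.exp(-(th[:, None] - b[None, :])))
x_secure = rng.binomial(1, p(theta, b_secure))
x_open = rng.binomial(1, p(theta, b_open))
x_open[:100, 0] = 1   # pretend the first 100 examinees had preknowledge of item 0

# crude ability estimate from the secure subset (grid-search MLE)
grid = np.linspace(-4, 4, 161)
loglik = (x_secure @ np.log(p(grid, b_secure)).T
          + (1 - x_secure) @ np.log(1 - p(grid, b_secure)).T)
theta_hat = grid[np.argmax(loglik, axis=1)]

# approximate z statistic per open item; large positive values flag compromise
p_hat = p(theta_hat, b_open)
resid = x_open - p_hat
z = resid.sum(axis=0) / np.sqrt((p_hat * (1 - p_hat)).sum(axis=0))
print(np.round(z, 1))
```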
Peer reviewed
van der Linden, Wim J.; Ren, Hao – Journal of Educational and Behavioral Statistics, 2020
The Bayesian way of accounting for the effects of error in the ability and item parameters in adaptive testing is through the joint posterior distribution of all parameters. An optimized Markov chain Monte Carlo algorithm for adaptive testing is presented, which samples this distribution in real time to score the examinee's ability and optimally…
Descriptors: Bayesian Statistics, Adaptive Testing, Error of Measurement, Markov Processes
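The sketch below is a generic illustration of MCMC-based scoring in adaptive testing: a random-walk Metropolis sampler for the ability posterior under a 2PL model with a standard normal prior, followed by next-item selection by Fisher information at the posterior mean. Item parameters are treated as known here, whereas the article samples the joint posterior of all parameters with an optimized algorithm.

```python
# Random-walk Metropolis sampling of the ability posterior under a 2PL model
# with a N(0, 1) prior, then next-item selection by Fisher information at the
# posterior mean. Item parameters and responses are hypothetical.
import numpy as np

rng = np.random.default_rng(2)
a = rng.uniform(0.8, 2.0, size=30)     # hypothetical discriminations
b = rng.normal(size=30)                # hypothetical difficulties
administered = [3, 7, 12]              # items seen so far
x = np.array([1, 0, 1])                # responses to those items

def logpost(theta):
    p = 1 / (1 + np.exp(-a[administered] * (theta - b[administered])))
    return np.sum(x * np.log(p) + (1 - x) * np.log(1 - p)) - 0.5 * theta**2

theta, draws = 0.0, []
for _ in range(2000):
    prop = theta + rng.normal(scale=0.5)
    if np.log(rng.uniform()) < logpost(prop) - logpost(theta):
        theta = prop
    draws.append(theta)
theta_hat = np.mean(draws[500:])       # posterior mean after burn-in

# choose the unadministered item with maximal Fisher information at theta_hat
p_all = 1 / (1 + np.exp(-a * (theta_hat - b)))
info = a**2 * p_all * (1 - p_all)
info[administered] = -np.inf
print("next item:", int(np.argmax(info)), "theta_hat:", round(theta_hat, 2))
```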
Peer reviewed
Lai, Mark H. C. – Journal of Educational and Behavioral Statistics, 2019
Previous studies have detailed the consequence of ignoring a level of clustering in multilevel models with strictly hierarchical structures and have proposed methods to adjust for the fixed effect standard errors (SEs). However, in behavioral and social science research, there are usually two or more crossed clustering levels, such as when…
Descriptors: Error of Measurement, Hierarchical Linear Modeling, Least Squares Statistics, Statistical Bias
Peer reviewed
Hayes, Timothy – Journal of Educational and Behavioral Statistics, 2019
Multiple imputation is a popular method for addressing data that are presumed to be missing at random. To obtain accurate results, one's imputation model must be congenial to (appropriate for) one's intended analysis model. This article reviews and demonstrates two recent software packages, Blimp and jomo, to multiply impute data in a manner…
Descriptors: Computer Software Evaluation, Computer Software Reviews, Hierarchical Linear Modeling, Data Analysis
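Independent of the specific packages reviewed, the step that follows any multiple-imputation run is to pool per-imputation results with Rubin's rules; a minimal sketch with made-up estimates follows.

```python
# Rubin's rules for pooling an estimate across m imputed data sets. The
# per-imputation estimates below are made-up placeholders standing in for
# output from any imputation engine, including the packages reviewed here.
import numpy as np

est = np.array([0.42, 0.39, 0.45, 0.41, 0.44])       # hypothetical per-imputation estimates
var = np.array([0.012, 0.011, 0.013, 0.012, 0.012])  # their squared standard errors
m = len(est)

qbar = est.mean()                         # pooled point estimate
w = var.mean()                            # within-imputation variance
bvar = est.var(ddof=1)                    # between-imputation variance
t = w + (1 + 1 / m) * bvar                # total variance
df = (m - 1) * (1 + w / ((1 + 1 / m) * bvar)) ** 2   # Rubin's degrees of freedom
print(f"pooled estimate {qbar:.3f}, SE {np.sqrt(t):.3f}, df {df:.1f}")
```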
Peer reviewed
Vegetabile, Brian G.; Stout-Oswald, Stephanie A.; Davis, Elysia Poggi; Baram, Tallie Z.; Stern, Hal S. – Journal of Educational and Behavioral Statistics, 2019
Predictability of behavior is an important characteristic in many fields including biology, medicine, marketing, and education. When a sequence of actions performed by an individual can be modeled as a stationary time-homogeneous Markov chain, the predictability of the individual's behavior can be quantified by the entropy rate of the process. This…
Descriptors: Markov Processes, Prediction, Behavior, Computation
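The entropy rate of a stationary, time-homogeneous Markov chain is H = -sum_i pi_i sum_j P_ij log P_ij. The sketch below is a simple plug-in estimate from an observed sequence (estimate the transition matrix, take its stationary distribution, and plug in); it is not necessarily the estimator developed in the article.

```python
# Plug-in estimate of the entropy rate of a stationary Markov chain:
# H = -sum_i pi_i sum_j P_ij log P_ij, with P estimated from transition
# counts and pi taken as its stationary distribution.
import numpy as np

def entropy_rate(seq, n_states):
    counts = np.zeros((n_states, n_states))
    for s, t in zip(seq[:-1], seq[1:]):
        counts[s, t] += 1
    P = counts / counts.sum(axis=1, keepdims=True)
    # stationary distribution: left eigenvector of P with eigenvalue 1
    vals, vecs = np.linalg.eig(P.T)
    pi = np.real(vecs[:, np.argmin(np.abs(vals - 1))])
    pi = pi / pi.sum()
    with np.errstate(divide="ignore", invalid="ignore"):
        h_rows = -np.sum(np.where(P > 0, P * np.log(P), 0.0), axis=1)
    return float(pi @ h_rows)

# toy two-state behavior sequence simulated from a known chain
rng = np.random.default_rng(3)
P_true = np.array([[0.9, 0.1], [0.4, 0.6]])
seq = [0]
for _ in range(5000):
    seq.append(rng.choice(2, p=P_true[seq[-1]]))
print("estimated entropy rate (nats):", round(entropy_rate(seq, 2), 3))
```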
Peer reviewed
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019
In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…
Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences
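For intuition, the sketch below contrasts the two quantities for the ability parameter of a 3PL model: the expected item information uses the standard closed form I_j = a_j^2 (Q_j/P_j) ((P_j - c_j)/(1 - c_j))^2, while the observed information is obtained by numerically differentiating the realized log-likelihood, so it depends on the response pattern. Parameters and responses are simulated, and this is only an illustration, not the article's procedures for item-parameter estimation.

```python
# Expected vs. observed information for ability under the 3PL model,
# evaluated at the data-generating ability for simplicity.
import numpy as np

rng = np.random.default_rng(4)
a, b, c = rng.uniform(0.8, 2.0, 25), rng.normal(size=25), np.full(25, 0.2)
theta0 = 0.5
prob = lambda th: c + (1 - c) / (1 + np.exp(-a * (th - b)))
x = rng.binomial(1, prob(theta0))           # simulated response pattern

def loglik(th):
    p = prob(th)
    return np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

# expected information: closed form summed over items
p0 = prob(theta0)
expected = np.sum(a**2 * ((1 - p0) / p0) * ((p0 - c) / (1 - c)) ** 2)

# observed information: minus the numerical second derivative of log-likelihood
h = 1e-4
observed = -(loglik(theta0 + h) - 2 * loglik(theta0) + loglik(theta0 - h)) / h**2

print(f"expected information {expected:.2f}, observed information {observed:.2f}")
```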
Peer reviewed
McCoach, D. Betsy; Rifenbark, Graham G.; Newton, Sarah D.; Li, Xiaoran; Kooken, Janice; Yomtov, Dani; Gambino, Anthony J.; Bellara, Aarti – Journal of Educational and Behavioral Statistics, 2018
This study compared five common multilevel software packages via Monte Carlo simulation: HLM 7, Mplus 7.4, R (lme4 V1.1-12), Stata 14.1, and SAS 9.4 to determine how the programs differ in estimation accuracy and speed, as well as convergence, when modeling multiple randomly varying slopes of different magnitudes. Simulated data…
Descriptors: Hierarchical Linear Modeling, Computer Software, Comparative Analysis, Monte Carlo Methods
Peer reviewed
Sweet, Tracy M.; Junker, Brian W. – Journal of Educational and Behavioral Statistics, 2016
The hierarchical network model (HNM) is a framework introduced by Sweet, Thomas, and Junker for modeling interventions and other covariate effects on ensembles of social networks, such as what would be found in randomized controlled trials in education research. In this article, we develop calculations for the power to detect an intervention…
Descriptors: Intervention, Social Networks, Statistical Analysis, Computation
Peer reviewed
VanHoudnos, Nathan M.; Greenhouse, Joel B. – Journal of Educational and Behavioral Statistics, 2016
When cluster randomized experiments are analyzed as if units were independent, test statistics for treatment effects can be anticonservative. Hedges proposed a correction for such tests by scaling them to control their Type I error rate. This article generalizes the Hedges correction from a posttest-only experimental design to more common designs…
Descriptors: Statistical Analysis, Randomized Controlled Trials, Error of Measurement, Scaling
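The sketch below only conveys the underlying idea with made-up numbers: a naive test statistic is deflated by the design effect 1 + (n - 1)ρ implied by the intraclass correlation ρ. The actual Hedges correction, and its generalization in this article, uses a more refined scaling and also adjusts the degrees of freedom.

```python
# Simplified design-effect scaling of a t statistic computed as if units
# were independent. Numbers are hypothetical; this is not the exact Hedges
# correction, which also adjusts the degrees of freedom.
import math

t_naive = 2.8     # t statistic ignoring clustering (hypothetical)
n = 25            # students per classroom (hypothetical)
rho = 0.15        # intraclass correlation (hypothetical)

deff = 1 + (n - 1) * rho          # design effect
t_adj = t_naive / math.sqrt(deff) # deflated test statistic
print(f"design effect {deff:.2f}, adjusted t {t_adj:.2f}")
```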
Peer reviewed
Ahn, Soyeon; Becker, Betsy Jane – Journal of Educational and Behavioral Statistics, 2011
This paper examines the impact of quality-score weights in meta-analysis. A simulation examines the roles of study characteristics such as population effect size (ES) and its variance on the bias and mean square errors (MSEs) of the estimators for several patterns of relationship between quality and ES, and for specific patterns of systematic…
Descriptors: Meta Analysis, Scores, Effect Size, Statistical Bias
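A toy contrast between ordinary inverse-variance weights w_i = 1/v_i and quality-adjusted weights w_i = q_i/v_i; all effect sizes, variances, and quality scores below are invented, and the article's simulation design is far more elaborate.

```python
# Inverse-variance vs. quality-adjusted weighting of study effect sizes.
import numpy as np

d = np.array([0.30, 0.55, 0.10, 0.45, 0.25])   # study effect sizes (made up)
v = np.array([0.02, 0.05, 0.01, 0.04, 0.03])   # their sampling variances
q = np.array([0.9, 0.4, 1.0, 0.5, 0.8])        # hypothetical quality scores in (0, 1]

w_iv = 1 / v        # usual inverse-variance weights
w_q = q / v         # quality-adjusted weights
print("inverse-variance mean ES:", round(np.sum(w_iv * d) / np.sum(w_iv), 3))
print("quality-weighted mean ES:", round(np.sum(w_q * d) / np.sum(w_q), 3))
```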
Peer reviewed
Fan, Weihua; Hancock, Gregory R. – Journal of Educational and Behavioral Statistics, 2012
This study proposes robust means modeling (RMM) approaches for hypothesis testing of mean differences for between-subjects designs in order to control the biasing effects of nonnormality and variance inequality. Drawing from structural equation modeling (SEM), the RMM approaches make no assumption of variance homogeneity and employ robust…
Descriptors: Robustness (Statistics), Hypothesis Testing, Monte Carlo Methods, Simulation
Peer reviewed
Zhang, Jinming – Journal of Educational and Behavioral Statistics, 2012
The impact of uncertainty about item parameters on test information functions is investigated. The information function of a test is one of the most important tools in item response theory (IRT). Inaccuracy in the estimation of test information can have substantial consequences on data analyses based on IRT. In this article, the major part (called…
Descriptors: Item Response Theory, Tests, Accuracy, Data Analysis
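For the 2PL, the test information function is I(θ) = Σ_j a_j² P_j(θ)Q_j(θ). The sketch below contrasts the function evaluated at item-parameter point estimates with its average over simulated parameter draws, as a crude way to see how estimation error propagates; the draws and their standard errors are assumed, and this is not the decomposition derived in the article.

```python
# Test information function (2PL): I(theta) = sum_j a_j^2 P_j Q_j,
# at point estimates vs. averaged over simulated item-parameter draws.
import numpy as np

rng = np.random.default_rng(5)
a_hat, b_hat = rng.uniform(0.8, 2.0, 40), rng.normal(size=40)  # "estimated" parameters
se_a, se_b = 0.08, 0.10                                        # assumed standard errors
theta = np.linspace(-3, 3, 7)

def tif(a, b):
    p = 1 / (1 + np.exp(-a[:, None] * (theta[None, :] - b[:, None])))
    return (a[:, None] ** 2 * p * (1 - p)).sum(axis=0)

tif_point = tif(a_hat, b_hat)
tif_draws = np.mean(
    [tif(a_hat + rng.normal(0, se_a, 40), b_hat + rng.normal(0, se_b, 40))
     for _ in range(200)], axis=0)

for th, t0, t1 in zip(theta, tif_point, tif_draws):
    print(f"theta={th:+.1f}  TIF at estimates={t0:6.2f}  averaged over draws={t1:6.2f}")
```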
Peer reviewed
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
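A generic Nadaraya-Watson (Gaussian-kernel) estimate of an item response curve, smoothing item scores against the observed rest score; that observed score is the error-contaminated regressor whose effect on bias the article analyzes. Data are simulated from a Rasch model, and the bandwidth is arbitrary.

```python
# Nadaraya-Watson kernel estimate of an item response curve: the probability
# of a correct response smoothed against the observed rest score (total on
# the other items). Simulated Rasch data for illustration.
import numpy as np

rng = np.random.default_rng(6)
n, J = 2000, 30
theta = rng.normal(size=n)
b = rng.normal(size=J)
X = rng.binomial(1, 1 / (1 + np.exp(-(theta[:, None] - b[None, :]))))

item = 0
y = X[:, item]
rest = X[:, np.arange(J) != item].sum(axis=1)     # observed rest score

def nw(grid, x, y, h=2.0):
    """Gaussian-kernel Nadaraya-Watson regression of y on x."""
    K = np.exp(-0.5 * ((grid[:, None] - x[None, :]) / h) ** 2)
    return (K @ y) / K.sum(axis=1)

grid = np.linspace(rest.min(), rest.max(), 6)
for s, p in zip(grid, nw(grid, rest, y)):
    print(f"rest score {s:5.1f}: estimated P(correct) = {p:.2f}")
```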
Peer reviewed
Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008
In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…
Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation
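One way to make the guideline concrete is to compare, by simulation, the proportional reduction in mean squared error (PRMSE) for the true subscore achieved by a linear predictor based on the observed subscore versus one based on the observed total score; the subscore is worth reporting when the former wins. The generating values below are hypothetical, and Haberman's paper works with closed-form expressions rather than simulation.

```python
# Monte Carlo illustration of the mean-squared-error guideline for subscores:
# compare PRMSE of best linear predictors of the true subscore from
# (a) the observed subscore and (b) the observed total score.
import numpy as np

rng = np.random.default_rng(7)
n = 20000
true_sub = rng.normal(0, 1, n)                      # true subscore
true_rest = 0.8 * true_sub + rng.normal(0, 0.6, n)  # true score on remaining items
obs_sub = true_sub + rng.normal(0, 0.9, n)          # observed (noisy) subscore
obs_total = obs_sub + true_rest + rng.normal(0, 0.7, n)

def prmse(estimate):
    return 1 - np.mean((true_sub - estimate) ** 2) / np.var(true_sub)

def blp(z):
    """Best linear predictor of the true subscore from observed score z."""
    beta = np.cov(true_sub, z)[0, 1] / np.var(z)
    return beta * (z - z.mean()) + true_sub.mean()

print("PRMSE from observed subscore:", round(prmse(blp(obs_sub)), 3))
print("PRMSE from observed total:   ", round(prmse(blp(obs_total)), 3))
# In this toy configuration the total-based estimate does better, so the
# subscore would not be worth reporting on its own.
```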
Pages: 1 | 2