ERIC - Search Results

Showing all 9 results

Adaptive Pairwise Comparison for Educational Measurement

Peer reviewed

Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020

Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…

Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement

Statistical Power for Estimating Treatment Effects Using Difference-in-Differences and Comparative Interrupted Time Series Estimators with Variation in Treatment Timing

Peer reviewed

Schochet, Peter Z. – Journal of Educational and Behavioral Statistics, 2022

This article develops new closed-form variance expressions for power analyses for commonly used difference-in-differences (DID) and comparative interrupted time series (CITS) panel data estimators. The main contribution is to incorporate variation in treatment timing into the analysis. The power formulas also account for other key design features…

Descriptors: Comparative Analysis, Statistical Analysis, Sample Size, Measurement Techniques

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Does the Package Matter? A Comparison of Five Common Multilevel Modeling Software Packages

Peer reviewed

McCoach, D. Betsy; Rifenbark, Graham G.; Newton, Sarah D.; Li, Xiaoran; Kooken, Janice; Yomtov, Dani; Gambino, Anthony J.; Bellara, Aarti – Journal of Educational and Behavioral Statistics, 2018

This study compared five common multilevel software packages via Monte Carlo simulation: HLM 7, Mplus 7.4, R (lme4 V1.1-12), Stata 14.1, and SAS 9.4 to determine how the programs differ in estimation accuracy and speed, as well as convergence, when modeling multiple randomly varying slopes of different magnitudes. Simulated data…

Descriptors: Hierarchical Linear Modeling, Computer Software, Comparative Analysis, Monte Carlo Methods

Estimation of Contextual Effects through Nonlinear Multilevel Latent Variable Modeling with a Metropolis-Hastings Robbins-Monro Algorithm

Peer reviewed

Yang, Ji Seung; Cai, Li – Journal of Educational and Behavioral Statistics, 2014

The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM). Results indicate that the MH-RM algorithm can produce estimates and standard…

Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect

Nonparametric Item Response Curve Estimation with Correction for Measurement Error

Peer reviewed

Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011

Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…

Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement

Modification of the Mantel-Haenszel and Logistic Regression DIF Procedures to Incorporate the SIBTEST Regression Correction

Peer reviewed

DeMars, Christine E. – Journal of Educational and Behavioral Statistics, 2009

The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes. When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…

Descriptors: Regression (Statistics), Test Bias, Error of Measurement, True Scores

Standard Error Estimation of 3PL IRT True Score Equating with an MCMC Method

Peer reviewed

Liu, Yuming; Schulz, E. Matthew; Yu, Lei – Journal of Educational and Behavioral Statistics, 2008

A Markov chain Monte Carlo (MCMC) method and a bootstrap method were compared in the estimation of standard errors of item response theory (IRT) true score equating. Three test form relationships were examined: parallel, tau-equivalent, and congeneric. Data were simulated based on Reading Comprehension and Vocabulary tests of the Iowa Tests of…

Descriptors: Reading Comprehension, Test Format, Markov Processes, Educational Testing

Controlling Error in Multiple Comparisons, with Examples from State-to-State Differences in Educational Achievement

Peer reviewed

Williams, Valerie S. L.; Jones, Lyle V.; Tukey, John W. – Journal of Educational and Behavioral Statistics, 1999

Illustrates and compares three alternative procedures to adjust significance levels for multiplicity: (1) the traditional Bonferroni technique; (2) a sequential Bonferroni technique; and (3) a sequential approach to control the false discovery rate proposed by Y. Benjamini and Y. Hochberg (1995). Explains advantages of the Benjamini and Hochberg…

Descriptors: Academic Achievement, Comparative Analysis, Error of Measurement, Statistical Significance
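
The three multiplicity adjustments named in the abstract above can be sketched roughly as follows. This is an illustration only, not code from the article: the function names and the example p-values are made up here, and the "sequential Bonferroni" is assumed to be Holm's step-down procedure, which may differ from the exact variant the authors use.

```python
def bonferroni(pvals, alpha=0.05):
    """Reject H_i when p_i <= alpha / m (controls the familywise error rate)."""
    m = len(pvals)
    return [p <= alpha / m for p in pvals]


def holm(pvals, alpha=0.05):
    """Step-down (sequential) Bonferroni, assumed here to be Holm's procedure.

    Walk through the p-values from smallest to largest, comparing the
    k-th smallest (k = 0, 1, ...) with alpha / (m - k); stop at the first failure.
    """
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    reject = [False] * m
    for k, i in enumerate(order):
        if pvals[i] <= alpha / (m - k):
            reject[i] = True
        else:
            break
    return reject


def benjamini_hochberg(pvals, alpha=0.05):
    """Step-up procedure controlling the false discovery rate.

    Find the largest rank k with p_(k) <= k * alpha / m and reject the
    hypotheses with the k smallest p-values.
    """
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    k_max = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= rank * alpha / m:
            k_max = rank
    reject = [False] * m
    for i in order[:k_max]:
        reject[i] = True
    return reject


# Hypothetical p-values, chosen only so that the three procedures disagree.
example_pvals = [0.03, 0.005, 0.04, 0.015]
print(bonferroni(example_pvals))
print(holm(example_pvals))
print(benjamini_hochberg(example_pvals))
```

On these made-up values the plain Bonferroni rule rejects the fewest hypotheses, the step-down rule rejects one more, and the false discovery rate procedure rejects the most, which is the general ordering of stringency among the three.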
