Showing all 8 results
Peer reviewed
Giada Spaccapanico Proietti; Mariagiulia Matteucci; Stefania Mignani; Bernard P. Veldkamp – Journal of Educational and Behavioral Statistics, 2024
Classical automated test assembly (ATA) methods assume fixed and known coefficients for the constraints and the objective function. This hypothesis is not true for the estimates of item response theory parameters, which are crucial elements in test assembly classical models. To account for uncertainty in ATA, we propose a chance-constrained…
Descriptors: Automation, Computer Assisted Testing, Ambiguity (Context), Item Response Theory
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2018
Wollack, Cohen, and Eckerly suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This article suggests two modifications of the EDI for…
Descriptors: Deception, Identification, Testing Problems, Cheating
Peer reviewed
Si, Yajuan; Reiter, Jerome P. – Journal of Educational and Behavioral Statistics, 2013
In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian,…
Descriptors: Nonparametric Statistics, Bayesian Statistics, Measurement, Evaluation Methods
Peer reviewed
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the χ² statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Peer reviewed
Verkuilen, Jay; Smithson, Michael – Journal of Educational and Behavioral Statistics, 2012
Doubly bounded continuous data are common in the social and behavioral sciences. Examples include judged probabilities, confidence ratings, derived proportions such as percent time on task, and bounded scale scores. Dependent variables of this kind are often difficult to analyze using normal theory models because their distributions may be quite…
Descriptors: Responses, Regression (Statistics), Statistical Analysis, Models
Peer reviewed
Berkhof, Johannes; Snijders, Tom A. B. – Journal of Educational and Behavioral Statistics, 2001
Describes available variance component tests and presents three new score tests. One test uses the asymptotic normal distribution of the test statistic as a reference distribution; the others use a Satterthwaite approximation for the null distribution of the test statistic. Evaluates the performance of these tests through Monte Carlo simulation.…
Descriptors: Models, Monte Carlo Methods, Simulation, Statistical Distributions
Peer reviewed
Toothaker, Larry E.; Newman, De – Journal of Educational and Behavioral Statistics, 1994
Compared the analysis of variance (ANOVA) "F" and several nonparametric competitors for two-way designs for empirical alpha and power through simulation. Results suggest the ANOVA "F" suffers from conservative alpha and power for the mixed normal distribution, but is generally recommended. (Author/SLD)
Descriptors: Analysis of Variance, Nonparametric Statistics, Simulation, Statistical Distributions
Peer reviewed
Bloxom, Bruce; And Others – Journal of Educational and Behavioral Statistics, 1995
Develops and evaluates the linkage of the Armed Services Vocational Aptitude Battery to the mathematics scale of the National Assessment of Educational Progress. The accuracy of the proficiency distribution estimated from the projection was close to the accuracy of the distribution estimated from the large scale assessment. (SLD)
Descriptors: Educational Assessment, Estimation (Mathematics), Evaluation Methods, Mathematics Tests