Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Comparative Analysis | 8 |
Probability | 8 |
Statistical Analysis | 4 |
Ability | 2 |
Computation | 2 |
Effect Size | 2 |
Models | 2 |
Monte Carlo Methods | 2 |
Scores | 2 |
Accuracy | 1 |
Bayesian Statistics | 1 |
More ▼ |
Source
Journal of Educational and… | 8 |
Author
Feinberg, Richard A. | 1 |
Garcia-Perez, Miguel A. | 1 |
Ho, Andrew Dean | 1 |
Hong, Guanglei | 1 |
Jan, Show-Li | 1 |
Monroe, Scott | 1 |
Qin, Xu | 1 |
Reckase, Mark D. | 1 |
Shieh, Gwowen | 1 |
Spray, Judith A. | 1 |
Tipton, Elizabeth | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Evaluative | 4 |
Reports - Research | 4 |
Education Level
Elementary Secondary Education | 1 |
Audience
Location
California (Riverside) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 2 |
What Works Clearinghouse Rating
Feinberg, Richard A.; von Davier, Matthias – Journal of Educational and Behavioral Statistics, 2020
The literature showing that subscores fail to add value is vast; yet despite their typical redundancy and the frequent presence of substantial statistical errors, many stakeholders remain convinced of their necessity. This article describes a method for identifying and reporting unexpectedly high or low subscores by comparing each examinee's…
Descriptors: Scores, Probability, Statistical Distributions, Ability
Hong, Guanglei; Qin, Xu; Yang, Fan – Journal of Educational and Behavioral Statistics, 2018
Through a sensitivity analysis, the analyst attempts to determine whether a conclusion of causal inference could be easily reversed by a plausible violation of an identification assumption. Analytic conclusions that are harder to alter by such a violation are expected to add a higher value to scientific knowledge about causality. This article…
Descriptors: Statistical Inference, Probability, Statistical Bias, Statistical Analysis
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019
In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…
Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences
Tipton, Elizabeth – Journal of Educational and Behavioral Statistics, 2014
Although a large-scale experiment can provide an estimate of the average causal impact for a program, the sample of sites included in the experiment is often not drawn randomly from the inference population of interest. In this article, we provide a generalizability index that can be used to assess the degree of similarity between the sample of…
Descriptors: Experiments, Comparative Analysis, Experimental Groups, Generalization
Jan, Show-Li; Shieh, Gwowen – Journal of Educational and Behavioral Statistics, 2014
The analysis of variance (ANOVA) is one of the most frequently used statistical analyses in practical applications. Accordingly, the single and multiple comparison procedures are frequently applied to assess the differences among mean effects. However, the underlying assumption of homogeneous variances may not always be tenable. This study…
Descriptors: Sample Size, Statistical Analysis, Computation, Probability
Garcia-Perez, Miguel A. – Journal of Educational and Behavioral Statistics, 2010
A recent comparative analysis of alternative interval estimation approaches and procedures has shown that confidence intervals (CIs) for true raw scores determined with the Score method--which uses the normal approximation to the binomial distribution--have actual coverage probabilities that are closest to their nominal level. It has also recently…
Descriptors: Computation, Statistical Analysis, True Scores, Raw Scores
Ho, Andrew Dean – Journal of Educational and Behavioral Statistics, 2009
Problems of scale typically arise when comparing test score trends, gaps, and gap trends across different tests. To overcome some of these difficulties, test score distributions on the same score scale can be represented by nonparametric graphs or statistics that are invariant under monotone scale transformations. This article motivates and then…
Descriptors: Nonparametric Statistics, Comparative Analysis, Trend Analysis, Scores

Spray, Judith A.; Reckase, Mark D. – Journal of Educational and Behavioral Statistics, 1996
Two procedures for classifying examinees into categories, one based on the sequential probability ratio test (SPRT) and the other on sequential Bayes methodology, were compared to determine which required fewer items for classification. Results showed that the SPRT procedure requires fewer items to achieve the same accuracy level. (SLD)
Descriptors: Ability, Bayesian Statistics, Classification, Comparative Analysis