Luo, Yong; Jiao, Hong – Educational and Psychological Measurement, 2018
Stan is a new Bayesian statistical software program that implements the powerful and efficient Hamiltonian Monte Carlo (HMC) algorithm. To date there is not a source that systematically provides Stan code for various item response theory (IRT) models. This article provides Stan code for three representative IRT models, including the…
Descriptors: Bayesian Statistics, Item Response Theory, Probability, Computer Software

Raykov, Tenko; Marcoulides, George A.; Lee, Chun-Lung; Chang, Chi – Educational and Psychological Measurement, 2013
This note is concerned with a latent variable modeling approach for the study of differential item functioning in a multigroup setting. A multiple-testing procedure that can be used to evaluate group differences in response probabilities on individual items is discussed. The method is readily employed when the aim is also to locate possible…
Descriptors: Test Bias, Statistical Analysis, Models, Hypothesis Testing
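
The abstract above does not name the multiple-testing procedure it discusses. As an illustration only, the sketch below applies the Benjamini-Hochberg false discovery rate procedure to a set of per-item p-values; the choice of this procedure, the function name `benjamini_hochberg`, and the example p-values are all assumptions, not details taken from the article.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans: True where the hypothesis is rejected.

    Assumption: Benjamini-Hochberg is used here as a stand-in; the
    abstract does not say which multiple-testing procedure is applied.
    """
    m = len(p_values)
    # Indices of the p-values, sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k with p_(k) <= k * alpha / m.
    largest_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * alpha / m:
            largest_rank = rank

    # Reject every hypothesis ranked at or below that k.
    rejected = [False] * m
    for idx in order[:largest_rank]:
        rejected[idx] = True
    return rejected


# Hypothetical per-item p-values for group differences on four items.
item_p_values = [0.001, 0.04, 0.03, 0.5]
flags = benjamini_hochberg(item_p_values, alpha=0.05)
```

In this hypothetical run only the first item would be flagged, since 0.001 falls below its threshold of 0.0125 while no larger p-value falls below its own threshold.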

Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1997
A FORTRAN subroutine is presented to calculate a generalized measure of agreement between multiple raters and a set of correct responses at any level of measurement and among multiple responses, along with the associated probability value, under the null hypothesis. (Author)
Descriptors: Computer Software, Interrater Reliability, Measurement Techniques, Probability

Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1987
Subroutines to calculate exact chi-square and Fisher's exact probability tests are presented for 3 by 2 cross-classification tables. A nondirectional probability value for each test is computed recursively. (Author/GDC)
Descriptors: Computer Software, Probability, Research Design, Statistical Significance
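
To make the idea of a nondirectional exact probability concrete, the following sketch computes a Fisher-type exact p-value for a 3 by 2 table. It is not the authors' subroutine: it enumerates every table with the observed margins instead of using a recursion, and the function names and the example table are hypothetical.

```python
from fractions import Fraction
from math import comb


def table_probability(table):
    """Hypergeometric probability of a 3x2 table with fixed margins."""
    row_totals = [a + b for a, b in table]
    col1_total = sum(a for a, _ in table)
    n = sum(row_totals)
    numerator = 1
    for (a, _), r in zip(table, row_totals):
        numerator *= comb(r, a)
    return Fraction(numerator, comb(n, col1_total))


def fisher_exact_3x2(table):
    """Sum the probabilities of all tables no more likely than the observed one.

    Assumption: brute-force enumeration, not the recursive scheme
    mentioned in the abstract.
    """
    r1, r2, r3 = (a + b for a, b in table)
    col1_total = sum(a for a, _ in table)
    p_observed = table_probability(table)

    p_value = Fraction(0)
    for a1 in range(min(r1, col1_total) + 1):
        for a2 in range(min(r2, col1_total - a1) + 1):
            a3 = col1_total - a1 - a2
            if a3 > r3:
                continue  # third row cannot hold this many counts
            candidate = [(a1, r1 - a1), (a2, r2 - a2), (a3, r3 - a3)]
            p = table_probability(candidate)
            if p <= p_observed:
                p_value += p
    return p_value


# Hypothetical 3x2 table: three rows, two columns.
observed = [(2, 0), (0, 2), (1, 1)]
p = fisher_exact_3x2(observed)
```

Exact rational arithmetic (`Fraction`) is used so that the comparison `p <= p_observed` is not affected by floating-point rounding when two tables are equally likely.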

Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1997
Describes a FORTRAN software program that calculates the probability of an observed difference between agreement measures obtained from two independent sets of raters. An example illustrates the use of the DIFFER program in evaluating undergraduate essays. (Author/SLD)
Descriptors: Comparative Analysis, Computer Software, Evaluation Methods, Higher Education

Edgington, Eugene S.; Haller, Otto – Educational and Psychological Measurement, 1984
This paper explains how to combine probabilities from discrete distributions, such as probability distributions for nonparametric tests. (Author/BW)
Descriptors: Computer Software, Data Analysis, Hypothesis Testing, Mathematical Formulas

Aiken, Lewis R. – Educational and Psychological Measurement, 1985
Three numerical coefficients for analyzing the validity and reliability of ratings are described. Each coefficient is computed as the ratio of an obtained to a maximum sum of differences in ratings. The coefficients are also applicable to the item analysis, agreement analysis, and cluster or factor analysis of rating-scale data. (Author/BW)
Descriptors: Computer Software, Data Analysis, Factor Analysis, Item Analysis
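
A coefficient of the kind described above, a ratio of an obtained to a maximum sum of differences, can be sketched as follows. The form used here, in which each rating is measured from the lowest scale point and the sum is divided by its largest possible value, is an assumption about the validity coefficient; the function name `aiken_v` and the example ratings are hypothetical.

```python
def aiken_v(ratings, lo, hi):
    """Ratio of the obtained to the maximum sum of (rating - lo).

    Assumption: ratings are integers on a scale from lo to hi, and the
    coefficient takes the form S / (n * (hi - lo)).
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    if any(r < lo or r > hi for r in ratings):
        raise ValueError("every rating must lie between lo and hi")

    obtained = sum(r - lo for r in ratings)   # obtained sum of differences
    maximum = len(ratings) * (hi - lo)        # largest possible sum
    return obtained / maximum


# Hypothetical example: four raters judge one item on a 1-to-5 scale.
v = aiken_v([5, 4, 5, 3], lo=1, hi=5)
```

Under this form the coefficient ranges from 0, when every rater gives the lowest rating, to 1, when every rater gives the highest.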