ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	2

Source

Educational and Psychological…

Author

Berry, Kenneth J.	3
Mielke, Paul W., Jr.	3
Aiken, Lewis R.	1
Chang, Chi	1
Edgington, Eugene S.	1
Haller, Otto	1
Jiao, Hong	1
Lee, Chun-Lung	1
Luo, Yong	1
Marcoulides, George A.	1
Raykov, Tenko	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	4
Reports - Evaluative	2
Book/Product Reviews	1
Numerical/Quantitative Data	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Using the Stan Program for Bayesian Item Response Theory

Peer reviewed

Direct link

Luo, Yong; Jiao, Hong – Educational and Psychological Measurement, 2018

Stan is a new Bayesian statistical software program that implements the powerful and efficient Hamiltonian Monte Carlo (HMC) algorithm. To date there is not a source that systematically provides Stan code for various item response theory (IRT) models. This article provides Stan code for three representative IRT models, including the…

Descriptors: Bayesian Statistics, Item Response Theory, Probability, Computer Software

Studying Differential Item Functioning via Latent Variable Modeling: A Note on a Multiple-Testing Procedure

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Lee, Chun-Lung; Chang, Chi – Educational and Psychological Measurement, 2013

This note is concerned with a latent variable modeling approach for the study of differential item functioning in a multigroup setting. A multiple-testing procedure that can be used to evaluate group differences in response probabilities on individual items is discussed. The method is readily employed when the aim is also to locate possible…

Descriptors: Test Bias, Statistical Analysis, Models, Hypothesis Testing

Measuring the Joint Agreement between Multiple Raters and a Standard.

Peer reviewed

Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1997

A FORTRAN subroutine is presented to calculate a generalized measure of agreement between multiple raters and a set of correct responses at any level of measurement and among multiple responses, along with the associated probability value, under the null hypothesis. (Author)

Descriptors: Computer Software, Interrater Reliability, Measurement Techniques, Probability

Exact Chi-Square and Fisher's Exact Probability Test for 3 by 2 Cross-Classification Tables.

Peer reviewed

Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1987

Subroutines to calculate exact chi square and Fisher's exact probability tests are presented for 3 by 2 cross-classification tables. A nondirectional probability value for each test is computed recursively. (Author/GDC)

Descriptors: Computer Software, Probability, Research Design, Statistical Significance

Agreement Measure Comparisons between Two Independent Sets of Raters.

Peer reviewed

Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1997

Describes a FORTRAN software program that calculates the probability of an observed difference between agreement measures obtained from two independent sets of raters. An example illustrates the use of the DIFFER program in evaluating undergraduate essays. (Author/SLD)

Descriptors: Comparative Analysis, Computer Software, Evaluation Methods, Higher Education

Combining Probabilities from Discrete Probability Distributions.

Peer reviewed

Edgington, Eugene S.; Haller, Otto – Educational and Psychological Measurement, 1984

This paper explains how to combine probabilities from discrete distributions, such as probability distributions for nonparametric tests. (Author/BW)

Descriptors: Computer Software, Data Analysis, Hypothesis Testing, Mathematical Formulas

Three Coefficients for Analyzing the Reliability and Validity of Ratings.

Peer reviewed

Aiken, Lewis R. – Educational and Psychological Measurement, 1985

Three numerical coefficients for analyzing the validity and reliability of ratings are described. Each coefficient is computed as the ratio of an obtained to a maximum sum of differences in ratings. The coefficients are also applicable to the item analysis, agreement analysis, and cluster or factor analysis of rating-scale data. (Author/BW)

Descriptors: Computer Software, Data Analysis, Factor Analysis, Item Analysis

Computer Software	7
Probability	7
Comparative Analysis	2
Data Analysis	2
Hypothesis Testing	2
Interrater Reliability	2
Statistical Analysis	2
Statistical Significance	2
Bayesian Statistics	1
Differences	1
Equations (Mathematics)	1
Evaluation Methods	1
Factor Analysis	1
Groups	1
Higher Education	1
Item Analysis	1
Item Response Theory	1
Markov Processes	1
Mathematical Formulas	1
Mathematical Models	1
Mathematics	1
Measurement Techniques	1
Models	1
Monte Carlo Methods	1
Nonparametric Statistics	1
More ▼