Kang, Taehoon; Petersen, Nancy S. – ACT, Inc., 2009
This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord (1983) characteristic curve method…
Descriptors: Standards, Testing Programs, Test Items, Statistical Distributions

Zwick, Rebecca; Thayer, Dorothy T.; Lewis, Charles – Journal of Educational Measurement, 1999
Developed an empirical Bayes enhancement to Mantel-Haenszel (MH) analysis of differential item functioning (DIF) in which it is assumed that the MH statistics are normally distributed and that the prior distribution of underlying DIF parameters is also normal. (Author/SLD)
Descriptors: Bayesian Statistics, Item Bias, Statistical Distributions, Test Items
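As an illustration of the normal-normal model this abstract describes, the following is a minimal sketch, not the authors' implementation. It assumes the prior mean and variance are estimated by simple moments across items (an assumption made here for illustration); the function name `eb_shrink` and its inputs are hypothetical.

```python
def eb_shrink(mh_stats, mh_vars):
    """Shrink per-item MH DIF statistics toward a common prior mean.

    mh_stats: observed MH DIF statistics, one per item.
    mh_vars:  their squared standard errors (sampling variances).
    Returns a list of (posterior_mean, posterior_variance) tuples.
    """
    n = len(mh_stats)
    prior_mean = sum(mh_stats) / n
    # Assumption: prior variance = observed variance of the statistics
    # minus the average sampling variance, floored at zero.
    observed_var = sum((x - prior_mean) ** 2 for x in mh_stats) / (n - 1)
    prior_var = max(observed_var - sum(mh_vars) / n, 0.0)

    results = []
    for x, s2 in zip(mh_stats, mh_vars):
        total = prior_var + s2
        # Weight on the observed statistic; 0 means full shrinkage.
        w = prior_var / total if total > 0 else 0.0
        post_mean = w * x + (1 - w) * prior_mean
        post_var = w * s2  # equals prior_var * s2 / (prior_var + s2)
        results.append((post_mean, post_var))
    return results
```

Items with large sampling variance are pulled more strongly toward the prior mean, which is the usual motivation for this kind of enhancement when group sizes are small.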

van der Linden, Wim J. – Psychometrika, 1998
Dichotomous item response theory (IRT) models can be viewed as families of stochastically ordered distributions of responses to test items. This paper explores several properties of such distributions, especially those related to transfer to other distributions. Results are formulated as a series of theorems and corollaries that apply to…
Descriptors: Item Response Theory, Responses, Statistical Distributions, Test Items

Zeng, Lingjia – Applied Psychological Measurement, 1997
Proposes a marginal Bayesian estimation procedure to improve item parameter estimates for the three parameter logistic model. Computer simulation suggests that implementing the marginal Bayesian estimation algorithm with four-parameter beta prior distributions and then updating the priors with empirical means of updated intermediate estimates can…
Descriptors: Algorithms, Bayesian Statistics, Estimation (Mathematics), Statistical Distributions

Monahan, Patrick – 2000
Previous studies that investigated the effect of unequal ability distributions on the Type I error (TIE) of the Mantel-Haenszel chi-square test for detecting differential item functioning (DIF) simulated ability distributions that differed only in means. This simulation study suggests that the magnitude of TIE inflation is increased, and the type…
Descriptors: Ability, Chi Square, Item Bias, Simulation

Enders, Craig K.; Bandalos, Deborah L. – Applied Measurement in Education, 1999
Examined the degree to which coefficient alpha is affected by including items with different distribution shapes within a unidimensional scale. Computer simulation results indicate that reliability does not increase dramatically as a result of using differentially shaped items within a scale. Discusses implications for test construction. (SLD)
Descriptors: Computer Simulation, Reliability, Scaling, Statistical Distributions

Oshima, T. C.; Davey, T. C. – 1994
This paper evaluated multidimensional linking procedures with which multidimensional test data from two separate calibrations were put on a common scale. Data were simulated with known ability distributions varying on two factors which made linking necessary: mean vector differences and variance-covariance (v-c) matrix differences. After the…
Descriptors: Ability, Estimation (Mathematics), Evaluation Methods, Matrices

Chang, Shun-Wen; Twu, Bor-Yaun – 2001
To satisfy the security requirements of computerized adaptive tests (CATs), efforts have been made to control the exposure rates of optimal items directly by incorporating statistical methods into the item selection procedure. Since differences are likely to occur between the exposure control parameter derivation stage and the operational CAT…
Descriptors: Adaptive Testing, Computer Assisted Testing, Selection, Simulation

Baker, Frank B. – Applied Psychological Measurement, 1996
Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…
Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions

Bedrick, Edward J. – Psychometrika, 1997
A simple approximation to the conditional distribution of goodness-of-fit statistics for the Rasch model is presented that is used when item difficulties are known. The approximation, which is easily programmed, gives relatively accurate assessments of conditional p-values for tests of 10 or more items. (Author/SLD)
Descriptors: Difficulty Level, Goodness of Fit, Item Response Theory, Statistical Distributions

Seol, Hyunsoo – Journal of Outcome Measurement, 1999
Examined five Rasch-model-based item-fit indices in terms of their distributional properties and the power of detecting item bias or differential item functioning. Results indicate that, although these five standardized item-fit indices did not depart significantly from a normal distribution, the Type I error rates were not reasonable. (Author/SLD)
Descriptors: Goodness of Fit, Item Bias, Item Response Theory, Statistical Distributions

Monaco, Malina – 1997
The effects of skewed theta distributions on indices of differential item functioning (DIF) were studied, comparing Mantel-Haenszel (N. Mantel and W. Haenszel, 1959) and DFIT (N. S. Raju, W. J. van der Linden, and P. F. Fleer) (noncompensatory DIF). The significance of the study is that in educational and psychological data, the distributions one…
Descriptors: Ability, Estimation (Mathematics), Item Bias, Monte Carlo Methods

Livingston, Samuel A.; Lewis, Charles – Journal of Educational Measurement, 1995
A method is presented for estimating the accuracy and consistency of classifications based on test scores. The reliability of the score is used to estimate effective test length in terms of discrete items. The true-score distribution is estimated by fitting a four-parameter beta model. (SLD)
Descriptors: Classification, Estimation (Mathematics), Scores, Statistical Distributions

Hanson, Bradley A.; And Others – Applied Psychological Measurement, 1993
The delta method was used to derive standard errors (SEs) of the Levine observed score and Levine true score linear test equating methods using data from two test forms. SEs derived without the normality assumption and bootstrap SEs were very close. The situation with skewed score distributions is also discussed. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Error of Measurement, Sampling

Mazor, Kathleen M.; And Others – 1993
The Mantel-Haenszel (MH) procedure has become one of the most popular procedures for detecting differential item functioning (DIF). One of the most troublesome criticisms of this procedure is that while detection rates for uniform DIF are very good, the procedure is not sensitive to non-uniform DIF. In this study, examinee responses were generated…
Descriptors: Comparative Testing, Computer Simulation, Item Bias, Item Response Theory
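
For readers unfamiliar with the Mantel-Haenszel procedure referenced in several records above, the sketch below computes the MH common odds ratio across matched score strata and its transformation to the delta-scale index (MH D-DIF = -2.35 ln α). It is a minimal illustration under stated assumptions, not code from any of the listed studies; the function names and the `(a, b, c, d)` table layout are hypothetical.

```python
import math

def mh_odds_ratio(tables):
    """Mantel-Haenszel common odds ratio for one item.

    tables: one (a, b, c, d) tuple per matched score stratum, where
      a = reference group correct,  b = reference group incorrect,
      c = focal group correct,      d = focal group incorrect.
    Raises ZeroDivisionError if every stratum has b * c == 0.
    """
    num = 0.0
    den = 0.0
    for a, b, c, d in tables:
        n = a + b + c + d
        if n == 0:
            continue  # skip empty strata
        num += a * d / n
        den += b * c / n
    return num / den

def mh_d_dif(tables):
    """Delta-scale DIF index; negative values indicate the item
    favors the reference group."""
    return -2.35 * math.log(mh_odds_ratio(tables))
```

Because the odds ratio is pooled under the assumption that it is constant across strata, an index like this mainly reflects uniform DIF, which is consistent with the criticism noted in the Mazor abstract.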