Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 56 |
| Since 2017 (last 10 years) | 108 |
| Since 2007 (last 20 years) | 255 |
Descriptor
| Sample Size | 404 |
| Simulation | 404 |
| Item Response Theory | 113 |
| Statistical Analysis | 92 |
| Error of Measurement | 86 |
| Models | 84 |
| Test Items | 80 |
| Comparative Analysis | 77 |
| Monte Carlo Methods | 75 |
| Correlation | 67 |
| Evaluation Methods | 64 |
| More ▼ | |
Source
Author
| Fan, Xitao | 7 |
| Beretvas, S. Natasha | 5 |
| Algina, James | 4 |
| Chan, Wai | 4 |
| Cohen, Allan S. | 4 |
| De Champlain, Andre | 4 |
| Finch, W. Holmes | 4 |
| French, Brian F. | 4 |
| Kim, Seock-Ho | 4 |
| Kromrey, Jeffrey D. | 4 |
| Paek, Insu | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Teachers | 4 |
| Researchers | 3 |
Location
| North Carolina | 2 |
| Armenia | 1 |
| Austria | 1 |
| Canada | 1 |
| Florida (Miami) | 1 |
| Hong Kong | 1 |
| Indiana | 1 |
| Iran | 1 |
| Montana | 1 |
| New York (New York) | 1 |
| Norway | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Finch, W. Holmes; French, Brian F. – Educational and Psychological Measurement, 2008
A number of statistical methods exist for the detection of differential item functioning (DIF). The performance of DIF methods has been widely studied and generally found to be effective in the detection of both uniform and nonuniform DIF. Anecdotal reports suggest that these techniques may too often incorrectly detect the presence of one type of…
Descriptors: Test Bias, Simulation, Statistical Analysis, Probability
Jamshidian, Mortaza; Mata, Matthew – Multivariate Behavioral Research, 2008
Incomplete or missing data is a common problem in almost all areas of empirical research. It is well known that simple and ad hoc methods such as complete case analysis or mean imputation can lead to biased and/or inefficient estimates. The method of maximum likelihood works well; however, when the missing data mechanism is not one of missing…
Descriptors: Structural Equation Models, Simulation, Factor Analysis, Research Methodology
Woods, Carol M. – Multivariate Behavioral Research, 2009
Differential item functioning (DIF) occurs when an item on a test or questionnaire has different measurement properties for 1 group of people versus another, irrespective of mean differences on the construct. This study focuses on the use of multiple-indicator multiple-cause (MIMIC) structural equation models for DIF testing, parameterized as item…
Descriptors: Test Bias, Structural Equation Models, Item Response Theory, Testing
Murphy, Daniel L.; Pituch, Keenan A. – Journal of Experimental Education, 2009
The authors examined the robustness of multilevel linear growth curve modeling to misspecification of an autoregressive moving average process. As previous research has shown (J. Ferron, R. Dailey, & Q. Yi, 2002; O. Kwok, S. G. West, & S. B. Green, 2007; S. Sivo, X. Fan, & L. Witta, 2005), estimates of the fixed effects were unbiased, and Type I…
Descriptors: Sample Size, Computation, Evaluation Methods, Longitudinal Studies
de Winter, J. C. F.; Dodou, D.; Wieringa, P. A. – Multivariate Behavioral Research, 2009
Exploratory factor analysis (EFA) is generally regarded as a technique for large sample sizes ("N"), with N = 50 as a reasonable absolute minimum. This study offers a comprehensive overview of the conditions in which EFA can yield good quality results for "N" below 50. Simulations were carried out to estimate the minimum required "N" for different…
Descriptors: Sample Size, Factor Analysis, Enrollment, Evaluation Methods
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
Maydeu-Olivares, Alberto; Coffman, Donna L.; Hartmann, Wolfgang M. – Psychological Methods, 2007
The point estimate of sample coefficient alpha may provide a misleading impression of the reliability of the test score. Because sample coefficient alpha is consistently biased downward, it is more likely to yield a misleading impression of poor reliability. The magnitude of the bias is greatest precisely when the variability of sample alpha is…
Descriptors: Intervals, Scores, Sample Size, Simulation
Martineau, Joseph A. – Applied Psychological Measurement, 2007
Rudner (2001, 2005) described an expected classification accuracy index for determining the asymptotic expectation of accuracy of classifications of examinees into score categories. This article expands on that exposition by evaluating the index as it is likely to be used in practice (as a point estimate of classification accuracy), provides a…
Descriptors: Classification, Error of Measurement, Sample Size, Measurement
Tong, Ye; Brennan, Robert L. – Educational and Psychological Measurement, 2007
Estimating standard errors of estimated variance components has long been a challenging task in generalizability theory. Researchers have speculated about the potential applicability of the bootstrap for obtaining such estimates, but they have identified problems (especially bias) in using the bootstrap. Using Brennan's bias-correcting procedures…
Descriptors: Error of Measurement, Generalizability Theory, Computation, Simulation
Hertzog, Christopher; von Oertzen, Timo; Ghisletta, Paolo; Lindenberger, Ulman – Structural Equation Modeling: A Multidisciplinary Journal, 2008
We evaluated the statistical power of single-indicator latent growth curve models to detect individual differences in change (variances of latent slopes) as a function of sample size, number of longitudinal measurement occasions, and growth curve reliability. We recommend the 2 degree-of-freedom generalized test assessing loss of fit when both…
Descriptors: Sample Size, Error of Measurement, Individual Differences, Statistical Analysis
Jamshidian, M.; Khatoonabadi, M. – International Journal of Mathematical Education in Science and Technology, 2007
Almost all introductory and intermediate level statistics textbooks include the topic of confidence interval for the population mean. Almost all these texts introduce the median as a robust measure of central tendency. Only a few of these books, however, cover inference on the population median and in particular confidence interval for the median.…
Descriptors: Intervals, Simulation, Computation, Error of Measurement
Anderson, Richard B.; Doherty, Michael E.; Friedrich, Jeff C. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2008
In 4 studies, the authors examined the hypothesis that the structure of the informational environment makes small samples more informative than large ones for drawing inferences about population correlations. The specific purpose of the studies was to test predictions arising from the signal detection simulations of R. B. Anderson, M. E. Doherty,…
Descriptors: Simulation, Statistical Analysis, Inferences, Population Trends
van Abswoude, Alexandra A. H.; Vermunt, Jeroen K.; Hemker, Bas T. – Applied Psychological Measurement, 2007
Mokken scale analysis can be used for scaling under nonparametric item response theory models. The results may, however, not reflect the underlying dimensionality of data. Various features of Mokken scale analysis--the H coefficient, Mokken scale conditions, and algorithms--may explain this result. In this article, three new H-based objective…
Descriptors: Measures (Individuals), Probability, Simulation, Item Response Theory
Mulekar, Madhuri S.; Siegel, Murray H. – Mathematics Teacher, 2009
If students are to understand inferential statistics successfully, they must have a profound understanding of the nature of the sampling distribution. Specifically, they must comprehend the determination of the expected value and standard error of a sampling distribution as well as the meaning of the central limit theorem. Many students in a high…
Descriptors: Statistical Inference, Statistics, Sample Size, Error of Measurement
Peer reviewedTuerlinckx, Francis; De Boeck, Paul – Applied Psychological Measurement, 1999
Conducted a simulation study to determine how well two models for local item dependency, the constant order interaction and the dimension-dependent interaction models, could be distinguished. Results suggest that when there is good reason to suspect a dimension-dependent interaction, it is possible to detect it if the sample size is at least 500.…
Descriptors: Interaction, Models, Sample Size, Simulation

Direct link
