NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Yan Xia; Xinchang Zhou – Educational and Psychological Measurement, 2025
Parallel analysis has been considered one of the most accurate methods for determining the number of factors in factor analysis. One major advantage of parallel analysis over traditional factor retention methods (e.g., Kaiser's rule) is that it addresses the sampling variability of eigenvalues obtained from the identity matrix, representing the…
Descriptors: Factor Analysis, Statistical Analysis, Evaluation Methods, Sampling
Peer reviewed Peer reviewed
Direct linkDirect link
Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025
The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…
Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
McNeish, Daniel; Harring, Jeffrey R. – Educational and Psychological Measurement, 2017
To date, small sample problems with latent growth models (LGMs) have not received the amount of attention in the literature as related mixed-effect models (MEMs). Although many models can be interchangeably framed as a LGM or a MEM, LGMs uniquely provide criteria to assess global data-model fit. However, previous studies have demonstrated poor…
Descriptors: Growth Models, Goodness of Fit, Error Correction, Sampling
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…
Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Peng, Chao-Ying Joanne; Zhu, Jin – Educational and Psychological Measurement, 2008
For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…
Descriptors: Regression (Statistics), Comparative Analysis, Evaluation Methods, Equations (Mathematics)
Peer reviewed Peer reviewed
Direct linkDirect link
Zhang, Bo; Stone, Clement A. – Educational and Psychological Measurement, 2008
This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…
Descriptors: Monte Carlo Methods, Sampling, Goodness of Fit, Evaluation Methods
Peer reviewed Peer reviewed
Rothrock, Julia E.; Michael, William B. – Educational and Psychological Measurement, 1976
Tests constructed through item-sampling (where several tests are constructed and randomly assigned to testees) were compared to a traditional constant-item procedure for evaluating student and class progress in a community college freshman psychology class. The item-sampling approach did not appear to be superior. (JKS)
Descriptors: Evaluation Methods, Item Sampling, Measurement Techniques, Test Validity
Peer reviewed Peer reviewed
Forester, Donald Lee; Michael, William B. – Educational and Psychological Measurement, 1976
Tests constructed through item-sampling (where several tests are constructed and randomly assigned to testees) were compared to a traditional constant-item procedure for evaluating class progress in a community college intermediate algebra class. The item-sampling format appeared to be slightly less valid than the traditional format. (JKS)
Descriptors: Evaluation Methods, Item Sampling, Measurement Techniques, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Wilcox, Rand R. – Educational and Psychological Measurement, 2006
For two random variables, X and Y, let D = X - Y, and let theta[subscript x], theta[subscript y], and theta[subscript d] be the corresponding medians. It is known that the Wilcoxon-Mann-Whitney test and its modern extensions do not test H[subscript o] : theta[subscript x] = theta[subscript y], but rather, they test H[subscript o] : theta[subscript…
Descriptors: Scores, Inferences, Comparative Analysis, Statistical Analysis
Peer reviewed Peer reviewed
Poggio, John P.; Glasnapp, Douglas R. – Educational and Psychological Measurement, 1973
Descriptors: Academic Achievement, Evaluation Methods, Formative Evaluation, Item Sampling
Peer reviewed Peer reviewed
Thompson, Bruce – Educational and Psychological Measurement, 1995
Use of the bootstrap method in a canonical correlation analysis to evaluate the replicability of a study's results is illustrated. More confidence may be vested in research results that replicate. (SLD)
Descriptors: Analysis of Covariance, Correlation, Effect Size, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
De Corte, Wilfried – Educational and Psychological Measurement, 2004
The article describes a Windows program to estimate the expected value and sampling distribution function of the adverse impact ratio for general multistage selections. The results of the program can also be used to predict the risk that a future selection decision will result in an outcome that reflects the presence of adverse impact. The method…
Descriptors: Sampling, Measurement Techniques, Evaluation Methods, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung – Educational and Psychological Measurement, 2004
The Pearson correlation is used to depict effect sizes in the context of item response theory. Amultidimensional Rasch model is used to directly estimate the correlation between latent traits. Monte Carlo simulations were conducted to investigate whether the population correlation could be accurately estimated and whether the bootstrap method…
Descriptors: Test Length, Sampling, Effect Size, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
Kelley, Ken – Educational and Psychological Measurement, 2005
The standardized group mean difference, Cohen's "d", is among the most commonly used and intuitively appealing effect sizes for group comparisons. However, reporting this point estimate alone does not reflect the extent to which sampling error may have led to an obtained value. A confidence interval expresses the uncertainty that exists between…
Descriptors: Intervals, Sampling, Integrity, Effect Size
Peer reviewed Peer reviewed
Rogers, H. Jane; Hambleton, Ronald K. – Educational and Psychological Measurement, 1989
The validity of logistic test models and computer simulation methods for generating sampling distributions of item bias statistics was evaluated under the hypothesis of no item bias. Test data from 937 ninth-grade students were used to develop 7 steps for applying computer-simulated baseline statistics in test development. (SLD)
Descriptors: Computer Simulation, Educational Research, Evaluation Methods, Grade 9