ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Evaluation Methods	15
Sampling	12
Computation	5
Measurement Techniques	5
Research Methodology	5
Simulation	5
Statistical Analysis	5
Correlation	4
Data Analysis	3
Effect Size	3
Goodness of Fit	3
Item Sampling	3
Monte Carlo Methods	3
Test Validity	3
Comparative Analysis	2
Computer Simulation	2
Equations (Mathematics)	2
Error of Measurement	2
Factor Analysis	2
High School Students	2
Item Response Theory	2
Models	2
Sample Size	2
Scores	2
Test Reliability	2
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	12
Reports - Research	6
Reports - Evaluative	5
Speeches/Meeting Papers	2
Reports - Descriptive	1

Education Level

High Schools	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Improving the Use of Parallel Analysis by Accounting for Sampling Variability of the Observed Correlation Matrix

Peer reviewed

Direct link

Yan Xia; Xinchang Zhou – Educational and Psychological Measurement, 2025

Parallel analysis has been considered one of the most accurate methods for determining the number of factors in factor analysis. One major advantage of parallel analysis over traditional factor retention methods (e.g., Kaiser's rule) is that it addresses the sampling variability of eigenvalues obtained from the identity matrix, representing the…

Descriptors: Factor Analysis, Statistical Analysis, Evaluation Methods, Sampling

Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items

Peer reviewed

Direct link

Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025

The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…

Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis

Correcting Model Fit Criteria for Small Sample Latent Growth Models with Incomplete Data

Peer reviewed

Direct link

McNeish, Daniel; Harring, Jeffrey R. – Educational and Psychological Measurement, 2017

To date, small sample problems with latent growth models (LGMs) have not received the amount of attention in the literature as related mixed-effect models (MEMs). Although many models can be interchangeably framed as a LGM or a MEM, LGMs uniquely provide criteria to assess global data-model fit. However, previous studies have demonstrated poor…

Descriptors: Growth Models, Goodness of Fit, Error Correction, Sampling

Scale Reliability Evaluation with Heterogeneous Populations

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015

A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…

Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation

Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression

Peer reviewed

Direct link

Peng, Chao-Ying Joanne; Zhu, Jin – Educational and Psychological Measurement, 2008

For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…

Descriptors: Regression (Statistics), Comparative Analysis, Evaluation Methods, Equations (Mathematics)

Evaluating Item Fit for Multidimensional Item Response Models

Peer reviewed

Direct link

Zhang, Bo; Stone, Clement A. – Educational and Psychological Measurement, 2008

This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…

Descriptors: Monte Carlo Methods, Sampling, Goodness of Fit, Evaluation Methods

The Comparative Validities of Traditional Item-Selection and Item-Sampling Procedures in Evaluation of Student and Class Progress Toward Course Objectives in Freshman Psychology

Peer reviewed

Rothrock, Julia E.; Michael, William B. – Educational and Psychological Measurement, 1976

Tests constructed through item-sampling (where several tests are constructed and randomly assigned to testees) were compared to a traditional constant-item procedure for evaluating student and class progress in a community college freshman psychology class. The item-sampling approach did not appear to be superior. (JKS)

Descriptors: Evaluation Methods, Item Sampling, Measurement Techniques, Test Validity

A Comparison of the Validities of a Traditional Item-Allocation Method and of an Item-Sampling Procedure in the Construction of In-Class Examinations in Intermediate Algebra

Peer reviewed

Forester, Donald Lee; Michael, William B. – Educational and Psychological Measurement, 1976

Tests constructed through item-sampling (where several tests are constructed and randomly assigned to testees) were compared to a traditional constant-item procedure for evaluating class progress in a community college intermediate algebra class. The item-sampling format appeared to be slightly less valid than the traditional format. (JKS)

Descriptors: Evaluation Methods, Item Sampling, Measurement Techniques, Test Validity

A Note on Inferences about the Median of the Distribution of Difference Scores

Peer reviewed

Direct link

Wilcox, Rand R. – Educational and Psychological Measurement, 2006

For two random variables, X and Y, let D = X - Y, and let theta[subscript x], theta[subscript y], and theta[subscript d] be the corresponding medians. It is known that the Wilcoxon-Mann-Whitney test and its modern extensions do not test H[subscript o] : theta[subscript x] = theta[subscript y], but rather, they test H[subscript o] : theta[subscript…

Descriptors: Scores, Inferences, Comparative Analysis, Statistical Analysis

Content-Sampling as an Evaluation and Research Technique

Peer reviewed

Poggio, John P.; Glasnapp, Douglas R. – Educational and Psychological Measurement, 1973

Descriptors: Academic Achievement, Evaluation Methods, Formative Evaluation, Item Sampling

Exploring the Replicability of a Study's Results: Bootstrap Statistics for the Multivariate Case.

Peer reviewed

Thompson, Bruce – Educational and Psychological Measurement, 1995

Use of the bootstrap method in a canonical correlation analysis to evaluate the replicability of a study's results is illustrated. More confidence may be vested in research results that replicate. (SLD)

Descriptors: Analysis of Covariance, Correlation, Effect Size, Evaluation Methods

SSAIS: A Program to Assess Adverse Impact in Multistage Selection Decisions

Peer reviewed

Direct link

De Corte, Wilfried – Educational and Psychological Measurement, 2004

The article describes a Windows program to estimate the expected value and sampling distribution function of the adverse impact ratio for general multistage selections. The results of the program can also be used to predict the risk that a future selection decision will result in an outcome that reflects the presence of adverse impact. The method…

Descriptors: Sampling, Measurement Techniques, Evaluation Methods, Computer Software

Direct Estimation of Correlation as a Measure of Association Strength Using Multidimensional Item Response Models

Peer reviewed

Direct link

Wang, Wen-Chung – Educational and Psychological Measurement, 2004

The Pearson correlation is used to depict effect sizes in the context of item response theory. Amultidimensional Rasch model is used to directly estimate the correlation between latent traits. Monte Carlo simulations were conducted to investigate whether the population correlation could be accurately estimated and whether the bootstrap method…

Descriptors: Test Length, Sampling, Effect Size, Correlation

The Effects of Nonnormal Distributions on Confidence Intervals around the Standardized Mean Difference: Bootstrap and Parametric Confidence Intervals

Peer reviewed

Direct link

Kelley, Ken – Educational and Psychological Measurement, 2005

The standardized group mean difference, Cohen's "d", is among the most commonly used and intuitively appealing effect sizes for group comparisons. However, reporting this point estimate alone does not reflect the extent to which sampling error may have led to an obtained value. A confidence interval expresses the uncertainty that exists between…

Descriptors: Intervals, Sampling, Integrity, Effect Size

Evaluation of Computer Simulated Baseline Statistics for Use in Item Bias Studies.

Peer reviewed

Rogers, H. Jane; Hambleton, Ronald K. – Educational and Psychological Measurement, 1989

The validity of logistic test models and computer simulation methods for generating sampling distributions of item bias statistics was evaluated under the hypothesis of no item bias. Test data from 937 ninth-grade students were used to develop 7 steps for applying computer-simulated baseline statistics in test development. (SLD)

Descriptors: Computer Simulation, Educational Research, Evaluation Methods, Grade 9

Michael, William B.	2
David Navarro-González	1
De Corte, Wilfried	1
Fabia Morales-Vives	1
Forester, Donald Lee	1
Glasnapp, Douglas R.	1
Hambleton, Ronald K.	1
Harring, Jeffrey R.	1
Kelley, Ken	1
Marcoulides, George A.	1
McNeish, Daniel	1
Peng, Chao-Ying Joanne	1
Pere J. Ferrando	1
Poggio, John P.	1
Raykov, Tenko	1
Rogers, H. Jane	1
Rothrock, Julia E.	1
Stone, Clement A.	1
Thompson, Bruce	1
Wang, Wen-Chung	1
Wilcox, Rand R.	1
Xinchang Zhou	1
Yan Xia	1
Zhang, Bo	1
Zhu, Jin	1
More ▼