NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1,216 to 1,230 of 3,295 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung; Shih, Ching-Lin; Sun, Guo-Wei – Educational and Psychological Measurement, 2012
The DIF-free-then-DIF (DFTD) strategy consists of two steps: (a) select a set of items that are the most likely to be DIF-free and (b) assess the other items for DIF (differential item functioning) using the designated items as anchors. The rank-based method together with the computer software IRTLRDIF can select a set of DIF-free polytomous items…
Descriptors: Test Bias, Test Items, Item Response Theory, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Barchard, Kimberly A. – Psychological Methods, 2012
This article introduces new statistics for evaluating score consistency. Psychologists usually use correlations to measure the degree of linear relationship between 2 sets of scores, ignoring differences in means and standard deviations. In medicine, biology, chemistry, and physics, a more stringent criterion is often used: the extent to which…
Descriptors: Psychologists, Error of Measurement, Correlation, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Taehun; Cai, Li – Journal of Educational and Behavioral Statistics, 2012
Model-based multiple imputation has become an indispensable method in the educational and behavioral sciences. Mean and covariance structure models are often fitted to multiply imputed data sets. However, the presence of multiple random imputations complicates model fit testing, which is an important aspect of mean and covariance structure…
Descriptors: Statistical Inference, Structural Equation Models, Goodness of Fit, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Watkins, Ann E.; Bargagliotti, Anna; Franklin, Christine – Journal of Statistics Education, 2014
Although the use of simulation to teach the sampling distribution of the mean is meant to provide students with sound conceptual understanding, it may lead them astray. We discuss a misunderstanding that can be introduced or reinforced when students who intuitively understand that "bigger samples are better" conduct a simulation to…
Descriptors: Simulation, Sampling, Sample Size, Misconceptions
Peer reviewed Peer reviewed
Direct linkDirect link
De Laet, Steven; Colpin, Hilde; Goossens, Luc; Van Leeuwen, Karla; Verschueren, Karine – Journal of Psychoeducational Assessment, 2014
Through an examination of measurement invariance, this study investigated whether attachment-related dimensions (i.e., secure base, safe haven, and negative interactions as measured with the Network of Relationships Inventory-Behavioral Systems Version) have the same psychological meaning for early adolescents in their relationships with parents…
Descriptors: Parent Child Relationship, Attachment Behavior, Error of Measurement, Teacher Student Relationship
Peer reviewed Peer reviewed
Direct linkDirect link
Heinicke, Susanne – Interchange: A Quarterly Review of Education, 2014
Every measurement in science, every experimental decision, result and information drawn from it has to cope with something that has long been named by the term "error". In fact, errors describe our limitations when it comes to experimental science and science looks back on a long tradition to cope with them. The widely known way to cope…
Descriptors: Coping, Teaching Methods, Motivation Techniques, Science Education History
Peer reviewed Peer reviewed
Direct linkDirect link
Scott-Clayton, Judith; Crosta, Peter M.; Belfield, Clive R. – Educational Evaluation and Policy Analysis, 2014
Remediation is one of the largest single interventions intended to improve outcomes for underprepared college students, yet little is known about the remedial screening process. Using administrative data and a rich predictive model, we find that severe mis-assignments are common using current test-score-cutoff-based policies, with…
Descriptors: Remedial Instruction, Remedial Programs, College Students, Screening Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Chiang, Yu-Tzu; Lin, Sunny S. J. – Scandinavian Journal of Educational Research, 2014
This study examined the measurement structure, cross-year stability of achievement goals, and mediating effects of achievement goals between self-efficacy and math grades in a national sample of Taiwan middle school students. The measurement model with factorial structure showed good fit to the data. In the panel data (N?=?343), four achievement…
Descriptors: Middle School Students, Mathematics Achievement, Goal Orientation, Self Efficacy
Peer reviewed Peer reviewed
Direct linkDirect link
Hughes, Sean; Lyddy, Fiona; Lambe, Sinead – Psychology Learning and Teaching, 2013
This article provides an overview of the available evidence on psychological misconceptions, including key findings, current directions and emerging issues for investigation. We begin by defining misconceptions and then examine their prevalence and persistence, discuss their implications for student learning and highlight potential strategies to…
Descriptors: Misconceptions, Psychological Studies, Evidence, Definitions
Topczewski, Anna Marie – ProQuest LLC, 2013
Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…
Descriptors: Item Response Theory, Scaling, Scores, Student Development
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Citkowicz, Martyna; Hedges, Larry V. – Society for Research on Educational Effectiveness, 2013
In some instances, intentionally or not, study designs are such that there is clustering in one group but not in the other. This paper describes methods for computing effect size estimates and their variances when there is clustering in only one group and the analysis has not taken that clustering into account. The authors provide the effect size…
Descriptors: Multivariate Analysis, Effect Size, Sampling, Sample Size
Moses, Tim; Liu, Jinghua – Educational Testing Service, 2011
In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…
Descriptors: Equated Scores, Data Analysis, Scores, Methods
Guo, Hongwen; Sinharay, Sandip – Educational Testing Service, 2011
Nonparametric, or kernel, estimation of item response curve (IRC) is a concern theoretically and operationally. Accuracy of this estimation, often used in item analysis in testing programs, is biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. In this study, we investigate…
Descriptors: Error of Measurement, Nonparametric Statistics, Item Response Theory, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Culpepper, Steven Andrew; Aguinis, Herman – Psychological Methods, 2011
Analysis of covariance (ANCOVA) is used widely in psychological research implementing nonexperimental designs. However, when covariates are fallible (i.e., measured with error), which is the norm, researchers must choose from among 3 inadequate courses of action: (a) know that the assumption that covariates are perfectly reliable is violated but…
Descriptors: Statistical Analysis, Error of Measurement, Monte Carlo Methods, Structural Equation Models
Peer reviewed Peer reviewed
Direct linkDirect link
Fayers, Peter – Advances in Health Sciences Education, 2011
Although many parametric statistical tests are considered to be robust, as recently shown in Methodologist's Corner, it still pays to be circumspect about the assumptions underlying statistical tests. In this paper I show that robustness mainly refers to "[alpha]", the type-I error. If the underlying distribution of data is ignored there…
Descriptors: Statistical Analysis, Tests, Robustness (Statistics), Statistical Distributions
Pages: 1  |  ...  |  78  |  79  |  80  |  81  |  82  |  83  |  84  |  85  |  86  |  ...  |  220