NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
No Child Left Behind Act 20011
What Works Clearinghouse Rating
Showing 1 to 15 of 34 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Tenko Raykov – Educational and Psychological Measurement, 2024
This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…
Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A.; Li, Tenglong – Educational and Psychological Measurement, 2017
The measurement error in principal components extracted from a set of fallible measures is discussed and evaluated. It is shown that as long as one or more measures in a given set of observed variables contains error of measurement, so also does any principal component obtained from the set. The error variance in any principal component is shown…
Descriptors: Error of Measurement, Factor Analysis, Research Methodology, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Flanagan, Dawn P.; Schneider, W. Joel – International Journal of School & Educational Psychology, 2016
When education works, it creates productive, innovative citizens eager to contribute to a well-functioning democracy. In contrast, educational failure has lifelong consequences, with some individuals experiencing decades of preventable hardship. Dawn Flanagan and Joel Schneider write in this response that, like Kranzler, Floyd, Benson, Zabowski,…
Descriptors: Learning Disabilities, Identification, Diagnostic Tests, Criticism
Gehlbach, Hunter; Hough, Heather J. – Policy Analysis for California Education, PACE, 2018
As educational practitioners and policymakers expand the range of student outcomes they assess, student perception surveys--particularly those targeting social-emotional learning--have grown in popularity. Despite excitement around the potential for measuring a wider array of important student outcomes, concerns about the validity of the…
Descriptors: Social Development, Emotional Development, Validity, School Districts
Peer reviewed Peer reviewed
Direct linkDirect link
Petscher, Yaacov; Cummings, Kelli Dawn; Biancarosa, Gina; Fien, Hank – Assessment for Effective Intervention, 2013
The purpose of this article is to provide a commentary on the current state of several measurement issues pertaining to curriculum-based measures of reading (R-CBM). We begin by providing an overview of the utility of R-CBM, followed by a presentation of five specific measurements considerations: (a) the reliability of R-CBM oral reading fluency…
Descriptors: Measurement, Reading Fluency, Curriculum Based Assessment, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Raymond, Mark R.; Swygert, Kimberly A.; Kahraman, Nilufer – Journal of Educational Measurement, 2012
Although a few studies report sizable score gains for examinees who repeat performance-based assessments, research has not yet addressed the reliability and validity of inferences based on ratings of repeat examinees on such tests. This study analyzed scores for 8,457 single-take examinees and 4,030 repeat examinees who completed a 6-hour clinical…
Descriptors: Physicians, Licensing Examinations (Professions), Performance Based Assessment, Repetition
Haertel, Edward H. – Educational Testing Service, 2013
Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores of a certain kind. This suggests using a test validation model to examine their reliability and validity. Validation begins with an interpretive…
Descriptors: Reliability, Validity, Inferences, Teacher Effectiveness
Peer reviewed Peer reviewed
Direct linkDirect link
Vasconcelos-Raposo, Jose; Fernandes, Helder Miguel; Teixeira, Carla M.; Bertelli, Rosangela – Social Indicators Research, 2012
The purpose of the present study was to examine the reliability, factorial validity and measurement invariance (across gender, age and physical activity participation) of a Portuguese version of the Rosenberg Self-Esteem Scale (RSES). The sample consisted of 1,763 Portuguese youngsters (731 male and 1,032 female) with ages between 15 and 20 years.…
Descriptors: Validity, Factor Structure, Measures (Individuals), Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, C. Matthew; Gorelick, Mark – Measurement in Physical Education and Exercise Science, 2011
The purpose of this study was to examine the validity of the Smarthealth watch (Salutron, Inc., Fremont, California, USA), a heart rate monitor that includes a wristwatch without an accompanying chest strap. Twenty-five individuals participated in 3-min periods of standing, 2.0 mph walking, 3.5 mph walking, 4.5 mph jogging, and 6.0 mph running.…
Descriptors: Metabolism, Intervals, Physical Activities, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Wu, Pei-Chen – Journal of Psychoeducational Assessment, 2010
This study examined measurement invariance (i.e., configural invariance, metric invariance, scalar invariance) of the Chinese version of Beck Depression Inventory II (BDI-II-C) across college males and females and compared gender differences on depression at the latent factor mean level. Two samples composed of 402 male college students and 595…
Descriptors: College Students, Females, Negative Attitudes, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Dory, Valerie; Gagnon, Robert; Charlin, Bernard – Advances in Health Sciences Education, 2010
Case-specificity, i.e., variability of a subject's performance across cases, has been a consistent finding in medical education. It has important implications for assessment validity and reliability. Its root causes remain a matter of discussion. One hypothesis, content-specificity, links variability of performance to variable levels of relevant…
Descriptors: Medical Education, Trainees, English (Second Language), Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010
Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…
Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Pullin, Andrew S.; Knight, Teri M. – New Directions for Evaluation, 2009
To use environmental program evaluation to increase effectiveness, predictive power, and resource allocation efficiency, evaluators need good data. Data require sufficient credibility in terms of fitness for purpose and quality to develop the necessary evidence base. The authors examine elements of data credibility using experience from critical…
Descriptors: Data, Credibility, Conservation (Environment), Program Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Geiser, Christian; Eid, Michael; Nussbeck, Fridtjof W.; Courvoisier, Delphine S.; Cole, David A. – Developmental Psychology, 2010
The authors show how structural equation modeling can be applied to analyze change in longitudinal multitrait-multimethod (MTMM) studies. For this purpose, an extension of latent difference models (McArdle, 1988; Steyer, Eid, & Schwenkmezger, 1997) to multiple constructs and multiple methods is presented. The model allows investigators to separate…
Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Validity, Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Monbaliu, E.; Ortibus, E.; Roelens, F.; Desloovere, K.; Deklerck, J.; Prinzie, P.; De Cock, P.; Feys, H. – Developmental Medicine & Child Neurology, 2010
Aim: This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). Method: Three raters independently scored videotapes of 10 patients (five males, five females;…
Descriptors: Content Validity, Cerebral Palsy, Validity, Interrater Reliability
Previous Page | Next Page ยป
Pages: 1  |  2  |  3