Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 44 |
Descriptor
Error of Measurement | 84 |
Reliability | 84 |
Scores | 24 |
Validity | 19 |
Correlation | 14 |
Estimation (Mathematics) | 12 |
Psychometrics | 12 |
Simulation | 12 |
Computation | 11 |
Evaluation Methods | 11 |
Structural Equation Models | 11 |
More ▼ |
Source
Author
Raykov, Tenko | 3 |
Fan, Xitao | 2 |
Henson, Robin K. | 2 |
Kolen, Michael J. | 2 |
Lee, Guemin | 2 |
Sijtsma, Klaas | 2 |
Vacha-Haase, Tammi | 2 |
Wang, Tianyou | 2 |
Williams, Richard H. | 2 |
Yin, Ping | 2 |
Zimmerman, Donald W. | 2 |
More ▼ |
Publication Type
Reports - Evaluative | 84 |
Journal Articles | 66 |
Speeches/Meeting Papers | 11 |
Book/Product Reviews | 2 |
Reports - Descriptive | 2 |
Tests/Questionnaires | 2 |
Opinion Papers | 1 |
Education Level
Higher Education | 7 |
Elementary Secondary Education | 4 |
Elementary Education | 3 |
Postsecondary Education | 2 |
Adult Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Portugal | 2 |
Australia | 1 |
California | 1 |
Canada | 1 |
Germany | 1 |
Iowa | 1 |
New York | 1 |
North Carolina | 1 |
Pennsylvania | 1 |
Spain | 1 |
Texas | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Tenko Raykov – Educational and Psychological Measurement, 2024
This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…
Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Martí, Mónica; Ródenas, Carmen – International Journal of Social Research Methodology, 2021
This paper analyses the reliability and accuracy of the relationships between migration and employment status when estimated using a linked data set. The analysis will be carried out using a new source, the "Labour and Geographical Mobility Statistics," which is provided by the Spanish Statistical Office. This statistic is constructed by…
Descriptors: Foreign Countries, Error of Measurement, Occupational Mobility, Migration
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2019
Longitudinal data analysis has received widespread interest throughout educational, behavioral, and social science research, with latent growth curve modeling currently being one of the most popular methods of analysis. Despite the popularity of latent growth curve modeling, limited attention has been directed toward understanding the issues of…
Descriptors: Reliability, Longitudinal Studies, Growth Models, Structural Equation Models
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Raykov, Tenko; Marcoulides, George A.; Li, Tenglong – Educational and Psychological Measurement, 2017
The measurement error in principal components extracted from a set of fallible measures is discussed and evaluated. It is shown that as long as one or more measures in a given set of observed variables contains error of measurement, so also does any principal component obtained from the set. The error variance in any principal component is shown…
Descriptors: Error of Measurement, Factor Analysis, Research Methodology, Psychometrics
Gehlbach, Hunter; Hough, Heather J. – Policy Analysis for California Education, PACE, 2018
As educational practitioners and policymakers expand the range of student outcomes they assess, student perception surveys--particularly those targeting social-emotional learning--have grown in popularity. Despite excitement around the potential for measuring a wider array of important student outcomes, concerns about the validity of the…
Descriptors: Social Development, Emotional Development, Validity, School Districts
Petscher, Yaacov; Cummings, Kelli Dawn; Biancarosa, Gina; Fien, Hank – Assessment for Effective Intervention, 2013
The purpose of this article is to provide a commentary on the current state of several measurement issues pertaining to curriculum-based measures of reading (R-CBM). We begin by providing an overview of the utility of R-CBM, followed by a presentation of five specific measurements considerations: (a) the reliability of R-CBM oral reading fluency…
Descriptors: Measurement, Reading Fluency, Curriculum Based Assessment, Error of Measurement
Raymond, Mark R.; Swygert, Kimberly A.; Kahraman, Nilufer – Journal of Educational Measurement, 2012
Although a few studies report sizable score gains for examinees who repeat performance-based assessments, research has not yet addressed the reliability and validity of inferences based on ratings of repeat examinees on such tests. This study analyzed scores for 8,457 single-take examinees and 4,030 repeat examinees who completed a 6-hour clinical…
Descriptors: Physicians, Licensing Examinations (Professions), Performance Based Assessment, Repetition
Haertel, Edward H. – Educational Testing Service, 2013
Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores of a certain kind. This suggests using a test validation model to examine their reliability and validity. Validation begins with an interpretive…
Descriptors: Reliability, Validity, Inferences, Teacher Effectiveness
Moses, Tim – Journal of Educational Measurement, 2012
The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…
Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores
Beauducel, Andre – Applied Psychological Measurement, 2013
The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…
Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement
The Strengths Assessment Inventory: Reliability of a New Measure of Psychosocial Strengths for Youth
Brazeau, James N.; Teatero, Missy L.; Rawana, Edward P.; Brownlee, Keith; Blanchette, Loretta R. – Journal of Child and Family Studies, 2012
A new measure, the Strengths Assessment Inventory-Youth self-report (SAI-Y), was recently developed to assess the strengths of children and adolescents between the ages of 10 and 18 years. The SAI-Y differs from similar measures in that it provides a comprehensive assessment of strengths that are intrinsic to the individual as well as strengths…
Descriptors: Error of Measurement, Psychometrics, Secondary School Students, Adolescents
McGill, D. A.; van der Vleuten, C. P. M.; Clarke, M. J. – Advances in Health Sciences Education, 2011
Even though rater-based judgements of clinical competence are widely used, they are context sensitive and vary between individuals and institutions. To deal adequately with rater-judgement unreliability, evaluating the reliability of workplace rater-based assessments in the local context is essential. Using such an approach, the primary intention…
Descriptors: Error of Measurement, Certification, Communication Skills, Trainees
Vasconcelos-Raposo, Jose; Fernandes, Helder Miguel; Teixeira, Carla M.; Bertelli, Rosangela – Social Indicators Research, 2012
The purpose of the present study was to examine the reliability, factorial validity and measurement invariance (across gender, age and physical activity participation) of a Portuguese version of the Rosenberg Self-Esteem Scale (RSES). The sample consisted of 1,763 Portuguese youngsters (731 male and 1,032 female) with ages between 15 and 20 years.…
Descriptors: Validity, Factor Structure, Measures (Individuals), Factor Analysis