Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 33 |
Descriptor
Error of Measurement | 36 |
Scores | 36 |
Validity | 36 |
Reliability | 16 |
Measures (Individuals) | 12 |
Foreign Countries | 11 |
Psychometrics | 10 |
Factor Analysis | 9 |
Correlation | 6 |
Elementary School Students | 6 |
Evaluation Methods | 6 |
More ▼ |
Source
Author
Biancarosa, Gina | 2 |
Fien, Hank | 2 |
Abu-Hamour, Bashir | 1 |
Aman, Michael G. | 1 |
Aryadoust, Vahid | 1 |
Asil, Mustafa | 1 |
Bahar, Mustafa | 1 |
Birenbaum, Menucha | 1 |
Bleses, Dorthe | 1 |
Cummings, Kelli Dawn | 1 |
De Cock, P. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 5 |
Secondary Education | 5 |
Elementary Education | 4 |
Middle Schools | 4 |
Postsecondary Education | 4 |
High Schools | 3 |
Junior High Schools | 3 |
Elementary Secondary Education | 2 |
Grade 2 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
More ▼ |
Audience
Location
United States | 2 |
Arkansas | 1 |
Australia | 1 |
Canada | 1 |
China (Beijing) | 1 |
Denmark | 1 |
Indonesia | 1 |
Israel | 1 |
Jordan | 1 |
Michigan | 1 |
Netherlands | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Tenko Raykov – Educational and Psychological Measurement, 2024
This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…
Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement
Manuel T. Rein; Jeroen K. Vermunt; Kim De Roover; Leonie V. D. E. Vogelsmeier – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Researchers often study dynamic processes of latent variables in everyday life, such as the interplay of positive and negative affect over time. An intuitive approach is to first estimate the measurement model of the latent variables, then compute factor scores, and finally use these factor scores as observed scores in vector autoregressive…
Descriptors: Measurement Techniques, Factor Analysis, Scores, Validity
Wu, Tong – ProQuest LLC, 2023
This three-article dissertation aims to address three methodological challenges to ensure comparability in educational research, including scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…
Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores
Yeon Ha Kim – Journal of Early Adolescence, 2025
This study aimed to introduce an ego-resiliency questionnaire for preadolescents (the ER-P) by restructuring the ER89 using the data of 1398 preadolescents from the Panel Study on Korean Children. The ER-P was proposed as a 10-item second-order instrument with two factors (Optimal Regulation and Openness to Life Experiences). The ER-P achieved…
Descriptors: Self Concept, Resilience (Psychology), Preadolescents, Asians
Kate E. Walton – ACT, Inc., 2024
There is a tradeoff between scale length and psychometric concerns. The two are, in fact, directly linked. Generally, when scales are shortened, reliability is reduced, and when scales are lengthened, reliability is improved, provided the items added to the scale are comparable psychometrically (AERA et al., 2014). Scale reliability, in turn,…
Descriptors: Psychometrics, Error of Measurement, Rating Scales, Reliability
Martín-Puga, M. Eva; Pelegrina, Santiago; Gómez-Pérez, M. Mar; Justicia-Galiano, M. José – Journal of Psychoeducational Assessment, 2022
The objectives were to examine the factorial structure of the Academic Procrastination Scale-Short Form (APS-S) and the measurement invariance across gender and educational levels, to determine possible differences in procrastination across gender, educational levels, and grades. The sample was formed of 1486 Spanish primary and secondary school…
Descriptors: Psychometrics, Measures (Individuals), Study Habits, Scores
Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2019
Existing measures of test anxiety used with the college student population are old with old norms and old items, and they do not capture the multiple dimensions of the test anxiety construct or assess facilitating anxiety. In the present study, the validity of the scores of a new, multidimensional measure of test anxiety with a facilitating…
Descriptors: Cross Cultural Studies, Gender Differences, Test Anxiety, Foreign Countries
Martínez, José Felipe; Kloser, Matt; Srinivasan, Jayashri; Stecher, Brian; Edelman, Amanda – Educational and Psychological Measurement, 2022
Adoption of new instructional standards in science demands high-quality information about classroom practice. Teacher portfolios can be used to assess instructional practice and support teacher self-reflection anchored in authentic evidence from classrooms. This study investigated a new type of electronic portfolio tool that allows efficient…
Descriptors: Science Instruction, Academic Standards, Instructional Innovation, Electronic Publishing
Bahar, Mustafa; Asil, Mustafa; Rubie-Davies, Christine M. – European Journal of Educational Research, 2018
Among school psycho-social factors with considerable effect on student outcomes are both school and classroom climate. Because how students perceive the classroom climate strongly predicts achievement, measuring classroom climate gains importance and the need for testing the existing results across cultures persists. In this study, we assessed the…
Descriptors: Error of Measurement, Climate, Change, Factor Analysis
McCaffrey, Daniel F.; Yuan, Kun; Savitsky, Terrance D.; Lockwood, J. R.; Edelen, Maria O. – Educational Measurement: Issues and Practice, 2015
We examine the factor structure of scores from the CLASS-S protocol obtained from observations of middle school classroom teaching. Factor analysis has been used to support both interpretations of scores from classroom observation protocols, like CLASS-S, and the theories about teaching that underlie them. However, classroom observations contain…
Descriptors: Factor Structure, Multivariate Analysis, Scores, Factor Analysis
Kaat, Aaron J.; Lecavalier, Luc; Aman, Michael G. – Journal of Autism and Developmental Disorders, 2014
The Aberrant Behavior Checklist (ABC) is a widely used measure in autism spectrum disorder (ASD) treatment studies. We conducted confirmatory and exploratory factor analyses of the ABC in 1,893 children evaluated as part of the Autism Treatment Network. The root mean square error of approximation was .086 for the standard item assignment, and in…
Descriptors: Validity, Children, Autism, Pervasive Developmental Disorders
Petscher, Yaacov; Cummings, Kelli Dawn; Biancarosa, Gina; Fien, Hank – Assessment for Effective Intervention, 2013
The purpose of this article is to provide a commentary on the current state of several measurement issues pertaining to curriculum-based measures of reading (R-CBM). We begin by providing an overview of the utility of R-CBM, followed by a presentation of five specific measurements considerations: (a) the reliability of R-CBM oral reading fluency…
Descriptors: Measurement, Reading Fluency, Curriculum Based Assessment, Error of Measurement
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Raymond, Mark R.; Swygert, Kimberly A.; Kahraman, Nilufer – Journal of Educational Measurement, 2012
Although a few studies report sizable score gains for examinees who repeat performance-based assessments, research has not yet addressed the reliability and validity of inferences based on ratings of repeat examinees on such tests. This study analyzed scores for 8,457 single-take examinees and 4,030 repeat examinees who completed a 6-hour clinical…
Descriptors: Physicians, Licensing Examinations (Professions), Performance Based Assessment, Repetition
Haertel, Edward H. – Educational Testing Service, 2013
Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores of a certain kind. This suggests using a test validation model to examine their reliability and validity. Validation begins with an interpretive…
Descriptors: Reliability, Validity, Inferences, Teacher Effectiveness