Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 2 |
Descriptor
| Error of Measurement | 7 |
| Individual Differences | 7 |
| Test Reliability | 7 |
| Statistical Significance | 3 |
| Factor Analysis | 2 |
| Mathematical Models | 2 |
| Power (Statistics) | 2 |
| Research Methodology | 2 |
| Test Validity | 2 |
| Testing Problems | 2 |
| Academic Ability | 1 |
| More ▼ | |
Author
| Brendan A. Schuetze | 1 |
| Forbes, Dean W. | 1 |
| Humphreys, Lloyd G. | 1 |
| Marston, Paul T., Borich,… | 1 |
| Paul T. von Hippel | 1 |
| Vlachopoulos, Symeon P. | 1 |
| Whitely, Susan E. | 1 |
| Zimmerman, Donald W. | 1 |
Publication Type
| Reports - Research | 5 |
| Journal Articles | 4 |
| Collected Works - General | 1 |
| Opinion Papers | 1 |
| Reports - Evaluative | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025
Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…
Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences
Vlachopoulos, Symeon P. – Structural Equation Modeling: A Multidisciplinary Journal, 2008
This study examined the extent of measurement invariance of the Basic Psychological Needs in Exercise Scale responses (BPNES; Vlachopoulos & Michailidou, 2006) across male (n = 716) and female (n = 1,147) exercise participants. BPNES responses from exercise participants attending private fitness centers (n = 1,012) and community exercise programs…
Descriptors: Psychological Patterns, Factor Structure, Measures (Individuals), Measurement
Peer reviewedZimmerman, Donald W.; And Others – Applied Psychological Measurement, 1993
Some of the methods originally used to find relationships between reliability and power associated with a single measurement are extended to difference scores. Results, based on explicit power calculations, show that augmenting the reliability of measurement by reducing error score variance can make significance tests of difference more powerful.…
Descriptors: Equations (Mathematics), Error of Measurement, Individual Differences, Mathematical Models
Peer reviewedHumphreys, Lloyd G.; And Others – Applied Psychological Measurement, 1993
Two articles discuss the controversy about the relationship between reliability and the power of significance tests in response to the discussion of Donald W. Zimmerman, Richard H. Williams, and Bruno D. Zumbo. Lloyd G. Humphreys emphasizes the differences between what statisticians can do and constraints on researchers. Zimmerman, Williams, and…
Descriptors: Error of Measurement, Individual Differences, Power (Statistics), Research Methodology
Peer reviewedWhitely, Susan E. – Applied Psychological Measurement, 1979
Two sources of inconsistency were separated by reanalyzing data from a major study on short-term consistency. Little evidence was found for generalizability or behavioral predictability. Results supported the assumption that measurement error from short-term fluctuations is not due to systematic individual differences in response consistency.…
Descriptors: Behavior Change, Cognitive Processes, College Freshmen, Error of Measurement
Marston, Paul T., Borich, Gary D. – 1977
The four main approaches to measuring treatment effects in schools; raw gain, residual gain, covariance, and true scores; were compared. A simulation study showed true score analysis produced a large number of Type-I errors. When corrected for this error, this method showed the least power of the four. This outcome was clearly the result of the…
Descriptors: Achievement Gains, Analysis of Covariance, Comparative Analysis, Error of Measurement
Forbes, Dean W. – 1976
Rasch calibration permitted the development of short achievement tests that were economical in testing time, and could be developed in a series of difficulty levels to suit student individual differences. Furthermore, these tests were of adequate reliability for practical educational measurement when individual students were assigned to tests of…
Descriptors: Academic Ability, Achievement Tests, Classification, Elementary Education

Direct link
