Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |

Wilson, Mark; Hoskens, Machteld – Journal of Educational and Behavioral Statistics, 2001
Introduces the Rater Bundle Model, an item response model for repeated ratings of student work. Applies the model to real and simulated data to illustrate the approach, which was motivated by the observation that when repeated ratings occur, the assumption of conditional independence is violated, and current item response models can then…
Descriptors: Error of Measurement, Evaluators, Item Response Theory, Models

Ogasawara, Haruhiko – Psychometrika, 2002
Derived formulas for the asymptotic standard errors of component loading estimates to cover the cases of principal component analysis for unstandardized and standardized variables with orthogonal and oblique rotations. Used the formulas with a real correlation matrix of 355 subjects who took 12 psychological tests. (SLD)
Descriptors: Correlation, Error of Measurement, Factor Analysis, Matrices

Henson, Robin K.; Thompson, Bruce – Measurement and Evaluation in Counseling and Development, 2002
T. Vacha-Haase (1998) proposed her "reliability generalization" methodology to characterize (a) typical score reliability for a measure across studies, (b) the variability of score reliabilities, and (c) what measurement protocol features predict the variability in score reliabilities across administration. The present article provides…
Descriptors: Error of Measurement, Generalization, Psychometrics, Research Methodology

Magnano, Catherine L.; And Others – Child Development, 1989
Findings indicate that high cortisol levels and interfering substances in formula and breast milk could contaminate salivary cortisol measurements in young infants. To insure accurate results, appropriate controls should be taken for salivary cortisol measurements of young infants. (RH)
Descriptors: Error of Measurement, Guidelines, Infants, Measurement Techniques

Keselman, H. J.; Keselman, Joanne C. – Journal of Educational Statistics, 1988
Two Tukey multiple comparisons and Bonferroni and multivariate approaches are compared for their rates of Type I error and any-pairs power when multisample sphericity was not satisfied and the design was unbalanced. For tests of weighted means and for study conditions investigated, the Bonferroni procedure provides a workable solution. (TJH)
Descriptors: Error of Measurement, Multivariate Analysis, Power (Statistics), Weighted Scores

Zeng, Lingjia; And Others – Applied Psychological Measurement, 1994
A general delta method is described for computing the standard error (SE) of a chain of linear equations. The general delta method derives the SEs directly from the moments of the score distributions obtained in the equating chain. Computer simulations demonstrate the method. (SLD)
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Statistical Distributions

Rosenthal, Neal H. – Monthly Labor Review, 1992
Occupational employment projections for 1990 were conservative. Detailed comparison of projected and actual increases shows that too many occupations were projected to have average growth and more of those expected to have rapid growth were underprojected. (Author)
Descriptors: Demand Occupations, Employment Projections, Error of Measurement, Tables (Data)

Williams, Valerie S. L.; Jones, Lyle V.; Tukey, John W. – Journal of Educational and Behavioral Statistics, 1999
Illustrates and compares three alternative procedures to adjust significance levels for multiplicity: (1) the traditional Bonferroni technique; (2) a sequential Bonferroni technique; and (3) a sequential approach to control the false discovery rate proposed by Y. Benjamini and Y. Hochberg (1995). Explains advantages of the Benjamini and Hochberg…
Descriptors: Academic Achievement, Comparative Analysis, Error of Measurement, Statistical Significance

Brennan, Robert L. – Applied Psychological Measurement, 1998
Provides a comprehensive and integrated treatment of both conditional absolute standard errors of measurement (SEM) and conditional relative SEMs from the perspective of generalizability theory. Illustrates the approach with examples from commercial standardized tests. Examples support the conclusion that both types of conditional SEMs tend to be…
Descriptors: Error of Measurement, Generalizability Theory, Raw Scores, Standardized Tests

Doble, Susan E.; Fisk, John D.; Lewis, Norma; Rockwood, Kenneth – Occupational Therapy Journal of Research, 1999
The findings of a study of 55 elderly adults support the test-retest reliability of the Assessment of Motor and Process Skills, illustrate the utility of alternative methods for examining the reliability of individual subjects' measures, and indicate that not all test-retest differences represent measurement error. (Author/JOW)
Descriptors: Error of Measurement, Older Adults, Psychomotor Skills, Test Reliability

Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1998
Two relatively simple methods for estimating the condition standard error of measurement (SEM) for nonlinearly derived score scales are proposed. Applications indicate that these two procedures produce fairly consistent estimates that tend to peak near the high end of the scale and reach a minimum in the middle of the raw score scale. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Raw Scores, Reliability

Wang, Tianyou; Hanson, Bradley A.; Harris, Deborah J. – Applied Psychological Measurement, 2000
Studied whether circular equating could provide an adequate measure of various types of equating error when applied to different equating methods under different equating designs. Analyses and simluations show that circular equating is generally invalid as a criterion to evaluate the adequacy of equating. (SLD)
Descriptors: Criteria, Equated Scores, Error of Measurement, Evaluation Methods

Raykov, Tenko – Structural Equation Modeling, 2000
Shows that the conventional noncentrality parameter estimator of covariance structure models, currently implemented in popular structural modeling programs, possesses asymptotically potentially large bias, variance, and mean squared error (MSE). Presents a formal expression for its large-sample bias and quantifies large-sample bias and MSE. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Sample Size, Statistical Bias

Riniolo, Todd C. – Journal of Experimental Education, 1999
Presents an alternative statistical test, BOOT(subscript)med for the two-group situation when a small experimental group is being compared with a large control group. BOOTmed is a between-groups median test derived through bootstrapping techniques. Empirical validation indicates that BOOTmed maintains relatively robust error rates under a variety…
Descriptors: Comparative Analysis, Control Groups, Error of Measurement, Statistical Analysis

Markowski, Edward P.; Markowski, Carol A. – Journal of Education for Business, 1999
Proposes the use of statistical power subsequent to the results of hypothesis testing in business research. Describes how posttest use of power might be integrated into business statistics courses. (SK)
Descriptors: Business Administration, Error of Measurement, Hypothesis Testing, Research