Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Wang, Lin; Fan, Xitao – 1997
Standard statistical methods are used to analyze data that is assumed to be collected using a simple random sampling scheme. These methods, however, tend to underestimate variance when the data is collected with a cluster design, which is often found in educational survey research. The purposes of this paper are to demonstrate how a cluster design…
Descriptors: Cluster Analysis, Educational Research, Error of Measurement, Estimation (Mathematics)
Chang, Te-Sheng; Brookshire, William – 1997
The question of least-squares weights versus equal weights has been a subject of great interest to researchers for over 60 years. Several researchers have compared the efficiency of equal weights and that of least-squares weights under different conditions. Recently, S. V. Paunonen and R. C. Gardner stressed that the necessary and sufficient…
Descriptors: Correlation, Error of Measurement, Least Squares Statistics, Predictor Variables
Tritchler, D. L.; Pedrini, D. T. – 1983
The N=1 analysis differs from a typical analysis of variance in that there is no within-cell error term. Thus interaction terms are used as estimates of error variance. If the interaction term in question represents a significant interaction, the F tests will be conservative. Tukey's test for nonadditivity will detect a common form of interaction.…
Descriptors: Analysis of Variance, Computer Programs, Data Analysis, Error of Measurement
Kolen, Michael J. – 1984
Large sample standard errors for the Tucker method of linear equating under the common item nonrandom groups design are derived under normality assumptions as well as under less restrictive assumptions. Standard errors of Tucker equating are estimated using the bootstrap method described by Efron. The results from different methods are compared…
Descriptors: Certification, Comparative Analysis, Equated Scores, Error of Measurement
Lord, Frederic M. – 1981
A formula is derived for the asymptotic standard error of a true-score equating by item response theory (IRT). The equating method is applicable when the two tests to be equated are administered to different groups along with an "anchor test." Numerical standard errors are shown for an actual equating 1) comparing the standard errors of…
Descriptors: Comparative Analysis, Equated Scores, Error of Measurement, Latent Trait Theory

Hunyh, Hunyh; Saunders, Joseph C. – 1979
Comparisons were made among various methods of estimating the reliability of pass-fail decisions based on mastery tests. The reliability indices that are considered are p, the proportion of agreements between two estimates, and kappa, the proportion of agreements corrected for chance. Estimates of these two indices were made on the basis of…
Descriptors: Cutting Scores, Error of Measurement, Mastery Tests, Reliability

Forsyth, Robert A. – Applied Psychological Measurement, 1978
This note shows that, under conditions specified by Levin and Subkoviak (TM 503 420), it is not necessary to specify the reliabilities of observed scores when comparing completely randomized designs with randomized block designs. Certain errors in their illustrative example are also discussed. (Author/CTM)
Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Reliability

Levin, Joel R.; Subkoviak, Michael J. – Applied Psychological Measurement, 1978
Comments (TM 503 706) on an earlier article (TM 503 420) concerning the comparison of the completely randomized design and the randomized block design are acknowledged and appreciated. In addition, potentially misleading notions arising from these comments are addressed and clarified. (See also TM 503 708). (Author/CTM)
Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Reliability

Forsyth, Robert A. – Applied Psychological Measurement, 1978
This note continues the discussion of earlier articles (TM 503 420, TM 503 706, and TM 503 707), comparing the completely randomized design with the randomized block design. (CTM)
Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Reliability

Singer, Judith, D. – Journal of Experimental Education, 1987
A two-stage generalized least squares model is developed for estimating the linear regression of an individual outcome on a group characteristic in studies of multilevel data. Results of this model are compared to the results of analytic methods, and formulas are developed for assessing the accuracy of the traditional approaches. (Author/JAZ)
Descriptors: Error of Measurement, Least Squares Statistics, Mathematical Models, Regression (Statistics)

Borich, Gary; Klinzing, Garhard – Journal of Classroom Interaction, 1984
Problems in studying teacher effectiveness through the use of classroom observation are discussed. Four assumptions in the observation of classroom process are offered and ways in which these assumptions can be dealt with in designing an observation study are suggested. (DF)
Descriptors: Classroom Observation Techniques, Error of Measurement, Experimenter Characteristics, Interrater Reliability

Glutting, Joseph J.; And Others – Educational and Psychological Measurement, 1987
This paper discusses the basic theory underlying confidence limits and presents reasons why psychologists should incorporate confidence ranges in their psychodiagnostic reports. Four methods for establishing confidence limits are compared. Three of the methods involve estimated true scores, and the fourth is the standard error of measurement…
Descriptors: Error of Measurement, Mathematical Formulas, Psychological Evaluation, Scores

McPhee, Robert D.; Babrow, Austin – Communication Monographs, 1987
Presents a table of cases requiring causal modeling or an equivalent technique. Reviews nine years of published research in communication journals to assess the adequacy of analysis in these situations. Offers standards for the conduct and reporting of causal modeling along with a review of their use in published causal modeling. (NKA)
Descriptors: Communication Research, Error of Measurement, Models, Research Methodology

Rindskopf, David; Rose, Tedd – Multivariate Behavioral Research, 1988
Confirmatory factor analysis was applied to test second- and higher-order factor models in the areas of structure of abilities, allometry, and the separation of specific and error variance estimates. The estimation of validity and reliability, second-order models within factor analysis models, and the concept of discriminability were also studied.…
Descriptors: Discriminant Analysis, Error of Measurement, Estimation (Mathematics), Factor Analysis

Reddon, John R.; And Others – Journal of Educational Statistics, 1985
Computer sampling from a multivariate normal spherical population was used to evaluate the type one error rates for a test of sphericity based on the distribution of the determinant of the sample correlation matrix. (Author/LMO)
Descriptors: Computer Simulation, Correlation, Error of Measurement, Matrices