Wainer, Howard; Wright, Benjamin D. – 1980
The pure Rasch model was compared with four modifications of the model in a number of different simulations in order to ascertain the comparative efficiencies of the parameter estimations of these modifications. Because there is always noise in test score data, some individuals may have response patterns that do not fit the model and their…
Descriptors: Error of Measurement, Guessing (Tests), Item Analysis, Latent Trait Theory

Kaplan, David – Multivariate Behavioral Research, 1988
The impact of misspecification on the estimation, testing, and improvement of structural equation models was assessed via a population study in which a prototypical latent variable model was misspecified. Results provide insights into the maximum likelihood estimator versus a limited two-stage least squares estimator in LISREL. (TJH)
Descriptors: Computer Simulation, Computer Software, Demography, Error of Measurement

Sexton, Thomas R.; And Others – New Directions for Program Evaluation, 1986
Recent methodological advances are described that enable the analyst to extract additional information from the data envelopment analysis (DEA) methodology, including goal programming to develop cross-efficiencies, cluster analysis, analysis of variance, and pooled cross section time-series analysis. Some shortcomings of DEA are discussed. (LMO)
Descriptors: Efficiency, Error of Measurement, Evaluation Methods, Evaluation Problems

Westermann, Rainer; Hager, Willi – Journal of Educational Statistics, 1986
The well-known problem of cumulating error probabilities is reconsidered from a general epistemological perspective, namely, the concepts of severity and of fairness of tests. It is shown that not only Type 1 but also Type 2 errors can cumulate. A new adjustment strategy is proposed and applied. (Author/JAZ)
Descriptors: Educational Research, Error of Measurement, Hypothesis Testing, Measurement Techniques

Rasmussen, Jeffrey Lee – Evaluation Review, 1985
A recent study (Blair and Higgins, 1980) indicated a power advantage for the Wilcoxon W Test over Student's t-test when calculated from a common mixed-normal sample. Results of the present study indicate that the t-test corrected for outliers shows a superior power curve to the Wilcoxon W.
Descriptors: Computer Simulation, Error of Measurement, Hypothesis Testing, Power (Statistics)

Smith, Brandon B. – Journal of Vocational Education Research, 1984
This article focuses on steps in conducting empirical-analytic research and the problems of controlling for or estimating three sources of error: the amount of measurement error, research design error, and the amount of statistical or sampling error. (Author/CT)
Descriptors: Analysis of Covariance, Analysis of Variance, Error of Measurement, Objectivity

Carlson, Raymond W. – Computers in Human Services, 1985
Clinically oriented computer systems require compatibility with the information processing that typifies clinical work. This paper summarizes some of the research that can be applied to such processing and uses the summary to suggest likely errors. Emphasis is placed on decision support systems that integrate human and computer information…
Descriptors: Case Records, Case Studies, Computers, Decision Support Systems

Baranowski, Tom – Journal of School Health, 1985
The most commonly used method of collecting outcome data in health education programs is self-report, which produces a variety of measurement errors. A model is proposed to systematically identify major influences for accuracy of self-reported health behavior. Methodologic studies are described, and eight steps to increase accuracy are proposed.…
Descriptors: Error of Measurement, Health Behavior, Health Education, Research Methodology

Basch, Charles E.; Gold, Robert S. – Journal of School Health, 1985
Reliability guides research design and is used as a standard for judging the credibility of findings and inferences. Using data gathered in a school health education curriculum evaluation as an example, possible errors in hypothesis testing are examined. Appropriateness of internal consistency as a measure of reliability is discussed and…
Descriptors: Cognitive Tests, Elementary Secondary Education, Error of Measurement, Health Education

Modjeski, Richard B.; Michael, William B. – Educational and Psychological Measurement, 1983
Two tests of critical thinking (the Cornell Critical Thinking Test and the Watson-Glaser Critical Thinking Appraisal) were evaluated by a panel of psychologists relative to the validity, reliability, and error of measurement standards stated in the "Standards for Educational and Psychological Tests," 1974. (PN)
Descriptors: Cognitive Tests, Critical Thinking, Error of Measurement, Evaluation Criteria

De Santi, Roger J.; Sullivan, Vicki Gallo – Journal of Research and Development in Education, 1985
Cloze-based evaluations of reading comprehension leave room for a greater amount of subjectivity in rating reader response. A study was designed to ascertain the nature of potential subjectivity within a single rater's ratings of cloze-based assessments of reading comprehension. (DF)
Descriptors: Cloze Procedure, Elementary Secondary Education, Error of Measurement, Interrater Reliability

Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1984
Three types of test were compared: a completion test, a matching test, and a multiple-choice test. The completion test was more reliable than the matching test, and the matching test was more reliable than the multiple-choice test. (Author/BW)
Descriptors: Comparative Analysis, Error of Measurement, Higher Education, Mathematical Models
Miller, M. David – 2002
In 1994 the State Collaborative on Assessment and Student Standards of the Council of Chief State School Officers began a study to examine the generalizability of performance-based assessments (PBAs) for state-mandated assessment programs. The intent was to examine the major sources of error associated with PBAs and the generalizability and…
Descriptors: Elementary Secondary Education, Error of Measurement, Generalizability Theory, Performance Based Assessment
Zwick, Rebecca; Thayer, Dorothy T. – 1994
Several recent studies have investigated the application of statistical inference procedures to the analysis of differential item functioning (DIF) in test items that are scored on an ordinal scale. Mantel's extension of the Mantel-Haenszel test is a possible hypothesis-testing method for this purpose. The development of descriptive statistics for…
Descriptors: Error of Measurement, Evaluation Methods, Hypothesis Testing, Item Bias
DeMars, Christine E. – 2002
When students are nested within course sections, the assumption of independence of residuals is unlikely to be met, unless the course section is explicitly included in the model. Hierarchical linear modeling (HLM) allows for modeling the course section as a random effect, leading to more accurate standard errors. In this study, students chose one…
Descriptors: College Entrance Examinations, College Students, Course Organization, Error of Measurement