Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 30 |
Since 2006 (last 20 years) | 95 |
Descriptor
True Scores | 415 |
Error of Measurement | 121 |
Test Reliability | 110 |
Statistical Analysis | 107 |
Mathematical Models | 97 |
Item Response Theory | 87 |
Correlation | 76 |
Equated Scores | 76 |
Reliability | 64 |
Test Theory | 52 |
Test Items | 50 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 12 |
Practitioners | 2 |
Administrators | 1 |
Teachers | 1 |
Location
Australia | 1 |
Canada | 1 |
China | 1 |
Colorado | 1 |
Illinois | 1 |
Israel | 1 |
New York | 1 |
Oregon | 1 |
Taiwan | 1 |
Texas | 1 |
United Kingdom (England) | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating

Littlefield, John H.; And Others – 1977
Generalizability theory extends previous methods of estimating the reliability of rating instruments such that one can estimate the precision of a measurement system for differentiating among students, scales, or any other important dimension. In this study, generalizability theory is applied to faculty ratings of junior and senior dental students…
Descriptors: Analysis of Variance, Clinical Experience, College Faculty, Data Collection
Graham, Darol L.; Bergquist, Constance – 1975
Two models were identified for criterion-referenced tests, one based on the assumption of a continuous achievement variable and the other assuming a dichotomous or binary variable. Several test characteristics were examined and contrasted for the two models, including the distribution of scores, establishment of a cutting score, test length, item…
Descriptors: Academic Achievement, Achievement Tests, Criterion Referenced Tests, Cutting Scores

Deacon, Christopher G. – Physics Teacher, 1992
Describes two simple methods of error analysis: (1) combining errors in the measured quantities; and (2) calculating the error or uncertainty in the slope of a straight-line graph. Discusses significance of the error in the comparison of experimental results with some known value. (MDH)
Descriptors: Error of Measurement, Goodness of Fit, High Schools, Higher Education
Koeller, Olaf – 1994
Scholastic achievement tests and mental ability tests normally consist of a set of multiple choice items, all of which are assumed to measure school-relevant cognitive abilities. The presumption, in a given test situation, is that the answers/solutions to the given tasks represent cognitive capabilities on the part of the examinees. The purpose of…
Descriptors: Achievement Tests, Cognitive Ability, Difficulty Level, Grade 7
Harris, Dale B.; Roberts, Jean – 1972
Data on the intellectual maturity of children 6-11 years of age in the noninstitutionalized population of the U. S. is analyzed in relation to their demographic and socioeconomic background. This is the second report on the Goodenough-Harris Drawing Test, administered in the Health Examination Survey of 1963-65, and deals with the results in…
Descriptors: Black Students, Demography, Educational Research, Elementary Education
Perry, Dallis – 1971
Principles of test administration, test validity, and accuracy of measurement underlying interpretation of standardized test scores in educational administration, instruction, and guidance are presented. Types of norm-referenced score transformations, including percentiles, standard scores, and grade equivalents, and of criterion referenced…
Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation, Expectancy Tables

Harris, Deborah J. – Journal of Educational Measurement, 1991
Two data collection designs, counterbalanced and spiraling (Angoff's Design I and Angoff's Design II) were compared using item response theory and equipercentile equating methodology in the vertical equating of 2 mathematics achievement tests using 1,000 eleventh graders and 1,000 twelfth graders. The greater stability of Design II is discussed.…
Descriptors: Achievement Tests, College Entrance Examinations, Comparative Analysis, Data Collection
Eignor, Daniel R. – 1985
The feasibility of pre-equating, or establishing conversions from raw to scaled scores through the use of pretest data before operationally administering a test, was investigated for the Scholastic Aptitude Test (SAT). Item-response theory based equating methods were used to estimate item parameters on SAT pretest data, instead of using final form…
Descriptors: College Entrance Examinations, Equated Scores, Estimation (Mathematics), Feasibility Studies
Werts, Charles E.; Linn, Robert L. – 1975
Forming a sequence covering the various aspects of the simplex model, four articles are presented here under the following titles: "A Simplex Model for Analyzing Academic Growth", "Analyzing Ratings With Correlated Intrajudge Measurement Errors", "The Correlation of States With Gain", and "The Reliability of…
Descriptors: Academic Achievement, Achievement Gains, Analysis of Covariance, College Students
Haladyna, Thomas – 1975
A central problem for the user of domain-referenced tests in instruction is deciding who has passed and who has failed. Two procedures were presented and discussed. The first, employing classical test theory, was found to be more useful for larger domains and where the passing standard is 70 percent or less. The sampling procedure suggested by…
Descriptors: Academic Achievement, Academic Standards, Criterion Referenced Tests, Decision Making Skills