Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 2 |
Descriptor
Statistical Analysis | 41 |
Test Interpretation | 41 |
Testing | 41 |
Test Reliability | 14 |
Test Validity | 14 |
Achievement Tests | 11 |
Scores | 11 |
Test Results | 11 |
Measurement Techniques | 9 |
Test Construction | 9 |
Standardized Tests | 8 |
More ▼ |
Source
Author
Publication Type
Education Level
Higher Education | 2 |
Postsecondary Education | 1 |
Audience
Students | 1 |
Location
California (Stanford) | 1 |
Colorado (Denver) | 1 |
Hawaii | 1 |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
Assessments and Surveys
Metropolitan Achievement Tests | 2 |
ACT Assessment | 1 |
Adjective Check List | 1 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
Torrance Tests of Creative… | 1 |
Woodcock Johnson Tests of… | 1 |
What Works Clearinghouse Rating
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Brown, James Dean – Language Assessment Quarterly, 2008
In keeping with the theme of the International Language Testing Association/Language Testing Research Colloquium Conference in 2008, "Focusing on the Core: Justifying the Use of Language Assessments to Stakeholders," I define "stakeholder-friendly tests," "defensible testing," and "testing-context analysis."…
Descriptors: Language Usage, Curriculum Development, Testing, Language Tests
Lord, Frederic M. – 1973
A new formula is developed for the relative efficiency of two tests measuring the same trait. The formula expresses relative efficiency solely in terms of the standard errors of measurement and, surprisingly, the frequency distributions of true scores. Approximate methods for estimating relative efficiency may make this function routinely…
Descriptors: Error of Measurement, Research Reports, Statistical Analysis, Test Interpretation

Garvin, Alfred D. – Journal of Educational Measurement, 1976
A simple, usefully accurate approximation of the standard error of measurement is proposed for use by classroom teachers. An empirical comparison with Lord's approximation indicated that, though not as easy to calculate as Lord's, this approximation is more practical because it is useful at any point in the score distribution. (BW)
Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Formulas, Statistical Analysis
Klaas, Alan C. – 1975
Current usage and theory of standard error of measurement calls for one standard error of measurement figure to be used across all levels of scoring. The study revealed that scoring variance across scoring levels is not constant. As scoring ability increases scoring variance decreases. The assertion that low and high scoring subjects will…
Descriptors: Error of Measurement, Guessing (Tests), Scoring, Statistical Analysis

Spencer, Bruce D. – Journal of Educational Measurement, 1983
Because test scores are ordinal not cordinal attributes, the average test score often is a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…
Descriptors: Correlation, Measurement Techniques, Scores, Statistical Analysis

Shoemaker, David M. – Journal of Special Education, 1972
Considered is the improvement of criterion-referenced measurement as applied to individual and group assessment of handicapped and normal children. (DB)
Descriptors: Criterion Referenced Tests, Evaluation, Exceptional Child Education, Handicapped Children
Stanley, Julian C.; Wang, Marilyn D. – 1968
The literature on a priori and empirical weighting of test items and test-item options is reviewed. While multiple regression is the best known technique for deriving fixed empirical weights for component variables (such as tests and test items), other methods allow one to derive weights which equalize the effective weights of the component…
Descriptors: Correlation, Literature Reviews, Measurement Instruments, Measurement Techniques
Simpson, J. D. – Audio-Visual Language Journal, 1974
Some basic statistical concepts relevant to the teacher--mean scores, standard deviation, normal and skewed distributions, z scores, item analysis, standard error of measurement, reliability--and their use by the teacher are explained. (RM)
Descriptors: Error of Measurement, Evaluation Methods, Norm Referenced Tests, Scoring
Woodson, M. I. Charles E.
It has been argued that item variance and test variance are not necessary characteristics for criterion-referenced tests, although they are necessary for norm-referenced tests. This position is in error because it considers sample statistics as the criteria for evaluating items and tests. Within a particular sample, an item or test may have no…
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Item Analysis, Item Sampling

Gorth, William P.; Hambleton, Ronald K. – Journal of Special Education, 1972
Descriptors: Criterion Referenced Tests, Evaluation, Exceptional Child Education, Handicapped Children
Warries, Egbert – 1974
The objective of the study is to convince educational researchers of the necessity for "standard mastery curves" for the graphical representation of scores on summative tests for a group of students. Attention is drawn to the study of theoretical and empirical skew curves in education and biology. Use of standard mastery curves and study…
Descriptors: Educational Research, Evaluation Needs, Graphs, Mastery Learning
Yeoh, Oon-Chye; Bridgham, Robert G. – 1971
This study tests the appropriateness of Torrance's assumptions of trait independence and the combinability of measures (Torrance Tests of Creative Thinking) with respect to the scoring of the tests for a younger population and estimates the homogeneity of the scores. The sample consisted of 128 elementary school children. Results indicate that…
Descriptors: Creativity, Creativity Research, Creativity Tests, Elementary School Students
Angoff, William H.; Stern, June – 1971
This study examines a common set of equating items for their appropriateness to both American and Canadian populations and, to determine the equations for converting scores from the verbal and mathematical scales of the 1971 form of the Canadian Scholastic Aptitude Test (the future standard continuing scales of the CSAT) to the corresponding…
Descriptors: Comparative Analysis, Correlation, Cross Cultural Studies, Equated Scores

McGrew, Kevin; Murphy, Suzanne – Journal of School Psychology, 1995
Investigates the general factor and uniqueness characteristics of the individual tests of the Woodcock-Johnson Test of Cognitive Ability-Revised (WJTCA-R). Only 2 of the 19 WJTCA-R tests examined had low general factor loadings, while 2 had low uniqueness. All other tests had medium or high uniqueness. Discusses implications for clinical…
Descriptors: Academic Ability, Cognitive Ability, Intelligence, Intelligence Tests