Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Tucker, Ledyard R.; And Others – 1970
Three topics in factor analysis are covered: a) a reliability coefficient for assessing the quality of a maximum likelihood factor analysis, b) an application of three-mode factor analysis to serial learning data, showing variations in learning curves over stages of learning and individuals, and c) the use of personal probability functions to…
Descriptors: Correlation, Factor Analysis, Hypothesis Testing, Individual Differences
Stanley, Julian C.; Livingston, Samuel A. – 1971
Besides the ubiquitous Pearson product-moment r, there are a number of other measures of relationship that are attenuated by errors of measurement and for which the relationship between true measures can be estimated. Among these are the correlation ratio (eta squared), Kelley's unbiased correlation ratio (epsilon squared), Hays' omega squared,…
Descriptors: Analysis of Variance, Cluster Grouping, Correlation, Data Analysis
Freeberg, Norman E.; Creech, F. Reid – 1971
Measurement properties of the written rules tests were examined to determine their suitability for assessing driver knowledge and skill. It was concluded that the tests, as measurement tools for granting or renewing a driver's license, were not adequate. Recommendations for test improvement and a sample test copy are included. (AG)
Descriptors: Driver Education, Factor Analysis, Item Analysis, Predictive Validity
Bochner, Arthur P. – 1972
The validity and reliability of small group research published between 1970 and 1971 is examined in this paper. In response to the small group research position which gives precedence to theory over method, the author counters that placing measurement in a secondary position increases the danger of accepting claims of experiments which contain…
Descriptors: Communication (Thought Transfer), Evaluation Criteria, Group Dynamics, Reliability
Rossiter, Charles M., Jr.; And Others – 1972
The authors question Burgoon's assumption that the acquisition of grades and the achievement of course goals are highly positively correlated. The major criticisms of Burgoon's study concern use of grades as measurements of success and the nature of the experimental population. The use of grades is criticized on the grounds that disparity existed…
Descriptors: Academic Achievement, Behavior, Communication (Thought Transfer), Evaluation Methods
PDF pending restorationMeredith, Keith E.; Sabers, Darrell L. – 1972
Data required for evaluating a Criterion Referenced Measurement (CRM) is described with a matrix. The information within the matrix consists of the "pass-fail" decisions of two CRMs. By differentially defining these two CRMs, different concepts of reliability and validity can be examined. Indices suggested for analyzing the matrix are listed with…
Descriptors: Criterion Referenced Tests, Factor Analysis, Item Analysis, Research Methodology
Carroll, John B. – 1971
The subjective magnitude estimation (SME) procedure was used to obtain estimates of relative word frequency from two adult groups (15 lexicographers, 13 other adults) for 60 words ranging widely in objective frequency. Lexicographers rendered more reliable estimates, and their averaged data correlated more highly with objective log frequency than…
Descriptors: Adults, Correlation, Discriminant Analysis, Individual Differences
Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1970
The United States Training and Employment Service General Aptitude Test Battery (GATB), first published in 1947, has been included in a continuing program of research to validate the tests against success in many different occupations. The GATB consists of 12 tests which measure nine aptitudes: General Learning Ability; Verbal Aptitude; Numerical…
Descriptors: Aptitude Tests, Career Guidance, Carpentry, Evaluation Criteria
Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1970
The United States Training and Employment Service General Aptitude Test Battery (GATB), first published in 1947, has been included in a continuing program of research to validate the tests against success in many different occupations. The GATB consists of 12 tests which measure nine aptitudes: General Learning Ability; Verbal Aptitude; Numerical…
Descriptors: Aptitude Tests, Career Guidance, Evaluation Criteria, Job Applicants
Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1955
The United States Training and Employment Service General Aptitude Test Battery (GATB), first published in 1947, has been included in a continuing program of research to validate the tests against success in many different occupations. The GATB consists of 12 tests which measure nine aptitudes: General Learning Ability; Verbal Aptitude; Numerical…
Descriptors: Aptitude Tests, Career Guidance, Ceramics, Evaluation Criteria
Lewis, Barbara; And Others – 1973
This paper describes and evaluates a new abstract form of the Purdue Elementary Problem-Solving Inventory. The new test parallels a shortened form of the original Inventory, but presents problems verbally rather than through slides. Both forms were given to advantaged and disadvantaged second- and fourth-graders. For the total sample, the slide…
Descriptors: Abstract Reasoning, Cognitive Tests, Comparative Analysis, Elementary Education
Marshall, Jon Clark – 1973
This manual describes the construction, administration and interpretation of the Course Evaluation Schedule, designed to assess students' perception of instruction. The inventory is divided into four parts; the first, designed to elicit information about the instructional modes used, is not included in the ratings. The remaining three parts…
Descriptors: Course Evaluation, Guides, Rating Scales, Student Attitudes
Blai, Boris, Jr. – 1971
Statistics are an essential tool for making proper judgement decisions. It is concerned with probability distribution models, testing of hypotheses, significance tests and other means of determining the correctness of deductions and the most likely outcome of decisions. Measures of central tendency include the mean, median and mode. A second…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Hypothesis Testing
Cillizza, Joseph Edward – 1970
The purpose of this study was to construct and validate a test of critical thinking ability. A preliminary form was checked for face validity by a panel of experts in reading. Item analysis of this form resulted in a final form consisting of four parts with three subscales each. This form, and tests of intelligence and general reading ability,…
Descriptors: Critical Thinking, Doctoral Dissertations, Intelligence, Junior High School Students
Tyler, Thomas A. – 1968
It was hypothesized that unstable subject-item interactions on personality scales would be associated with small psychological distances between subjects and items. It was further hypothesized that this relationship would be more demonstrable when the psychological test was more homogeneous. To clarify the rationale of the first hypothesis, an…
Descriptors: Data Analysis, Hypothesis Testing, Interaction Process Analysis, Personality Measures


