Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Oliver, Jo Ellen – 1977
A primary-grade-level instrument was developed to measure thinking prerequisites to reading comprehension. Levels of responses are identified as either perceptual or conceptual; the ability to synthesize concepts from several sources and make generalizations is also measured. Analyses of partial construct validity, of cross-form reliability, and…
Descriptors: Cognitive Measurement, Concept Formation, Grade 5, Intermediate Grades
PDF pending restorationKummerow, Jean M.; Hummel, Thomas J. – 1977
A study of 60 adults, ages 23-38, was done to assess the fit of life-stages (periods during which adults of similar age face common problems, events, pressures, situations) identified by writers in adult development for these ages. Focus was on (1) creating a structured interview schedule to obtain data which should be age-related and (2) creating…
Descriptors: Adult Development, Adults, Age Groups, Behavioral Science Research
Peer reviewedEvans, Williams R. – Psychology in the Schools, 1975
This research generated norms on the Peterson-Quay Behavior Problem Checklist for an inner city population (N=101), as well as providing test-retest reliability coefficients between two applications of the checklist. Data is provided on the conduct, personality, inadequacy-immaturity and socialized delinquency dimensions of the checklist and for…
Descriptors: Behavior Rating Scales, Behavioral Science Research, Disadvantaged Youth, Elementary Education
Peer reviewedTuma, June M.; McCraw, Ronald K. – Journal of Personality Assessment, 1975
Rorschach test protocols for a matched sample of male and female subjects, in the child and adolescent range, were scored for total responses. The data was analyzed for evidence of interactions between sex of experimenter and sex and age of subject. (Author/BJG)
Descriptors: Age, Children, Elementary Secondary Education, Examiners
Peer reviewedGolden, Charles J. – Journal of Personality Assessment, 1975
An attempt was made to develop a form of the Stroop Test which could be used in both group and individual settings and serve as a basic form for interested researchers. Group and individual measures differ only in that the group test does not require a spoken response. (Author/BJG)
Descriptors: Group Testing, Higher Education, Individual Testing, Personality Measures
McIntosh, Eranell I.; Warren, Sue Allen – Training Sch Bull, 1969
Descriptors: Behavior Rating Scales, Exceptional Child Research, Institutionalized Persons, Longitudinal Studies
Paul, Howard A.; Miller, Joel R. – Training Sch Bull, 1969
Descriptors: Exceptional Child Research, Item Analysis, Mental Retardation, Nonverbal Tests
Guess, Doug; And Others – 1981
The second of a three volume report on a University of Kansas approach to developing quantitative measures of motor and perceptual motor functioning in nonhandicapped and severely/multiply handicapped infants and young children presents interobserver reliability results from the measures described in volume 1. Some studies also include a limited…
Descriptors: Developmental Stages, Evaluation Methods, Infants, Motor Development
Deshler, Donald D.; And Others – 1980
A statewide random sampling of seven groups of professional educators (N=90) and 30 parents of learning disabled (LD) students were compared for their degree of agreement on the Modified Component Disability Instrument. Professionals included teachers of LD adolescents, remedial reading teachers, school psychologists, speech clinicians, school…
Descriptors: Adolescents, Behavior Patterns, Check Lists, Evaluation Criteria
Bauer, Barbara Ann – 1981
To compare the relative reliable uses and cost effectiveness of the analytic, the holistic, and the primary trait scoring methods, an inquiry was conducted in which a group of raters scored a large number of secondary school students' essays according to each of the scoring methods. Raters were nine graduate students in English who were trained in…
Descriptors: Comparative Analysis, Cost Effectiveness, Expository Writing, Holistic Evaluation
Cohen, Vicki L. Blum – 1982
In order to develop a systematic procedure for the evaluation and revision of educational software for microcomputers, a study was undertaken by the Educational Products Information Exchange (EPIE) Institute and the Microcomputer Resource Center at Columbia University to define the criteria that are needed to evaluate instructional microcomputer…
Descriptors: Computer Assisted Instruction, Computer Programs, Elementary Secondary Education, Evaluation Criteria
Singh, Balwant; And Others – 1980
This 46-question multiple choice test deals with the physical and chemical properties of matter, wave motion and types of energy, simple machines, equipment safety and measurement. The test is meant for administration to grade 8 students before and after instruction. Item analysis of the pre- post data are included, as are reliability estimates…
Descriptors: Grade 8, Item Analysis, Junior High Schools, Multiple Choice Tests
Rosso, Martin A.; Reckase, Mark D. – 1981
The overall purpose of this research was to compare a maximum likelihood based tailored testing procedure to a Bayesian tailored testing procedure. The results indicated that both tailored testing procedures produced equally reliable ability estimates. Also an analysis of test length indicated that reasonable ability estimates could be obtained…
Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Analysis, Computer Assisted Testing
Levine, Michael V. – 1976
It is shown that empirical mental test P - P plots are approximately equal to theoretical item-item curves, at least for long tests administered to many people. This result is important because it leads to (1) a distribution free method for estimating points on item-item curves; (2) a general method for defining estimates of item parameters; and…
Descriptors: Item Analysis, Latent Trait Theory, Mathematical Applications, Mathematical Models
Gould, R. Bruce – 1978
The construction and norming of Form N of the Air Force Officer Qualifying Test (AFOQT) is described. The new form serves the same purpose as its predecessor and possesses basically the same characteristics. References are made to the research which provided the basis for most of the changes. Other changes were made because of the admission of…
Descriptors: Aircraft Pilots, Item Analysis, Norms, Occupational Tests


