Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Mahoney, Gerald; Petersen, Gail – 1980
This study reports the interrater agreement of the Maternal Language Classification Scale (MLCS), a functional language classification system, developed partly to avoid problems identified with previous scales. The MLCS is a comprehensive system for classifying the functional content of maternal language addressed to children whose mean length of…
Descriptors: Infants, Language Acquisition, Measures (Individuals), Mothers
BENTWICH, J.; AND OTHERS – 1967
THIS TEST BATTERY IS DESIGNED TO BE USED AS AN AID IN COUNSELING AND GUIDANCE FOR PUPILS IN THE NINTH AND TENTH GRADES OF ACADEMIC HIGH SCHOOLS IN ISRAEL. AS THE INTENT IS TO MEASURE THE PUPIL'S ABILITY TO DO CRITICAL THINKING IN BROAD AREAS, THE TEST BATTERY MEASURES GENERAL EDUCATIONAL DEVELOPMENT RATHER THAN SPECIFIC ACHIEVEMENT SKILLS IN THE…
Descriptors: Achievement Tests, Hebrew, Test Construction, Test Reliability
Mayekawa, Shin-ichi; Haebara, Tomokazu – 1980
A least squares approach to estimating the reliability of a measure consisting of more than three content homogeneous or congeneric parts is proposed. The advantages of this method over a more indirect approach in which certain parts of a measure are combined to use Kristof's or Feldt's coefficients are examined. One hundred four-part tests were…
Descriptors: Achievement Tests, Least Squares Statistics, Mathematical Models, Test Reliability
AN INVESTIGATION OF NON-INDEPENDENCE OF COMPONENTS OF SCORES ON MULTIPLE-CHOICE TESTS. FINAL REPORT.
ZIMMERMAN, DONALD W.; BURKHEIMER, GRAHAM J., JR. – 1968
INVESTIGATION IS CONTINUED INTO VARIOUS EFFECTS OF NON-INDEPENDENT ERROR INTRODUCED INTO MULTIPLE-CHOICE TEST SCORES AS A RESULT OF CHANCE GUESSING SUCCESS. A MODEL IS DEVELOPED IN WHICH THE CONCEPT OF THEORETICAL COMPONENTS OF SCORES IS NOT INTRODUCED AND IN WHICH, THEREFORE, NO ASSUMPTIONS REGARDING ANY RELATIONSHIP BETWEEN SUCH COMPONENTS NEED…
Descriptors: Computers, Item Analysis, Mathematical Models, Objective Tests
Luft, Max; Bemis, Katherine A. – 1970
The object of this study was to validate a technique for establishing inter-rater reliability on the Southwestern Cooperative Interaction Observation Schedule (SCIOS), where it was impractical to bring the observers to a common site. Reliability was originally obtained when eight observers met together. Observers were divided into four pairs. A…
Descriptors: Classroom Observation Techniques, Interaction Process Analysis, Reliability, Videotape Recordings
Pandey, Tej N.; Hubert, Lawrence J. – 1974
This investigation had two major purposes. The first was to explore the use of an inferential technique called Tukey's Jackknife in establishing a confidence interval about cooefficient alpha reliability. The second purpose was to study the robustness of the Feldt and the jackknife procedures when the data fails to satisfy usual normality…
Descriptors: Comparative Analysis, Item Sampling, Statistical Analysis, Statistics
Farley, Frank H.; And Others – 1970
Two studies were reported which attempted to estimate the stability and construct validity of human salivary response as a measure of individual differences (IDs) in physiological arousal. Twenty-second base line estimates and 20-second response levels to four drops of lemon juice were measured, with the former value being removed from the latter…
Descriptors: Arousal Patterns, Individual Differences, Measurement, Psychological Studies
Harris, Chester W. – 1972
The efficiency of mastery tests of fixed length which sorts students into two categories is analyzed. For the sort of the students, an index, suggested by Fisher's linear discriminant function for two groups, is provided. (DB)
Descriptors: Educational Testing, Models, Statistical Analysis, Student Distribution
Gelso, Charles J.; And Others. – 1972
This study assessed the extent to which students commit various types of errors when completing Holland's Self-Directed Search (SDS) entirely on their own. Nearly all students made some type of error and approximately half of the students made errors that affected their final 3-letter summary codes. Almost one-fifth of the students made errors…
Descriptors: Aptitude Tests, Higher Education, Occupational Tests, Test Reliability
Shapiro, Peter D. – 1972
A brief and simple guide discusses the place and purpose of coding experimental data in the research process. The trade-offs involved unitizing data are reviewed; it is noted that a decision that increases reliability may reduce the validity of results, but without reliability there will be no validity at all. A discussion of categorizing data…
Descriptors: Codification, Reliability, Research Methodology, Research Problems
Peer reviewedChase, Terry V.; And Others – Adolescence, 1975
It has been the impression of the authors that a significant minority of MMPI profiles produced by adolescents suggest schizophrenia when such a process is not evident clinically. The present study was conducted in order to measure and better understand the frequency of false positive adolescent MMPI profiles. (Author)
Descriptors: Adolescents, Patients, Psychiatry, Research Methodology
Peer reviewedBealer, Robert C. – Rural Sociology, 1975
Descriptors: Concept Formation, Generalization, Reliability, Research
Peer reviewedRosenzweig, Saul – Journal of Personality Assessment, 1978
Data are presented on the retest and split-half reliability of the Rosenzweig Picture-Frustration (P-F) Study, Children's Form, for two groups of subjects (aged 10-11 and 12-13), each group tested twice at an interval of three months. Reliability by retest was consistently higher than by the split-half method. (Author/CTM)
Descriptors: Aggression, Elementary Education, Personality Measures, Projective Measures
Peer reviewedMaggiore, Ronald P. – Exceptional Children, 1978
The reliability of the proposed short form of the Revised Illinois Test of Psycholinguistic Abilities (ITPA) was computed on data derived from six-year-old ITPA standardization test booklets (128 Ss). (Author/CL)
Descriptors: Exceptional Child Research, Handicapped Children, Test Reliability, Testing Problems
Peer reviewedMcDonald, Roderick P. – Educational and Psychological Measurement, 1978
It is shown that if a behavior domain can be described by the common factor model with a finite number of factors, the squared correlation between the sum of a selection of items and the domain total score is actually greater than coefficient alpha. (Author/JKS)
Descriptors: Factor Analysis, Item Analysis, Mathematical Models, Measurement


