Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Gaylord, Richard H. – Educ Psychol Meas, 1969
Descriptors: Correlation, Item Analysis, Mathematical Formulas, Test Construction
Edwards, Allen Jack – Psychol Rep, 1969
Descriptors: College Students, Divergent Thinking, Psychological Testing, Research
Lincoln, Yvonna S.; Guba, Egon G. – 1982
The educational audit is suggested for assessing the process of inquiry for reliability and the product of inquiry for absence of bias. The inquiry auditor must review the inquiry processes to determine that they conform to norms of "good professional practice." He must review inquiry products to ensure they can be substantiated from…
Descriptors: Data Analysis, Data Collection, Inquiry, Methods
Moore, John C., Jr. – 1980
This paper was written to help local level advocates in the human services understand and use statistical data from the U.S. Bureau of the Census. Because of budget cuts, difficult times are ahead for human services delivery. Advocates need to strengthen their technical skills. In the first section of the paper, two new programs of published data…
Descriptors: Advocacy, Guidelines, Human Services, Reliability
Gold, Robert S. – 1979
A study was conducted on the measurement of the characteristics of innovation. The results or specific recommendations of evaluation studies may be assessed for their likelihood of adoption and implementation based on these characteristics. As the recommendations are perceived more positively, the level of utilization should increase. The semantic…
Descriptors: Change Agents, Evaluation, Evaluators, Measurement Techniques
Naccarato, Richard W. – 1972
A study was initiated by a university rhetoric department to investigate the reliability of methods of rating student themes written as prerequisites for course exemptions. Ten experienced raters were selected to rate the beginning, middle, and ending paragraphs of 30 previously scored exemption themes that represented a range of achievement.…
Descriptors: Evaluation Criteria, Higher Education, Reliability, Writing Evaluation
Mahoney, Gerald; Petersen, Gail – 1980
This study reports the interrater agreement of the Maternal Language Classification Scale (MLCS), a functional language classification system, developed partly to avoid problems identified with previous scales. The MLCS is a comprehensive system for classifying the functional content of maternal language addressed to children whose mean length of…
Descriptors: Infants, Language Acquisition, Measures (Individuals), Mothers
BENTWICH, J.; AND OTHERS – 1967
THIS TEST BATTERY IS DESIGNED TO BE USED AS AN AID IN COUNSELING AND GUIDANCE FOR PUPILS IN THE NINTH AND TENTH GRADES OF ACADEMIC HIGH SCHOOLS IN ISRAEL. AS THE INTENT IS TO MEASURE THE PUPIL'S ABILITY TO DO CRITICAL THINKING IN BROAD AREAS, THE TEST BATTERY MEASURES GENERAL EDUCATIONAL DEVELOPMENT RATHER THAN SPECIFIC ACHIEVEMENT SKILLS IN THE…
Descriptors: Achievement Tests, Hebrew, Test Construction, Test Reliability
Mayekawa, Shin-ichi; Haebara, Tomokazu – 1980
A least squares approach to estimating the reliability of a measure consisting of more than three content homogeneous or congeneric parts is proposed. The advantages of this method over a more indirect approach in which certain parts of a measure are combined to use Kristof's or Feldt's coefficients are examined. One hundred four-part tests were…
Descriptors: Achievement Tests, Least Squares Statistics, Mathematical Models, Test Reliability
AN INVESTIGATION OF NON-INDEPENDENCE OF COMPONENTS OF SCORES ON MULTIPLE-CHOICE TESTS. FINAL REPORT.
ZIMMERMAN, DONALD W.; BURKHEIMER, GRAHAM J., JR. – 1968
INVESTIGATION IS CONTINUED INTO VARIOUS EFFECTS OF NON-INDEPENDENT ERROR INTRODUCED INTO MULTIPLE-CHOICE TEST SCORES AS A RESULT OF CHANCE GUESSING SUCCESS. A MODEL IS DEVELOPED IN WHICH THE CONCEPT OF THEORETICAL COMPONENTS OF SCORES IS NOT INTRODUCED AND IN WHICH, THEREFORE, NO ASSUMPTIONS REGARDING ANY RELATIONSHIP BETWEEN SUCH COMPONENTS NEED…
Descriptors: Computers, Item Analysis, Mathematical Models, Objective Tests
Luft, Max; Bemis, Katherine A. – 1970
The object of this study was to validate a technique for establishing inter-rater reliability on the Southwestern Cooperative Interaction Observation Schedule (SCIOS), where it was impractical to bring the observers to a common site. Reliability was originally obtained when eight observers met together. Observers were divided into four pairs. A…
Descriptors: Classroom Observation Techniques, Interaction Process Analysis, Reliability, Videotape Recordings
Pandey, Tej N.; Hubert, Lawrence J. – 1974
This investigation had two major purposes. The first was to explore the use of an inferential technique called Tukey's Jackknife in establishing a confidence interval about cooefficient alpha reliability. The second purpose was to study the robustness of the Feldt and the jackknife procedures when the data fails to satisfy usual normality…
Descriptors: Comparative Analysis, Item Sampling, Statistical Analysis, Statistics
Farley, Frank H.; And Others – 1970
Two studies were reported which attempted to estimate the stability and construct validity of human salivary response as a measure of individual differences (IDs) in physiological arousal. Twenty-second base line estimates and 20-second response levels to four drops of lemon juice were measured, with the former value being removed from the latter…
Descriptors: Arousal Patterns, Individual Differences, Measurement, Psychological Studies
Harris, Chester W. – 1972
The efficiency of mastery tests of fixed length which sorts students into two categories is analyzed. For the sort of the students, an index, suggested by Fisher's linear discriminant function for two groups, is provided. (DB)
Descriptors: Educational Testing, Models, Statistical Analysis, Student Distribution
Gelso, Charles J.; And Others. – 1972
This study assessed the extent to which students commit various types of errors when completing Holland's Self-Directed Search (SDS) entirely on their own. Nearly all students made some type of error and approximately half of the students made errors that affected their final 3-letter summary codes. Almost one-fifth of the students made errors…
Descriptors: Aptitude Tests, Higher Education, Occupational Tests, Test Reliability


