Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
[Peer reviewed] Gillham, James; Woelfel, Joseph – Human Communication Research, 1977
Describes the Galileo system of measurement operations including reliability and validity data. Illustrations of some of the relations between Galileo measures and traditional procedures are provided. (MH)
Descriptors: Cognitive Measurement, Communication (Thought Transfer), Higher Education, Measurement Instruments
Hansen, Jo-Ida C. – Measurement and Evaluation in Guidance, 1977
Changing from the SVIB to the SCII increased the probability of machine-scoring errors. This study examined the accuracy and consistency of SCII profile scores for three commercial scoring agencies. Results indicated improved performance compared with previous studies and suggested that scoring errors should be minimal for the SCII. (Author)
Descriptors: Comparative Analysis, Educational Testing, Interest Inventories, Research Projects
[Peer reviewed] Tsushima, William T.; Bratton, Joseph C. – Journal of Consulting and Clinical Psychology, 1977
Investigated geographic differences in Wechsler Adult Intelligence Scale (WAIS) results by comparing 60 Hawaiian and 60 mainland United States psychiatric outpatients. The influence of pidgin English led to expectations that Hawaiian subjects would have significantly lower WAIS Verbal scores than mainland subjects. Data verified these…
Descriptors: Comparative Analysis, Cultural Differences, Cultural Influences, Geographic Location
[Peer reviewed] Rubin, Kenneth H.; Trotter, Kristin T. – Developmental Psychology, 1977
Examined 3 methodological issues in the use of Kohlberg's Moral Judgment Scale: (1) test-retest reliability, (2) consistency of moral judgment stages from one dilemma to the next, and (3) influence of subject's verbal facility on the projective test scores. Forty children in grades 3 and 5 participated. (JMB)
Descriptors: Elementary Education, Measurement Techniques, Moral Development, Test Reliability
[Peer reviewed] Domino, George; Blumberg, Elaine – Journal of Youth and Adolescence, 1987
Some preliminary data on the Self-Esteem Questionnaire (SEQ), organized along Gough's conceptual model of primary, secondary, and tertiary evaluation, are presented. The SEQ appears to discriminate groups along a self-esteem continuum; fits in well with a variety of theoretical approaches; and does not have loadings on sex, intelligence, or social status.…
Descriptors: Adolescents, Models, Psychometrics, Self Concept Measures
[Peer reviewed] Brown, Robert D.; Prentice, David G. – Evaluation Review, 1987
This article reports on how short scales can be used to assess the perceived risks that exist for decision makers, their perception of the decision context and their perceived information needs. Three scales are described: (1) a Decision Risk Scale; (2) a Decision Context Inventory; and (3) a Need for Information Inventory. (JAZ)
Descriptors: Decision Making, Higher Education, Information Needs, Nurses
[Peer reviewed] McLeod, P. J. – Journal of Medical Education, 1987
A study of interrater reliability among 17 faculty members assessing medical student case reports revealed marked disparities in the criteria raters felt to be important and an unacceptable spread in the ratings given. A standardized assessment instrument is recommended instead. (MSE)
Descriptors: Higher Education, Interrater Reliability, Medical Case Histories, Medical Education
[Peer reviewed] James, Sharon L. – Reading Teacher, 1986
Concludes that the PLAI is a unique test that can be very useful in assessing children's ability to deal with the language of instruction. (FL)
Descriptors: Language Acquisition, Language Skills, Language Tests, Preschool Education
McLean, Mary; And Others – Journal of the Division for Early Childhood, 1987
The study evaluated the Battelle Developmental Inventory (BDI) with 40 disabled children under 30 months of age. Subjects were also given the Bayley Scales and the Vineland Adaptive Behavior Scales. Results indicated high concurrent validity, interrater reliability, and internal consistency for the BDI. (Author/DB)
Descriptors: Adaptive Behavior (of Disabled), Cognitive Measurement, Disabilities, Infants
[Peer reviewed] Borich, Gary; Klinzing, Garhard – Journal of Classroom Interaction, 1984
Problems in studying teacher effectiveness through the use of classroom observation are discussed. Four assumptions in the observation of classroom process are offered and ways in which these assumptions can be dealt with in designing an observation study are suggested. (DF)
Descriptors: Classroom Observation Techniques, Error of Measurement, Experimenter Characteristics, Interrater Reliability
[Peer reviewed] Rossi, Joseph S. – Teaching of Psychology, 1987
Reports a class exercise which requires students to recalculate the Chi-squares, t-tests, and one-way ANOVAs found in published psychological research articles. Describes students' reaction to the exercise and provides data on the 13% error rate they discovered. (Author/JDH)
Descriptors: Error Patterns, Higher Education, Learning Activities, Psychology
[Peer reviewed] Northam, Elizabeth; And Others – Merrill-Palmer Quarterly, 1987
Two studies concerned with agreement in ratings of temperament are reported. Ratings by mothers of toddlers and by daycare workers were compared on the Toddler Temperament Scale (Study 1), and on ratings of a videotape of a 2-year-old child for responses relevant to six dimensions of temperament (Study 2). (Author/BN)
Descriptors: Affective Behavior, Behavior Rating Scales, Interrater Reliability, Mothers
[Peer reviewed] Kinston, Warren; And Others – Journal of Marital and Family Therapy, 1987
The Family Health Scale is an instrument designed to quantify the quality of psychiatrically labeled or nonlabeled family functioning from the perspective of an external clinical observer. Clinical judgment is exercised in rating, based on information available and ideally from a valid standardized method of direct observation. Discusses…
Descriptors: Family Health, Family Problems, Foreign Countries, Observation
[Peer reviewed] Cornaire, Claudette Marie – Canadian Modern Language Review, 1988
Two studies performed in secondary schools and at the University of Ottawa investigated the usefulness of Georges Henry's 1975 short readability formula, designed specifically for French, for evaluating instructional texts. (MSE)
Descriptors: Foreign Countries, French, Higher Education, Readability Formulas
Stiggins, Richard J. – Executive Educator, 1988
Without training in objective, systematic methods of assessing student performance, teachers often must rely on poor-quality evaluation measures. This article discusses assessment problems and promotes a new Northwest Regional Educational Laboratory (Oregon) development: providing classroom assessment specialists to assist teachers with student…
Descriptors: Elementary Secondary Education, Grading, Measurement Techniques, Specialists


