Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Camara, Wayne J. – 1986
Previous efforts to investigate the equivalence of rating sources for job analysis ratings have reported conflicting results. In the present research, correlational and generalizability analyses were conducted to examine the equivalency of rating sources for over 70 state civil service job classifications. Incumbent and supervisor ratings (N=697)…
Descriptors: Evaluators, Generalizability Theory, Interrater Reliability, Job Analysis
Horowitz, Frances Degen – 1987
Discussed are methodological aspects of three symposium papers on process approaches to individual differences in infancy. Fagan's (1987) research is viewed as an important contribution to the growing literature that demonstrates that process measures, that is, information processing behaviors, may provide a useful reflection of early to later…
Descriptors: Attention, Conference Papers, Individual Differences, Infant Behavior
De Leo, Diego; And Others – 1987
This study was a preliminary step in gathering reliable data on suicides and suicide attempts in Padua, Italy. Data were collected from the first aid department of the Padua general hospital, 67 general practitioners in the city, staff of a night-time and holiday home-call medical service, the reanimation department of the Padua general hospital,…
Descriptors: Comparative Analysis, Data Collection, Death, Foreign Countries
van Gelderen, A. – 1987
At the Educational Research Centre (S.C.O.) in Amsterdam, a study determined the applicability and construct validity of ratings of speaking performances by examining tape-recordings of subjects in four dimensions. Subjects were 200 pupils of 11 and 12 years of age, and performances on four different oral tasks were investigated. The rating…
Descriptors: Communication Research, Construct Validity, Elementary Education, Foreign Countries
Patience, Wayne; Auchter, Joan – 1988
A central aim in any assessment program is to ensure fair and stable scoring from administration to administration. When administrations are decentralized, not only in location, but in frequency and in logistical configuration, it is imperative to construct training, certifying, and monitoring systems that provide continuity between the original…
Descriptors: Equivalency Tests, Essay Tests, Scoring, Secondary Education
Schratz, Mary K. – 1984
To explore the appropriateness of the Rasch model for the vertical equating of a multi-level, multi-form achievement test series, both the Rasch model and the traditional Thurstone procedures were applied to the Listening Comprehension subtest scores of the Stanford Achievement Test. Two adjacent levels of these tests were administered in 1981 to…
Descriptors: Achievement Tests, Elementary Secondary Education, Equated Scores, Latent Trait Theory
Engle, Molly – 1984
Narrative data yield rich detail, insight, and information, but the personal and situational characteristics of coders (called value inertia and cognitive limitation biases) can affect data reduction. The effects of coder exposure to expected project outcomes and the level of coder research methodology sophistication were investigated. Coders,…
Descriptors: Content Analysis, Data Analysis, Educational Objectives, Experimenter Characteristics
Norman, G. R. – 1984
The use of healthy individuals acting as simulated patients for the purpose of clinical teaching is discussed. The term "standardized patients" is used to refer to training the individual to present a standard, repeatable stimulus. Some evidence suggests that simulated patients possess high fidelity (i.e., closely approximate real…
Descriptors: Clinical Teaching (Health Professions), Higher Education, Medical Education, Patients
St. Louis, Kenneth O.; Ruscello, Dennis M. – 1981
Although speech-language pathologists are expected to be able to administer and interpret oral examinations, there are currently no screening tests available that provide careful administration instructions and data for intra-examiner and inter-examiner reliability. The Oral Speech Mechanism Screening Examination (OSMSE) is designed primarily for…
Descriptors: Physiology, Screening Tests, Speech Evaluation, Speech Pathology
Subkoviak, Michael J. – 1985
Current methods of obtaining reliability coefficients for mastery tests are laborious from a practitioner's perspective. Some methods require two test administrations; while others require access to computer facilities and/or advanced measurement and statistical procedures. This report provides tables from which practitioners can read such…
Descriptors: Estimation (Mathematics), Mastery Tests, Statistical Studies, Tables (Data)
Peer reviewedSalvia, John; And Others – Exceptional Children, 1974
Descriptors: Elementary Education, Emotional Disturbances, Exceptional Child Research, Identification
Peer reviewedDulin, Kenneth L.; Chester, Robert D. – Journal of Reading, 1974
Concludes that the Estes Scale is an effective instrument for measuring levels of positive attitude toward books and reading. (RB)
Descriptors: Evaluation Methods, Reading Diagnosis, Reading Research, Secondary Education
Peer reviewedKoehler, Roger A. – Journal of Educational Measurement, 1974
The purposes of the study were to develop a measure of overconfidence on probabilistic tests, to assess the measurement characteristics of such a measure, and to investigate the relationship of overconfidence on tests to knowledge and to risk-taking propensity. (Author/BB)
Descriptors: Confidence Testing, Measurement Techniques, Multiple Choice Tests, Risk
Peer reviewedHobbs, Tom R.; Fowler, Raymond D. – Journal of Consulting and Clinical Psychology, 1974
The reliability of an abbreviated form of the Minnesota Multiphasic Personality Inventory (MMPI), the Mini-Mult, and its degree of correspondence with the MMPI were evaluated with a sample of 60 hospitalized schizophrenic veterans. The major results indicate respectable validity and reliability coefficients for most Mini-Mult Scales. (Author)
Descriptors: Behavioral Science Research, Personality Assessment, Psychological Testing, Schizophrenia
Heikkinen, Michael W. – 1978
The author presents a variation of Q-sort testing appropriate for the examination and quantification of responses to questions concerning teacher values and attitudes relating to teaching style. Specifically, the method is designed for use in examining the four "families" of teaching models developed in Joyce and Weil's taxonomy: social…
Descriptors: Measurement Techniques, Q Methodology, Rating Scales, Reliability


