Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedKaestner, Elisabeth; Goldstein, Marvin – Journal of Consulting and Clinical Psychology, 1977
The Sixteen Personality Factor Questionnaire (16PF) was used to determine retest reliability (7-day interval) and motivational distortion for a sample of narcotic addicts (N=141) legally committed to treatment and tested by staff for routine diagnostic purposes. (Author)
Descriptors: Drug Addiction, Institutionalized Persons, Narcotics, Personality Assessment
Peer reviewedNewmark, Charles S.; And Others – Journal of Consulting and Clinical Psychology, 1978
The standard form Minnesota Multiphasic Personality Inventory (MMPI) and two abbreviated forms were compared with direct measures of psychopathology obtained from the Brief Psychiatric Rating Scale (BPRS). The multiple correlation coefficients between the BPRS ratings and the corresponding MMPI and abbreviated-form scales were significantly high…
Descriptors: Comparative Analysis, Measurement Instruments, Measurement Techniques, Mental Disorders
Peer reviewedWalden, Brian E.; And Others – Journal of Speech and Hearing Disorders, 1977
Investigated in a series of experiments with 40 adults (20- to 70-years-old) having bilateral sensorineural hearing impairments was the test-retest reliability of the comfort level method for setting the acoustic gain of hearing aids, and the relationship between the comfort settings utilized in more realistic daily listening situations.…
Descriptors: Adults, Evaluation Criteria, Hearing Aids, Hearing Impairments
Peer reviewedKoran, Lorrin M.; And Others – Journal of Medical Education, 1978
To fully develop their diagnostic skills, medical students must recognize the limited reliability of the observations on which diagnoses are based. Study of 36 second-year students shows multiple sources of observer variation in readings of systolic and diastolic blood pressures. (LBH)
Descriptors: Clinical Diagnosis, Higher Education, Medical Education, Medical Students
Peer reviewedFleming, Dan B. – Peabody Journal of Education, 1977
Descriptors: Accountability, Evaluation Methods, Social Studies, Standardized Tests
Peer reviewedSawin, Enoch I. – Studies in Educational Evaluation, 1976
Problems associated with current expertise in evaluation are discussed. Since evaluators are not always able to reliably achieve all levels of an evaluation project, these tasks are categorized into five levels of complexity. The author suggests a more accurate label for evaluators, "descriptive inquiry specialists," and includes guidelines for…
Descriptors: Curriculum Evaluation, Elementary Secondary Education, Evaluation Criteria, Evaluation Methods
Peer reviewedHanna, Gerald S. – Journal of Educational Measurement, 1977
The effects of providing total and partial immediate feedback to pupils in multiple choice testing was investigated with fifth and sixth grade pupils. The split-half reliability was higher with total feedback than with no feedback. Concurrent validity with a completion test showed all three settings to be nearly identical. (Author/JKS)
Descriptors: Elementary Education, Elementary School Students, Feedback, Forced Choice Technique
Peer reviewedLord, Frederic M. – Journal of Educational Measurement, 1977
Two approaches for determining the optimal number of choices for a test item, presently in the literature, are compared with two new approaches. (Author)
Descriptors: Forced Choice Technique, Latent Trait Theory, Multiple Choice Tests, Test Items
Peer reviewedLord, Frederic M. – Journal of Educational Measurement, 1977
A variety of practical applications of item characteristic curve test theory are discussed. Among these applications are tailored testing, two stage testing, determining whether two tests measure the same latent trait, and measuring item bias towards minority or other groups. (Author/JKS)
Descriptors: Computer Programs, Latent Trait Theory, Mastery Tests, Measurement
Peer reviewedBurns, Edward – Journal of School Psychology, 1977
Studied the degree to which skewed score distributions can affect the interpretation of Illinois Test of Psycholinguistic Abilities (ITPA) Results suggest indices of score variability such as average deviation and standard scores must be interpreted with extreme caution when skewness is a significant factor. (Author)
Descriptors: Diagnostic Tests, Individual Psychology, Perception Tests, Psycholinguistics
Peer reviewedArndt, William B. – Journal of Speech and Hearing Disorders, 1977
In evaluating the Northwestern Syntax Screening Test (a test for assessing expressive and receptive grammar in preschool and primary age children), the author points out problems with the test norms, reliability, and validity. (SBH)
Descriptors: Early Childhood Education, Grammar, Language Tests, Screening Tests
Peer reviewedByrne, Margaret C. – Journal of Speech and Hearing Disorders, 1977
The author responds to W. Arndt's criticisms of the Northwestern Syntax Screening Test, a test for assessing receptive and expressive grammar in young children. (SBH)
Descriptors: Early Childhood Education, Grammar, Language Tests, Screening Tests
Peer reviewedCairns, E. – British Journal of Educational Psychology, 1977
It would appear that there is a lack of convincing evidence especially regarding the reliability of the Matching Familiar Figures test over short intervals and with older children. As the test is now being used for diagnostic purposes in education, more information is required, and here the MFF is examined in older children using a split-half…
Descriptors: Cognitive Ability, Educational Psychology, Elementary School Students, Information Processing
Peer reviewedHawthorne, Linda White; Larsen, Stephen C. – Journal of Learning Disabilities, 1977
Descriptors: Exceptional Child Research, Kindergarten, Learning Disabilities, Prediction
Peer reviewedBrown, Eleese V. – Perceptual and Motor Skills, 1977
Descriptors: Early Childhood Education, Elementary Education, Freehand Drawing, General Education


