Peer reviewed. Linn, Marcia C.; And Others – Journal of Research in Science Teaching, 1987
Discusses the gender differences revealed on the science content items of the National Assessment of Educational Progress Science Assessment. Examines explanations for the differences, including differential prior instruction, differential response to uncertainty, differential response to figurally presented items, and different attitudes toward…
Descriptors: Academic Achievement, Inquiry, Prior Learning, Science Education
Peer reviewed. Yopp, Hallie Kay – Reading Research Quarterly, 1988
Investigates the reliability and validity of tests that have been used to operationalize the concept of phonemic awareness. Indicates that a combination of two tests, one related to each factor, has greater predictive validity for the beginning steps in reading acquisition than does one test alone. (JK)
Descriptors: Beginning Reading, Factor Analysis, Kindergarten, Kindergarten Children
Peer reviewed. Lou, Mimi Wheiping; And Others – Sign Language Studies, 1987
Describes a conversation measure for evaluating the communicative competence of deaf adolescents and adults in light of: 1) the rationale behind its development; 2) its independence of the subjects' language variety; and 3) its use in a study of 40 deaf adolescents. The interview protocol is given in the Appendix. (Author/LMO)
Descriptors: Adolescents, American Sign Language, Communicative Competence (Languages), Deafness
Peer reviewed. Bates, Gary W. – Journal of Reading, 1987
Questions the validity and reliability of the Reading/Everyday Activities in Life (R/EAL) test, which is intended as a self-administered, self-directed, and self-paced test of basic literacy for minority populations and others traditionally singled out by the bias of standardized reading achievement tests. (SRT)
Descriptors: Adult Education, Adult Literacy, English (Second Language), Minority Groups
Bensberg, Gerard J.; Irons, Thomas – Education and Training of the Mentally Retarded, 1986
The Vineland Adaptive Behavior Scale Classroom Edition and the American Association on Mental Deficiency Adaptive Behavior Scale School Edition were completed by teachers of 44 moderately and severely mentally retarded students; their parents (N=37) completed the Vineland Interview Edition-Survey Form. Comparison between the scales and respondents…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Comparative Analysis, Elementary Secondary Education
Peer reviewed. Grosse, Martin E.; Wright, Benjamin D. – Evaluation and the Health Professions, 1986
Based on the standard-setting procedures of the American Board of Preventive Medicine for their Core Test, this article describes how Rasch measurement can facilitate using test content judgments in setting a standard. Rasch measurement can then be used to evaluate and improve the precision of the standard and to hold it constant across time.…
Descriptors: Certification, Criterion Referenced Tests, Difficulty Level, Health Personnel
Peer reviewed. Hamada, Roger S.; Tomikawa, Sandra – Educational and Psychological Measurement, 1986
The Windward Rating Scale (WRS), a locally developed teacher rating scale of student behavior, was evaluated for potential use as a screening measure. Pre-certification ratings of 720 learning disabled students and non-special education students in grades K-6 were analyzed. Psychometric properties and diagnostic efficiency of the WRS were…
Descriptors: Concurrent Validity, Construct Validity, Diagnostic Tests, Educational Diagnosis
Peer reviewed. Grunkmeyer, Virgil – Reading Horizons, 1986
Explains the use of the Dolch List in the lower elementary grades. (FL)
Descriptors: Basal Reading, Beginning Reading, Primary Education, Reading Diagnosis
Peer reviewed. Craig-Bray, Laura; Adams, Gerald R. – Journal of Youth and Adolescence, 1986
This article studies the convergent-divergent validity and reliability estimates for clinical interview and self-report measures of ego identity. The findings suggest either (1) that the two measures may be assessing relatively distinct forms of ego identity, or (2) that the ego-identity construct as measured by the process and outcome dimensions needs…
Descriptors: College Students, Higher Education, Interpersonal Competence, Interrater Reliability
Peer reviewed. Wilson, P. R. D. – Assessment and Evaluation in Higher Education, 1986
A university economics department tested the commonly held opinion that college teachers can predict their students' eventual level of educational attainment from their personal observations of the student. A larger-than-anticipated margin of prediction error was revealed. (MSE)
Descriptors: Academic Achievement, College Faculty, College Students, Economics Education
The Attending Round Observation System: A Procedure for Describing Teaching During Attending Rounds.
Peer reviewed. Weinholtz, Donn; And Others – Evaluation and the Health Professions, 1986
Two separate reliability studies were conducted on an observational instrument derived from previous qualitative research and designed for collecting data on teaching behaviors during attending rounds. The reliability estimates from both studies were quite high, indicating that the instrument shows promise for use in both research and evaluation…
Descriptors: Clinical Teaching (Health Professions), Graduate Medical Education, Higher Education, Interrater Reliability
Peer reviewed. Holmes, Susan E. – Evaluation and the Health Professions, 1986
A specific application of test equating is described, namely that of credentialing examination programs in the health professions. Considered are: (1) the role of test equating in the credentialing process; and (2) the issues that must be considered when implementing test equating in a credentialing examination program. (Author/LMO)
Descriptors: Certification, Credentials, Data Collection, Equated Scores
Peer reviewed. Marsh, Herbert W.; And Others – American Educational Research Journal, 1985
The Self Description Questionnaire II (SDQII) results from 901 Australian secondary school students were factor analyzed and the factors correlated to identify the relationship of self-concept factors to age, sex, and academic achievement. Findings supported the multidimensionality of self-concept and the construct validity of the SDQII.…
Descriptors: Academic Achievement, Age Differences, Factor Analysis, Factor Structure
Peer reviewed. Markham, Paul – Unterrichtspraxis, 1985
Discusses psycholinguistic models of reading comprehension and presents general guidelines for reading comprehension testing in a second language. The guidelines focus on content validity, construct validity, and predictive validity. Suggestions are given for ways teachers can prepare students for tests and avoid problems in each of the three…
Descriptors: German, Language Tests, Predictive Validity, Psycholinguistics
Peer reviewed. McConaughy, Stephanie H. – School Psychology Review, 1985
The usefulness of four standardized rating scales in assessing student behavior problems is discussed: the Child Behavior Checklist (completed by parents); the Teacher Report Form; the Direct Observation Form; and the Youth Self Report. Four case studies illustrate the use of these checklists in school-based assessment. (Author/GDC)
Descriptors: Behavior Problems, Behavior Rating Scales, Case Studies, Classroom Observation Techniques


