Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedTakashima, Hideyuki – British Journal of Language Teaching, 1987
Two native and one non-native (Japanese) instructors of English-as-a-foreign-language (EFL) corrected free compositions written by a Japanese college graduate with a degree in English. Analysis of the corrections revealed marked differences in type and number, with the non-native speaker most frequently indicating difficulty with articles, word…
Descriptors: Case Studies, English (Second Language), Error Analysis (Language), Interrater Reliability
Peer reviewedRhone, Lorna M. – Journal of School Psychology, 1986
Explored the reliability and validity for four self-report anxiety scales: the Test Anxiety Scale for Children, the Alpert-Haber Achievement Anxiety Scale for Children, the State-Trait Anxiety Inventory for Children, and a newly developed Reading Anxiety Scale. Conclusions are drawn regarding the use of these scales with minority-group…
Descriptors: Achievement, Adolescents, Anxiety, Black Students
Peer reviewedRichardson, Kim – Reading Teacher, 1987
Examines the Decoding Skills Test (DST), a criterion referenced, diagnostic instrument designed to identify deficiencies in word decoding. (JC)
Descriptors: Context Clues, Criterion Referenced Tests, Decoding (Reading), Diagnostic Tests
Marston, Douglas; And Others – Diagnostique, 1986
Two studies of third- through sixth-grade low-achievers (N=83) and 26 third-graders (some receiving special education services) compared traditional standardized achievement tests and alternative curriculum-based measures. Results indicated that curriculum-based measures were more sensitive in assessing student progress in reading and writing and…
Descriptors: Achievement Tests, Curriculum, Disabilities, Evaluation Methods
Peer reviewedLinn, Marcia C.; And Others – Journal of Research in Science Teaching, 1987
Discusses the gender differences revealed on the science content items of the National Assessment of Educational Progress Science Assessment. Examines explanations for the differences, including differential prior instruction, differential response to uncertainty, differential response to figurally presented items, and different attitudes toward…
Descriptors: Academic Achievement, Inquiry, Prior Learning, Science Education
Peer reviewedYopp, Hallie Kay – Reading Research Quarterly, 1988
Investigates the reliability and validity of tests that have been used to operationalize the concept of phonemic awareness. Indicates that a combination of two tests, one related to each factor, has greater predictive validity for the beginning steps in reading acquisition than does one test alone. (JK)
Descriptors: Beginning Reading, Factor Analysis, Kindergarten, Kindergarten Children
Peer reviewedLou, Mimi Wheiping; And Others – Sign Language Studies, 1987
Describes a conversation measure for evaluation communicative competence of deaf adolescents and adults in light of: 1) the rationale behind its development; 2) its independence of the subjects' language variety; and 3)its use in a study of 40 deaf adolescents. The interview protocal is give in the Appendix. (Author/LMO)
Descriptors: Adolescents, American Sign Language, Communicative Competence (Languages), Deafness
Peer reviewedBates, Gary W. – Journal of Reading, 1987
Questions the validity and reliability of the Reading/Everyday Activities in Life (R/EAL) test, which is intended as a self-administered, self-directed, and self-paced test of basic literacy for minority populations and others traditionally singled out by the bias of standardized reading achievement tests. (SRT)
Descriptors: Adult Education, Adult Literacy, English (Second Language), Minority Groups
Bensberg, Gerard J.; Irons, Thomas – Education and Training of the Mentally Retarded, 1986
The Vineland Adaptive Behavior Scale Classroom Edition and the American Association on Mental Deficiency Adaptive Behavior Scale School Edition were completed by teachers of 44 moderately and severely mentally retarded students; their parents (N=37) completed the Vineland Interview Edition-Survey Form. Comparison between the scales and respondents…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Comparative Analysis, Elementary Secondary Education
Peer reviewedGrosse, Martin E.; Wright, Benjamin D. – Evaluation and the Health Professions, 1986
Based on the standard setting procedures or the American Board of Preventive Medicine for their Core Test, this article describes how Rasch measurement can facilitate using test content judgments in setting a standard. Rasch measurement can then be used to evaluate and improve the precision of the standard and to hold it constant across time.…
Descriptors: Certification, Criterion Referenced Tests, Difficulty Level, Health Personnel
Peer reviewedHamada, Roger S.; Tomikawa, Sandra – Educational and Psychological Measurement, 1986
The Windward Rating Scale (WRS), a locally-developed teacher rating scale of student behavior, was evaluated for potential use as a screening measure. Pre-certification ratings of 720 learning disabled students and non-special education students in grades K-6 were analyzed. Psychometric properties and diagnostic efficiency of the WRS were…
Descriptors: Concurrent Validity, Construct Validity, Diagnostic Tests, Educational Diagnosis
Peer reviewedGrunkmeyer, Virgil – Reading Horizons, 1986
Explains the use of the Dolch List in the lower elementary grades. (FL)
Descriptors: Basal Reading, Beginning Reading, Primary Education, Reading Diagnosis
Peer reviewedCraig-Bray, Laura; Adams, Gerald R. – Journal of Youth and Adolescence, 1986
This article studies the convergent-divergent validity and reliability estimates for clinical interview and self-report measures of ego identity. The findings suggest that the two measures may be: (1) assessing relatively distinct forms of ego identity; or (2) that the ego-identity construct as measured by the process and outcome dimensions needs…
Descriptors: College Students, Higher Education, Interpersonal Competence, Interrater Reliability
Peer reviewedWilson, P. R. D. – Assessment and Evaluation in Higher Education, 1986
A university economics department tested the commonly held opinion that college teachers can predict their students' eventual level of educational attainment from their personal observations of the student. A larger-than-anticipated margin of prediction error was revealed. (MSE)
Descriptors: Academic Achievement, College Faculty, College Students, Economics Education
The Attending Round Observation System: A Procedure for Describing Teaching During Attending Rounds.
Peer reviewedWeinholtz, Donn; And Others – Evaluation and the Health Professions, 1986
Two separate reliability studies were conducted on an observational instrument derived from previous qualitative research and designed for collecting data on teaching behaviors during attending rounds. The reliability estimates from both studies were quite high, indicating that the instrument shows promise for use in both research and evaluation…
Descriptors: Clinical Teaching (Health Professions), Graduate Medical Education, Higher Education, Interrater Reliability


