Publication Date
| In 2026 | 1 |
| Since 2025 | 380 |
| Since 2022 (last 5 years) | 1490 |
| Since 2017 (last 10 years) | 3358 |
| Since 2007 (last 20 years) | 5206 |
Descriptor
| Test Reliability | 10004 |
| Test Validity | 10004 |
| Test Construction | 3338 |
| Foreign Countries | 2936 |
| Psychometrics | 1830 |
| Factor Analysis | 1677 |
| Measures (Individuals) | 1333 |
| Evaluation Methods | 955 |
| Questionnaires | 933 |
| College Students | 870 |
| Factor Structure | 851 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 297 |
| Practitioners | 226 |
| Teachers | 84 |
| Administrators | 61 |
| Policymakers | 27 |
| Counselors | 25 |
| Students | 13 |
| Parents | 9 |
| Community | 5 |
| Support Staff | 5 |
Location
| Turkey | 695 |
| China | 175 |
| Australia | 171 |
| Canada | 146 |
| Indonesia | 123 |
| Spain | 106 |
| Taiwan | 91 |
| United States | 86 |
| Germany | 83 |
| United Kingdom | 82 |
| Malaysia | 77 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Peer reviewedLindblad, Torsten – System, 1992
Looks at the large-scale experiments on the testing of oral proficiency in English, French, and German that have been carried out over the last five years in the Swedish gymnasium. Various kinds of tasks and different grading criteria have been used, and the practical problems of scheduling and of teacher training have been discussed. (nine…
Descriptors: English (Second Language), Foreign Countries, French, German
Peer reviewedIsaacson, Stephen L. – Learning Disabilities Research and Practice, 1992
This review of the Test of Early Written Language concludes that the test succeeds in identifying students who are below their peers in writing and in measuring long-term gains in written language achievement; but its format makes it difficult to document specific strengths and weaknesses and its reliability; and validity have not been…
Descriptors: Early Childhood Education, Evaluation Methods, Student Evaluation, Test Reliability
Peer reviewedGreenan, James P.; Jarwan, Fathi A. – Career Development for Exceptional Individuals, 1992
This study focused on the validation of Generalizable Reasoning Skills assessment instruments with students with disabilities in secondary vocational programs. Results indicated that student self-ratings, teacher ratings, and a performance test were internally consistent and precise measures of reasoning skills for some uses but that most…
Descriptors: Abstract Reasoning, Disabilities, Evaluation Methods, Generalization
Peer reviewedSmith-Sebasto, N. J. – Journal of Environmental Education, 1992
A study reveals the need for extensive refinement of the Revised Perceived Environmental Control Measure purported in the past to be a reliable and valid instrument to measure the relationship between the psychological construct, "locus of control," and environmental action or environmentally responsible behavior. (MCO)
Descriptors: Behavior, Behavioral Science Research, Concurrent Validity, Construct Validity
Peer reviewedJohnson, William L.; And Others – Teacher Education and Practice, 1992
This article briefly reviews findings from more than 250 research studies on instructional leadership and productive schools and discusses development and field testing of a needs assessment instrument for assessment of the continuing education needs of principals. (IAH)
Descriptors: Administrator Education, Educational Needs, Educational Research, Elementary Secondary Education
Peer reviewedLindsey, Pam – Education and Training in Mental Retardation and Developmental Disabilities, 1994
The Consent Screening Interview was developed to enable consumers with mental retardation to express views and preferences about community residential placements and indicate to service providers their ability to give informed consent. Analysis of content and construct validity and interrater reliability, involving 69 subjects, revealed that the…
Descriptors: Adults, Cognitive Ability, Comprehension, Evaluation Methods
Peer reviewedSevin, Jay A.; And Others – Journal of Autism and Developmental Disorders, 1991
This study, involving 24 children or adolescents with pervasive developmental disorders, assessed 3 autism scales: Autism Behavior Checklist, Real Life Rating Scale, and Childhood Autism Rating Scale. The study analyzed interrater reliability, correlations between pairs of the three scales, diagnostic classification cutoff scores, and…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Check Lists, Educational Diagnosis
Peer reviewedNelson, Jack K.; And Others – Research Quarterly for Exercise and Sport, 1991
Researchers studied the reliability of the modified push-up test in measuring upper body strength and endurance in elementary through college students. It also examined the accuracy of partner scoring. The test proved much easier to administer than the regular floor push-up. It was valid and reliable for all students and suitable for partner…
Descriptors: College Students, Elementary School Students, Elementary Secondary Education, High School Students
Peer reviewedBeck, Diane E.; Clayton, Anne G. – American Journal of Pharmaceutical Education, 1990
The development and testing of a measure of pharmacy student performance in the clinical setting is described. Participating faculty refined an instrument for its ability to measure eight behavioral objectives of patient presentation. Items were of Likert or dichotomous format and measured problem-solving and communication skills. (Author/MSE)
Descriptors: Academic Achievement, Behavioral Objectives, Clinical Experience, Communication Skills
Peer reviewedRescorla, Leslie – Topics in Language Disorders, 1991
Two parent report inventories geared at assessing language skills in toddlers are examined, in terms of reliability, validity, and application. The MacArthur Communicative Development Inventory: Toddlers and the Language Development Survey are compared and their combined use in handicap identification is discussed. (PB)
Descriptors: Communication Skills, Diagnostic Tests, Evaluation Methods, Handicap Identification
Lazarus, Belinda; Killu, Kim – Diagnostique, 1999
This article describes the second edition of the Attention Deficit Disorders Evaluation Scale (ADDES II), an individually administered behavior rating scale developed to assist in the identification and service of children with attention deficit hyperactivity disorders. Its purpose, administration, interpretation of scores, standardization,…
Descriptors: Attention Deficit Disorders, Behavior Rating Scales, Children, Disability Identification
Peer reviewedAllessandrini, Cristina Dias; Duarte, Jose Luclano Miranda; Bianco, Marisa Fernandes; Dupas, Margarida Azevedo – Art Therapy: Journal of the American Art Therapy Association, 1998
The Silver Drawing Test of Cognition and Emotion was standardized for Brazilian children (N=2,000). ANOVA results are presented for age and education groups from early grades on, including distinguishing adult education levels; results are compared for U.S. and Brazilian populations. Growth in test scores, emotional content responses, and…
Descriptors: Art Education, Art Therapy, Cross Cultural Studies, Elementary Secondary Education
Peer reviewedNash, John B.; Moroz, Pauline A. – Journal of Educational Computing Research, 1997
This study, utilizing data from 208 educators, obtained estimates of the reliability of the four subscale version of the 40-item Computer Attitude Scale (CAS); provided detailed information regarding the factor patterns of the CAS subscales; and provided evidence about the differential validity of the CAS among four groups with differing intensity…
Descriptors: Computer Anxiety, Computer Attitudes, Computer Literacy, Evaluation Methods
Peer reviewedMoulton, Caryn E.; Coplan, Robert J.; Mills, Catherine – Canadian Journal of Research in Early Childhood Education, 1999
This study examined preliminary psychometric properties of the Teaching Practices Observation Scale (TPOS), a newly developed observational taxonomy for assessing teacher behaviors during free play with young children. Behaviors of 42 child caregivers and junior kindergarten teachers were coded using a combination of time-sampling, event-sampling,…
Descriptors: Child Caregivers, Educational Practices, Measurement Techniques, Measures (Individuals)
Cho, Kwangsu; Schunn, Christian D.; Wilson, Roy W. – Journal of Educational Psychology, 2006
Although peer reviewing of writing is a way to create more writing opportunities in college and university settings, the validity and reliability of peer-generated grades are a major concern. This study investigated the validity and reliability of peer-generated writing grades of 708 students across sixteen different courses from four universities…
Descriptors: Test Validity, Test Reliability, Peer Evaluation, Writing (Composition)

Direct link
