Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedLindblad, Torsten – System, 1992
Looks at the large-scale experiments on the testing of oral proficiency in English, French, and German that have been carried out over the last five years in the Swedish gymnasium. Various kinds of tasks and different grading criteria have been used, and the practical problems of scheduling and of teacher training have been discussed. (nine…
Descriptors: English (Second Language), Foreign Countries, French, German
Peer reviewedRichie, Mark L. – TechTrends, 1992
W. Edwards Deming built a 40-year record of quality management in Japan known as Total Quality Management (TQM). His 14 points require a change in the belief system of managers and media directors, but their implementation in government agencies and schools will produce increased time for better services, better communications, and new programs.…
Descriptors: Administration, Cost Effectiveness, Elementary Secondary Education, Higher Education
Peer reviewedLanza, Marilyn Lewis; Carifio, James – Journal of Research in Education, 1992
Attempts to improve a set of patient assault vignettes for a simulation by developing and testing a control version of the vignettes and exploring additional question and alternative scoring procedures. Influences of victim gender and assault severity are examined for 58 neuropsychiatric hospital nurses and 12 mental health experts. (SLD)
Descriptors: Allied Health Occupations Education, Attribution Theory, Construct Validity, Medical Education
Peer reviewedIsaacson, Stephen L. – Learning Disabilities Research and Practice, 1992
This review of the Test of Early Written Language concludes that the test succeeds in identifying students who are below their peers in writing and in measuring long-term gains in written language achievement; but its format makes it difficult to document specific strengths and weaknesses and its reliability; and validity have not been…
Descriptors: Early Childhood Education, Evaluation Methods, Student Evaluation, Test Reliability
Peer reviewedTalbot, Robert W. – Higher Education Management, 1992
A new technique for evaluating postsecondary education courses and improve instruction uses directed group discussion based on formative evaluation of the course concerned. It draws on Delphi technique and nominal group process but is more easily administered and produces higher user satisfaction. Its reliability has been confirmed in educational…
Descriptors: College Administration, Course Evaluation, Delphi Technique, Evaluation Methods
Peer reviewedStoskopf, Carleen H.; And Others – Evaluation Review, 1992
Data are presented that demonstrate the reliability and construct validity of a 27-item behaviorally anchored rating scale (BARS) used to rate the performance of 757 nursing assistants in South Carolina. Results support the reliability and construct validity of the BARS and the usefulness of the BARS approach for evaluation. (SLD)
Descriptors: Construct Validity, Evaluation Methods, Long Term Care, Measurement Techniques
Peer reviewedGreenan, James P.; Jarwan, Fathi A. – Career Development for Exceptional Individuals, 1992
This study focused on the validation of Generalizable Reasoning Skills assessment instruments with students with disabilities in secondary vocational programs. Results indicated that student self-ratings, teacher ratings, and a performance test were internally consistent and precise measures of reasoning skills for some uses but that most…
Descriptors: Abstract Reasoning, Disabilities, Evaluation Methods, Generalization
Peer reviewedJohnson, William L.; And Others – Teacher Education and Practice, 1992
This article briefly reviews findings from more than 250 research studies on instructional leadership and productive schools and discusses development and field testing of a needs assessment instrument for assessment of the continuing education needs of principals. (IAH)
Descriptors: Administrator Education, Educational Needs, Educational Research, Elementary Secondary Education
Peer reviewedAbu-Hilal, Maher M.; Salameh, Kayed M. – Educational and Psychological Measurement, 1992
To assess the reliability and validity of the Maslach Burnout Inventory (MBI) in a non-Western setting, the instrument was administered to 223 teachers in Jordan. Results indicate an acceptable reliability for the MBI and suggest that it has promise for use in non-Western countries. (SLD)
Descriptors: Construct Validity, Cross Cultural Studies, Developing Nations, Elementary School Teachers
Peer reviewedLindsey, Pam – Education and Training in Mental Retardation and Developmental Disabilities, 1994
The Consent Screening Interview was developed to enable consumers with mental retardation to express views and preferences about community residential placements and indicate to service providers their ability to give informed consent. Analysis of content and construct validity and interrater reliability, involving 69 subjects, revealed that the…
Descriptors: Adults, Cognitive Ability, Comprehension, Evaluation Methods
Wadsworth, John S.; Harper, Dennis C. – Journal of the Association for Persons with Severe Handicaps (JASH), 1991
Subscales of the Sheltered Care Environmental Scale dealing with conflict, cohesion, and independence were administered to 47 adults with moderate mental retardation on 4 occasions using either a verbal format or picture-cued format. Results indicated that the use of pictures enhanced the test-retest reliability of the instrument. (Author/JDD)
Descriptors: Adults, Conflict, Group Unity, Institutionalized Persons
Peer reviewedWalker, Hill M.; And Others – School Psychology Review, 1991
Psychometric characteristics and factorial replicability of the factor structure of the adolescent version (grades 7-12) of the Walker-McConnell Scale of Social Competence and School Adjustment were studied in an initial wave (n=266) of the national normative sample. The version studied has substantial utility in assessing adolescent social…
Descriptors: Adolescents, Age Differences, Factor Analysis, Factor Structure
Peer reviewedNelson, Jack K.; And Others – Research Quarterly for Exercise and Sport, 1991
Researchers studied the reliability of the modified push-up test in measuring upper body strength and endurance in elementary through college students. It also examined the accuracy of partner scoring. The test proved much easier to administer than the regular floor push-up. It was valid and reliable for all students and suitable for partner…
Descriptors: College Students, Elementary School Students, Elementary Secondary Education, High School Students
Peer reviewedShechtman, Zipora – Journal of Personnel Evaluation in Education, 1992
The interrater reliability for 33 pairs of evaluators of a group assessment procedure developed from assessment center techniques was supported in 3 studies in Israel (admission to a counseling program for 109 candidates, admission of 94 candidates to teacher training, and selection of 69 candidates for Army teaching posts). (SLD)
Descriptors: Assessment Centers (Personnel), Educational Assessment, Evaluation Methods, Evaluators
Peer reviewedShatzer, John H.; And Others – Academic Medicine, 1993
A study compared the generalizability of 36 medical students' performance scores under systematically varied station times in 2 surgery end-of-clerkship performance-based examinations. Results indicated longer station length decreased generalizability of scores by decreasing variability among students' performances. Testing time was also affected.…
Descriptors: Academic Achievement, Clinical Experience, Competency Based Education, Higher Education


