Peer reviewed: Stelmachers, Zigfrids T.; Sherman, Robert E. – Suicide and Life-Threatening Behavior, 1990
Presented 33 case histories of suicidal patients to crisis workers (N=19) for ratings of short- and long-term suicide risk. Ratings revealed considerable variability, raising questions about the reliability of such global assessments. The variability, as measured by the standard deviation, was comparable between short-term and long-term ratings.…
Descriptors: Anger, At Risk Persons, Case Studies, Crisis Intervention
Peer reviewed: Owston, Ronald D.; Dudley-Marling, Curt – Journal of Research on Computing in Education, 1988
Reviews current educational software evaluation methods, highlights problems, and describes the York Educational Software Evaluation Scales (YESES), an alternative criterion-based model. Panel evaluation used by YESES is explained and YESES results are compared with evaluations from the Educational Products Information Exchange (EPIE) to indicate…
Descriptors: Comparative Analysis, Computer Assisted Instruction, Correlation, Courseware
Peer reviewed: Horowitz, Leonard M.; And Others – Journal of Consulting and Clinical Psychology, 1988
Describes the Inventory of Interpersonal Problems (IIP), a new measure designed to identify the types of interpersonal problems that people experience and the level of distress associated with them before, during, and after psychotherapy. Presents psychometric data from two studies which demonstrated high internal consistency of the IIP, high test-retest…
Descriptors: Assertiveness, Interpersonal Competence, Interpersonal Relationship, Intimacy
Peer reviewed: Cahan, Sorel – Educational and Psychological Measurement, 1989
Statistical significance and "abnormality" have been used as criteria for the evaluation of intra-individual subtest score differences. Shortcomings of these criteria are identified, and improved estimates of the true score differences are suggested. The applicability of the abnormality criterion to these improved estimates is reviewed.…
Descriptors: Estimation (Mathematics), Evaluation Methods, Individual Differences, Mathematical Models
Peer reviewed: Wilson, Mark – Journal for Research in Mathematics Education, 1990
Summarizes a reanalysis of the data from an investigation of a test designed to measure a learning sequence in geometry based on the work of van Hiele (1986). Discusses the test based on the Rasch model. (YP)
Descriptors: Geometric Concepts, Geometry, Item Analysis, Mathematical Concepts
Peer reviewed: Buckle, C. F.; Riding, R. J. – Educational Psychology: An International Journal of Experimental Educational Psychology, 1988
Focuses upon three current issues in educational evaluation. Looks at the limitations of examinations and the interpretation of results in considering reliability and validity. Discusses grading and context of learning relative to formative and summative assessment. Deals with cultural background when exploring uniformity of testing versus…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Evaluation Problems
Peer reviewed: Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1989
Three studies explored the effects of grouping versus randomized items in questionnaires on internal consistency and test-retest reliability with samples of 80, 80, and 100, respectively, university students and undergraduates. The 2 correlational and 1 experimental studies were reasonably consistent in demonstrating that neither format was…
Descriptors: Classification, College Students, Evaluation Methods, Higher Education
Peer reviewed: Casbergue, Renee M.; Greene, Jane Fell – Journal of Reading, 1988
Argues that sensory screening does not identify children at risk for reading or learning disability, and that sensory training does not improve reading or learning. (RAE)
Descriptors: Elementary Education, Learning Disabilities, Perception, Reading Difficulties
Peer reviewed: Townsend, Michael A. R.; And Others – British Journal of Educational Psychology, 1989
Describes a study that examined the effect of mood on college students' grading of 13- to 14-year-old students' essays. Research involving mood and behavior is discussed, the mood-inducing films and feelings checklists used in the study are described, and an analysis of the results is presented that does not support a mood effect in essay assessment.…
Descriptors: Analysis of Variance, Behavioral Science Research, Essays, Films
Peer reviewed: Jegede, Olugbemiro J. – Research in Science and Technological Education, 1989
Reported is a study in which secondary school students assessed their teachers for characteristics of effective science teaching. It was shown that the instrument used was highly valid and reliable and that secondary school students could effectively assess their teachers' pre-classroom characteristics, teaching behaviors, personality, and…
Descriptors: Foreign Countries, Integrated Curriculum, Reliability, Science Education
Bradley, Robert H.; And Others – American Journal on Mental Retardation, 1989
The usefulness and validity of the 3 versions (Infant-Toddler, Early Childhood, and Middle Childhood) of the HOME Inventory were studied with 261 children with cognitive, hearing, vision, or orthopedic handicaps. The Inventory in its original form and a modified form was subjected to analysis of reliability, construct validity, and criterion…
Descriptors: Concurrent Validity, Construct Validity, Disabilities, Elementary Education
Peer reviewed: Germann, Paul J. – Journal of Research in Science Teaching, 1989
Describes a paper-and-pencil test for high school biology students measuring science process skills, such as developing hypotheses; making predictions; identifying assumptions; analyzing data; and formulating conclusions. Reports some data on reliability and validity of the test. Provides all 35 items of the test. (YP)
Descriptors: Biology, Science Materials, Science Tests, Secondary Education
Peer reviewed: Marsh, Herbert W.; Ball, Samuel – Journal of Experimental Education, 1989
Agreement between two independent reviews of each of 278 manuscripts was compared on an overall recommendation and on specific rating items. Agreement between reviewers on separate dimensions, the unweighted sum of the dimensions, and various weighted sums was no better than that for the overall recommendation itself. (SLD)
Descriptors: Evaluation Methods, Factor Analysis, Interrater Reliability, Manuscripts
Peer reviewed: Hennessey, Beth Ann; Amabile, Teresa M. – Journal of Creative Behavior, 1988
The subjective judgment of observers was used to assess verbal creativity. Students, aged 5-10, told a story to accompany a picture series. Teachers rated the stories relative to one another. Interjudge reliability of the creativity measure was highly satisfactory. Two subsequent studies affirmed the results, with slightly lower interjudge…
Descriptors: Creativity, Creativity Tests, Elementary Education, Evaluation Methods
Cooper, Terence H. – Journal of Agronomic Education (JAE), 1988
Describes a study conducted to determine differences in exam reliability, difficulty, and student evaluations. Indicates that when a fourth option was added to the three-option items, the exams became more difficult. Includes methods, results discussion, and tables on student characteristics, whole test analyses, and selected items. (RT)
Descriptors: Agronomy, College Science, Error of Measurement, Evaluation Methods


