Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedFagot, Beverly I.; O'Brien, Marion – Merrill-Palmer Quarterly, 1994
Two studies: evaluated the consistency of toddlers' motor activity level over time and across situations; and examined the relations between several measures of activity level and ratings of problem behavior in toddlers. Found that activity level was stable when measured by the same methods in the same situation but not across methods or across…
Descriptors: Age Differences, Behavior Problems, Behavior Rating Scales, Context Effect
Rose, Terry L.; And Others – Diagnostique, 1990
The Developmental Activities Screening Inventory II (DASI-II) was administered to a group of 13 toddlers with severe disabilities. Results indicated that interrater reliability was statistically significant across raters for all scores reported by the DASI-II and that test-retest stability was statistically significant across administrations and…
Descriptors: Clinical Diagnosis, Developmental Psychology, Diagnostic Tests, Educational Diagnosis
Peer reviewedEiting, Mindert H. – Applied Psychological Measurement, 1991
A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewedClift, John; And Others – Assessment and Evaluation in Higher Education, 1989
After 98 lecture and 43 course evaluations, rating scales used by students to evaluate faculty at Victoria University of Wellington (New Zealand) were subjected to factor analysis. It was found that pooling ratings from different teaching situations made possible preparation of a teaching performance profile improving the information's reliability…
Descriptors: Case Studies, Evaluation Methods, Faculty Evaluation, Foreign Countries
Peer reviewedPlake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991
Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)
Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback
Peer reviewedFehrmann, Melinda L.; And Others – Educational and Psychological Measurement, 1991
Two frame-of-reference rater training approaches were compared for effects on reliability and accuracy of cutoff scores generated by 21 raters using Angoff methods on tests taken by 155 undergraduates. Both approaches result in higher interrater reliability and more accuracy than does a non-frame-of-reference method. (SLD)
Descriptors: Cutting Scores, Evaluators, Generalizability Theory, Higher Education
Peer reviewedFreund, Lisa S.; Reiss, Allan L. – Research in Developmental Disabilities, 1991
Parent and teacher ratings on the Aberrant Behavior Checklist with an outpatient sample of 110 children, adolescents, and young adults with mental retardation found that the 5-factor structure of both parent and teacher data corresponded very well with the 5 factors originally obtained from staff ratings of mentally retarded inpatients. (Author/DB)
Descriptors: Adolescents, Behavior Problems, Behavior Rating Scales, Check Lists
Desmedt, John; Yelon, Stephen – Performance and Instruction, 1992
Elementary performance tests and situational or simulation tests may be combined for comprehensive testing of open skills, i.e., a worker's competency in reacting to unpredictable situations. Elementary performance tests capture the professional skills whereas simulation tests retain realism and complexity and allow variation in responses.…
Descriptors: Achievement Tests, Guidelines, Industrial Training, Job Training
Peer reviewedParker, Richard; And Others – Journal of Special Education, 1992
Twenty years of research on the Maze, a classroom-based reading measure for students with learning disabilities, is summarized. Overall, the research is supportive of the Maze. However, the most common version requires revision to obtain minimum construct validity, and additional research is needed on reliability and usefulness of alternate forms.…
Descriptors: Cloze Procedure, Construct Validity, Elementary Secondary Education, Learning Disabilities
Peer reviewedTustin, R. Don; And Others – Australia and New Zealand Journal of Developmental Disabilities, 1991
Analysis of data from the Behaviour Disorder Scale for 405 adults and adolescents with intellectual disability revealed that subjects exhibited a mean of 14 problem behaviors per person. Factor analysis identified two general syndromes (a conduct syndrome and an emotional syndrome), raising the possibility of dealing with sets of problem behaviors…
Descriptors: Adolescents, Adults, Behavior Disorders, Behavior Patterns
Peer reviewedLuzzo, Darrell Anthony – Measurement and Evaluation in Counseling and Development, 1993
Assessed reliability and validity of Career Decision-Making Self-Efficacy Scale (CDMSES). Findings from 233 community college students who completed CDMSES, Career Maturity Inventory, Decision-Making Scale of the Career Development Inventory, and demographic questionnaire generally support reliability and validity of CDMSES as measure of community…
Descriptors: Career Choice, Career Development, Community Colleges, Decision Making
Peer reviewedMcGaghie, William C.; And Others – Evaluation and the Health Professions, 1993
Systematic scale-development procedures, reliability analyses on 2,852 medical students (3 samples), and factor analysis were used to develop and refine a scale reflecting attitudes about pulmonary disease prevention. Development and verification samples included 110 and 2,691 students, respectively. The scale is promising for health education and…
Descriptors: Attitude Measures, Disease Control, Health Education, Higher Education
Peer reviewedvan Berckelaer-Onnes, Ina; van Duijn, Gijs – Journal of Autism and Developmental Disorders, 1993
Seventy-two children (ages 23-148 months) referred to an autism center in the Netherlands were administered the Psychoeducational Profile (PEP) and the Handicaps Behaviour and Skills Schedule (HBS). The correlation between the two instruments was higher than expected, and internal consistency of the subscales of the PEP and the HBS was…
Descriptors: Autism, Behavior Rating Scales, Child Development, Children
Peer reviewedO'Dell, Cynthia D.; And Others – Mental Retardation, 1993
A vision screening program established at a facility for 271 individuals with severe or profound mental retardation used the acuity card procedure as its measure. The procedure was found to be a valid and reliable screening tool for this population. A few residents had good visual acuities, whereas the acuities of others were poor. (JDD)
Descriptors: Adults, Children, Institutionalized Persons, Program Effectiveness
Naglieri, Jack A.; Bardos, Achilles N. – Diagnostique, 1990
The Bracken Basic Concept Scale, for use with preschool and primary-aged children, determines a child's school readiness and knowledge of English-language verbal concepts. The instrument measures 258 basic concepts in such categories as comparisons, time, quantity, and letter identification. This paper describes test administration, scoring and…
Descriptors: Concept Formation, Diagnostic Tests, Early Childhood Education, School Readiness Tests


