Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Zwiebel, Avraham; Wolff, Anthony B. – ACEHI Journal, 1988
The study compared results of rating "Draw a Person" drawings of 250 deaf Israeli children aged 7-15 with those of 54 American deaf children of the same ages and 100 hearing Israeli children. Implications of the findings concerning instrument reliability, emotional development in deaf children, and cross-cultural aspects are considered.…
Descriptors: Adolescents, Children, Cross Cultural Studies, Cultural Differences
Greenwood, L. K.; Morton, L. L. – B. C. Journal of Special Education, 1989
Evaluation of a checklist by teachers to rate 60 secondary level learning-disabled and nondisabled students for mainstream competencies found no overall mean group differences and no inter-rater reliability, though ratings on work habits did predict course grades for all three groups (fully mainstreamed, partially mainstreamed, and nondisabled).…
Descriptors: Check Lists, Grades (Scholastic), Learning Disabilities, Mainstreaming
Peer reviewedDuffy, Patrick J. – CUPA Journal, 1989
Aspects of the new Employee Polygraph Protection Act are discussed, including exemptions, prohibited devices, limitations, exceptions, injury and access requirements, reasonable suspicion, drug industry investigations, procedural requirements, disclosure, basis for discharge, enforcement and remedies, and preemption and existing state laws. (MSE)
Descriptors: Civil Liberties, College Administration, Employer Employee Relationship, Federal Legislation
Peer reviewedManfredo, Michael J.; Shelby, Bo – Journal of Social Psychology, 1988
Examined the accuracy and validity of self-reports and their effect on tests of attitude-behavior relationships. Self-reports were reasonably accurate, but they produced results different from actual behavior in attitude-behavior tests. Concludes that self-reports and actual behavior should be measured, tested, and modeled separately. (Author/LS)
Descriptors: Attitude Measures, Behavior Patterns, Behavioral Science Research, Predictor Variables
Peer reviewedSoper, John C.; Walstad, William B. – Theory and Research in Social Education, 1988
Explores the reliability and validity of an affective domain instrument, the "Survey on Economic Attitudes," by providing new norms and a discussion of the properties of the national sample of high school students used. Presents current information about the economic attitudes of U.S. high school students and raises some important…
Descriptors: Affective Measures, Economics, Educational Assessment, Educational Research
Peer reviewedWoodruff, David J.; Sawyer, Richard L. – Applied Psychological Measurement, 1989
Two methods--non-distributional and normal--are derived for estimating measures of pass-fail reliability. Both are based on the Spearman Brown formula and require only a single test administration. Results from a simulation (n=20,000 examinees) and a licensure examination (n=4,828 examinees) illustrate these methods. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Licensing Examinations (Professions), Measures (Individuals)
Peer reviewedStansfield, Charles W.; Ross, Jacqueline – Language Testing, 1988
Outlines research necessary for determining the validity and reliability of Test of Written English, an essay test that directly measures writing ability and complements Test of English-as-a-Foreign-Language's (TOEFL) indirect assessment of writing skills. Research should cover such aspects as construct, criterion-related, concurrent, content, and…
Descriptors: English (Second Language), Essay Tests, Language Research, Language Tests
Bucher, Dale E.; Brolin, Donn E. – Diagnostique, 1987
The article describes the development of a Knowledge Battery to assess the competency level of secondary special education students receiving instruction in the Life-Centered Career Education Curriculum. The Battery consists of 89 subcompetency tests with items selected, validity, reliability, cutting scores determined, and standardization…
Descriptors: Career Education, Competency Based Education, Cutting Scores, Disabilities
Peer reviewedDuszak, Zbigniew; Koczkodaj, Waldemar W. – Library Software Review, 1994
Discussion of CD-ROM evaluation and selection processes focuses on a pairwise comparison method for knowledge acquisition with a consistency measure as a validation technique. Highlights include human experts' preferences and judgments; a CD-ROM selection model, including format, contents, usage, and technical considerations; and consistency…
Descriptors: Academic Libraries, Comparative Analysis, Evaluation Methods, Higher Education
Peer reviewedGamble, Wendy C; Woulbroun, E. Jeanne – Early Education and Development, 1995
Examined how young children participating in early childhood programs perceive the social support they receive. Results indicated that pre- and early elementary school-age children can respond to questions about their social support networks in reasonably reliable and valid ways. Significant correlations with indices of perceived competence and…
Descriptors: Child Caregivers, Childhood Attitudes, Early Childhood Education, Interpersonal Relationship
Peer reviewedJacobs, Stanley S. – Research in Higher Education, 1995
Comparison of college freshman performance on two different forms of the California Critical Thinking Skills Test (n=684, 692) found a lack of equivalence between forms and low internal consistency reliability. It is suggested that, although the test may be useful for research, it is not appropriate for decision making about individual students.…
Descriptors: College Freshmen, Comparative Analysis, Critical Thinking, Educational Research
Peer reviewedChapman, Loden J.; And Others – Developmental Review, 1994
Argues that individual and group differences in priming performance scores are heavily influenced by overall speed and accuracy, and thus are a flawed reflection of internal activation of semantic priming. Suggests that meaningful comparison of groups on the activation underlying priming difference scores requires removing the effects of overall…
Descriptors: Children, Cognitive Processes, Comparative Analysis, Data Analysis
Peer reviewedYoon, Lanju Lee – Journal of the American Society for Information Science, 1994
Describes a study that explored the relationship between the number of cited references used in a citation search and retrieval effectiveness, analyzing the overlap among posting sets retrieved by various combinations of cited references. Findings showed that the more cited references used for a citation search result, the better the performance.…
Descriptors: Citation Indexes, Citations (References), Information Retrieval, Online Searching
Peer reviewedUpshur, John A.; Turner, Carolyn E. – ELT Journal, 1995
Reviews the place of rating scales in second-language measurement and summarizes some of the problems associated with them. Standard and alternative scales were studied. High agreement among raters can be achieved even under conditions not favorable to high interrater reliability. The full range of score categories are effectively utilized. (17…
Descriptors: Evaluation Problems, Interrater Reliability, Language Tests, Measurement Techniques
Peer reviewedJohnson, Nancy E.; And Others – Assessment, 1994
Development of an alternate form of Raven's Standard Progressive Matrices Test is described. Reliability analysis with 449 children of differing racial/ethnic backgrounds showed good reliability and comparable predictive validity. The alternate form is a promising research tool. (SLD)
Descriptors: Children, Ethnic Groups, Intelligence Tests, Matrices


