Publication Date
| In 2026 | 5 |
| Since 2025 | 627 |
| Since 2022 (last 5 years) | 2564 |
| Since 2017 (last 10 years) | 5599 |
| Since 2007 (last 20 years) | 9195 |
Descriptor
| Test Validity | 21771 |
| Test Reliability | 10011 |
| Test Construction | 5891 |
| Foreign Countries | 4955 |
| Psychometrics | 2963 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2377 |
| Higher Education | 2250 |
| Evaluation Methods | 2085 |
| College Students | 1813 |
| Correlation | 1723 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 807 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 172 |
| Spain | 169 |
| United Kingdom | 160 |
| Netherlands | 159 |
| California | 156 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedKulik, James A. – New Directions for Institutional Research, 2001
Reviews the conclusions about student ratings of teacher effectiveness on which most experts agree, cites some of the main sources of support for these conclusions, and discusses some dissenting opinions and the research support for those opinions. (EV)
Descriptors: Evaluation Problems, Evaluation Research, Student Evaluation of Teacher Performance, Test Validity
Peer reviewedOry, John C.; Ryan, Katherine – New Directions for Institutional Research, 2001
Examines student ratings of teacher effectiveness within a new framework that emphasizes six distinct aspects of validity: content, substantive, structural, generalizability, external, and consequential. Concludes that greater attention should be directed toward consequential validity, particularly how ratings are used on today's campuses and what…
Descriptors: Evaluation Research, Evaluation Utilization, Student Evaluation of Teacher Performance, Test Validity
Peer reviewedKnight, George P. – Journal of Research on Adolescence, 2000
Discusses general issues relevant to the use of measures in diverse populations. Describes the ideal strategy for developing a measure of underlying constructs. Describes the relevance of an examination of measurement equivalence in developing a measure for use in diverse populations and key issues to be addressed in measure development. (KB)
Descriptors: Cultural Differences, Measurement Techniques, Measures (Individuals), Research Problems
Peer reviewedKelly, Kevin R.; Jugovic, Heidi – Journal of Career Assessment, 2001
Data from the Keirsey Temperament Sorter II online instrument and Myers Briggs Type Indicator (MBTI) for 203 college freshmen were analyzed. Positive correlations appeared between the concurrent MBTI and Keirsey measures of psychological type, giving preliminary support to the validity of the online version of Keirsey. (Contains 28 references.)…
Descriptors: Career Exploration, Concurrent Validity, Personality Traits, Test Validity
Peer reviewedJosman, Naomi; Berney, Tikva; Jarus, Tal – Occupational Therapy Journal of Research, 2000
The Toglia Category Assessment was used to evaluate the cognitive categorization ability and the capacity to switch conceptual sets of 30 children with severe brain injuries and 30 without impairments. Brain-injured children had significantly lower scores; awareness scores were significantly correlated with performance scores. (Contains 33…
Descriptors: Children, Cognitive Processes, Neurological Impairments, Occupational Therapy
Peer reviewedKahn, Jeffrey H.; Miller, Steven A. – Measurement and Evaluation in Counseling and Development, 2000
Discusses the development and cross validation of a brief measure of the Research Training Environment Scale-Revised (RTES-R) in counseling graduate programs. Results indicate that the new RTES-R-S total scores correlate strongly with the original form and show good internal consistency. Recommends this new instrument when efficient measurement is…
Descriptors: Counselor Training, Graduate Study, Higher Education, Measurement Techniques
Peer reviewedLiddell, Debora L. – Journal of College Student Development, 1998
The Measure of Moral Orientation (MMO) is compared to semistructured interviews with college students. Results support the validity and reliability of MMO as a standardized assessment of moral voice. To understand the specific nature and context of students' moral dilemmas, however, educators must engage in meaningful, direct dialogue with…
Descriptors: College Students, Higher Education, Interviews, Moral Development
Peer reviewedCarson, Andrew D. – Journal of Career Assessment, 1998
Review of tests of musical aptitude indicates that objective assessment of such aptitude is not included in standard career assessment batteries. Reasons include a general decline in use of aptitude measures, less support for artistic/musical aptitudes compared to other fields, and lack of documentation of their applicability to career counseling.…
Descriptors: Aptitude Tests, Career Counseling, Music, Test Use
Peer reviewedD'Andrea, Michael; Daniels, Judy; Gaughen, Kiaka J. S. – Measurement and Evaluation in Counseling and Development, 1998
Examines the Worry Survey (adapted from the Adolescent Health Survey) as regards the developmental concerns of urban African American youths (N=495). Results show that internal consistency of the factor-item ratings on the Personal and Family Worry subscales were acceptable. Less evidence of the reliability was noted among the Peer and…
Descriptors: Black Youth, Counseling, Individual Development, Test Reliability
Peer reviewedKocarek, Catherine E.; Talbot, Donna M.; Batka, John C.; Anderson, Mary Z. – Journal of Counseling & Development, 2001
Examines the reliability and validity of three measures of multicultural competency, the Multicultural Counseling Awareness Scale: Form B (MCAS), the Multicultural Awareness-Knowledge-and Skills Survey (MAKSS), and the Survey of Graduate Students' Experiences with Diversity (GSEDS). The findings generally support the psychometric soundness of…
Descriptors: Competence, Counselors, Measurement Techniques, Multicultural Education
Hogan, Thomas P.; Agnello, Jessica – Educational and Psychological Measurement, 2004
This study investigates the current research practice concerning reporting measurement validity evidence based on a sample of 696 research reports listed in the American Psychological Association's Directory of Unpublished Experimental Mental Measures. Only 55% of the reports included any type of validity evidence. This was a substantially lower…
Descriptors: Psychology, Psychometrics, Test Validity, Psychological Testing
Lee, Kibeom; Ashton, Michael C. – Multivariate Behavioral Research, 2004
We introduce a personality inventory designed to measure six major dimensions of personality derived from lexical studies of personality structure. The HEXACO Personality Inventory (HEXACO-PI) consists of 24 facet-level personality trait scales that define the six personality factors named Honesty-Humility (H), Emotionality (E), Extraversion (X),…
Descriptors: Psychometrics, Personality Measures, Personality Assessment, Personality Traits
Simms, Leonard J.; Clark, Lee Anna – Psychological Assessment, 2005
This is a validation study of a computerized adaptive (CAT) version of the Schedule for Nonadaptive and Adaptive Personality (SNAP) conducted with 413 undergraduates who completed the SNAP twice, 1 week apart. Participants were assigned randomly to 1 of 4 retest groups: (a) paper-and-pencil (P&P) SNAP, (b) CAT, (c) P&P/CAT, and (d) CAT/P&P. With…
Descriptors: Personality Measures, Personality, Test Validity, Computer Assisted Testing
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2004
The theories of validity developed over the past 60 years are quite sophisticated, but the methodology of validity is not generally very effective. The validity evidence for major testing programs is typically much weaker than the evidence for more technical characteristics such as reliability. In addition, most validation efforts have a strong…
Descriptors: Test Validity, Methods, Licensing Examinations (Professions), Measurement
Toenjes, Laurence A. – Education Policy Analysis Archives, 2005
A paper appearing in this journal by Klein, Hamilton, McCaffrey and Stecher (2000) attempted to raise serious questions about the validity of the gains in student performance as measured by Texas' standardized test, the Texas Assessment of Academic Skills (TAAS). Part of their analysis was based on the results of three tests which they…
Descriptors: Standardized Tests, Test Validity, Scores, Academic Achievement

Direct link
