Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedClark, Kenneth – Mathematics Teacher, 1999
Explains and demonstrates a procedure that is commonly used to determine the reliability of a test in such a way that a person who has modest arithmetical skills can carry out the same analysis on a classroom test or examination. (ASK)
Descriptors: Mathematics Education, Secondary Education, Secondary School Mathematics, Test Construction
Peer reviewedMiville, Marie L.; Gelso, Charles J.; Pannu, Raji; Liu, Will; Touradji, Pegah; Holloway, Pauline; Fuertes, Jairo – Journal of Counseling Psychology, 1999
Describes results of study of a 45-item scale developed to measure the construct and administered to four separate samples. The Miville-Guzman Universality-Diversity Scale significantly correlated in theoretically predicted ways with measures of racial identity, empathy, health narcissism, feminism, androgyny, homophobia, and dogmatism (the last…
Descriptors: College Students, Construct Validity, Cultural Differences, Discriminant Analysis
Peer reviewedMohr, Jonathan J.; Rochlen, Aaron B. – Journal of Counseling Psychology, 1999
Reports on studies on the development and validation of the Attitudes Regarding Bisexuality Scale (ARBS). In heterosexuals, subscales were strongly related to attitudes toward lesbians and gay men, frequency of religious attendance, political ideology, and prior contact. In lesbians and gay men, subscales correlated with prior experiences and…
Descriptors: Bisexuality, Experience, Females, Heterosexuality
Peer reviewedHurtz, Gregory M.; Hertz, Norman R. – Educational and Psychological Measurement, 1999
Evaluated Angoff ratings from eight different occupational licensing examinations through generalizability theory to estimate the optimal number of raters. Results indicate that approximately 10 to 15 raters is an optimal target range. (SLD)
Descriptors: Cutting Scores, Evaluators, Generalizability Theory, Interrater Reliability
Peer reviewedLaufer, Batia; Nation, Paul – Language Testing, 1999
Investigated the reliability, validity, and practicality of a controlled production measure of vocabulary, consisting of items from five frequency levels and using a completion-item format. Two equivalent test forms were compared. The test was found to be useful in distinguishing between different proficiency groups. (Author/MSE)
Descriptors: Difficulty Level, Language Tests, Second Languages, Test Construction
Peer reviewedGannon, F. Terry; Draper, Peter R.; Watson, Roger; Proctor, Susan; Norman, Ian J. – Nurse Education Today, 2001
Using portfolios for assessment of nursing competence raises issues of ambiguity, confidentiality, and honesty. More research is needed to develop a clear theoretical framework and measures of validity, reliability, and credibility for nursing portfolio assessment. (Contains 29 references.) (SK)
Descriptors: Competence, Confidentiality, Higher Education, Nursing Education
Peer reviewedFall, Marijane; McLeod, Elizabeth H. – Professional School Counseling, 2001
Evaluates the revised editions of the Self-Efficacy Scale for use with children in schools. Addresses the reliability and validity of the two versions of the scale. Proposes counseling interventions to increase student self-efficacy. (Contains 22 references, 1 table, and an appendix.) (GCP)
Descriptors: Children, Counseling Techniques, Elementary Education, School Counseling
De Wever, B.; Schellens, T.; Valcke, M.; Van Keer, H. – Computers and Education, 2006
Research in the field of Computer Supported Collaborative Learning (CSCL) is based on a wide variety of methodologies. In this paper, we focus upon content analysis, which is a technique often used to analyze transcripts of asynchronous, computer mediated discussion groups in formal educational settings. Although this research technique is often…
Descriptors: Content Analysis, Educational Research, Research Methodology, Computer Mediated Communication
Strijbos, Jan-Willem; Martens, Rob L.; Prins, Frans J.; Jochems, Wim M. G. – Computers and Education, 2006
Quantitative content analysis is increasingly used to surpass surface level analyses in computer-supported collaborative learning (e.g., counting messages), but critical reflection on accepted practice has generally not been reported. A review of CSCL conference proceedings revealed a general vagueness in definitions of units of analysis. In…
Descriptors: Content Analysis, Computer Mediated Communication, Information Technology, Reliability
Chu, Brian C.; Kendall, Philip C. – Journal of Consulting and Clinical Psychology, 2004
Ratings of child involvement in manual-based cognitive-behavioral treatment for anxiety were associated with the absence of primary anxiety diagnosis and reductions in impairment ratings at posttreatment for 59 children with anxiety (ages 8-14 years). Good-to-excellent interrater reliability was established for the independent ratings of 237…
Descriptors: Psychometrics, Psychotherapy, Anxiety, Outcomes of Treatment
van I Jzendoorn,Marinus H.; Vereijken, Carolus M.J.L.; Bakermans-Kranenburg, Marian J.; Riksen-Walraven, Marianne J. – Child Development, 2004
The reliability and validity of the Attachment Q Sort (AQS; Waters & Deane, 1985) was tested in a series of meta-analyses on 139 studies with 13,835 children. The observer AQS security score showed convergent validity with Strange Situation procedure (SSP) security (r=31) and excellent predictive validity with sensitivity measures (r=39). Its…
Descriptors: Q Methodology, Predictive Validity, Attachment Behavior, Test Validity
Nystrom, Peter – Scandinavian Journal of Educational Research, 2004
Reliability is a problem inherent in all educational assessments, but the amount of attention this particular problem should be given is related to the function and use of the assessment. In this article, classification accuracy is put forward as a conceptualization of reliability that is meaningful for a large number of educational assessments.…
Descriptors: Test Validity, Test Reliability, Mathematics Tests, Foreign Countries
Doran, Harold C. – Educational and Psychological Measurement, 2005
The information function is an important statistic in item response theory (IRT) applications. Although the information function is often described as the IRT version of reliability, it differs from the classical notion of reliability from a critical perspective: replication. This article first explores the information function for the…
Descriptors: Item Response Theory, Error of Measurement, Evaluation Methods, Reliability
Foust, Michelle Singer; Elicker, Joelle D.; Levy, Paul E. – Journal of Vocational Behavior, 2006
The authors developed and validated a measure of employees' attitudes toward lateness at work. Analyses provided clear evidence of the reliability and validity of the new measure. Specifically, high reliabilities were observed in both student (a = 0.82) and employee (a = 0.84) samples. Using objective lateness data from organizations, the measure…
Descriptors: Measures (Individuals), Employee Attitudes, Work Attitudes, Test Reliability
Williams, Jo; Allison, Carrie; Scott, Fiona; Stott, Carol; Bolton, Patrick; Baron-Cohen, Simon; Brayne, Carol – Autism: The International Journal of Research & Practice, 2006
The Childhood Asperger Syndrome Test (CAST) is a 37-item parental self-completion questionnaire to screen for autism spectrum conditions in research. Good test accuracy was demonstrated in studies with primary school aged children in mainstream schools. The aim of this study was to investigate the test-retest reliability of the CAST. Parents of…
Descriptors: Asperger Syndrome, Parent Attitudes, Questionnaires, Young Children

Direct link
