Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedMacera, Caroline A.; Ham, Sandra A.; Jones, Deborah A.; Kimsey, C. D.; Ainsworth, Barbara E.; Neff, Linda J. – American Journal of Public Health, 2001
Explored the limitations of identifying sedentary people via an existing screening question on a questionnaire. Data from responses on the Behavioral Risk Factor Surveillance System indicated that many who initially reported no leisure time activity actually engaged in various types of physical activity, highlighting the difficulty of measuring a…
Descriptors: Health Behavior, Leisure Time, Life Style, Physical Activity Level
Peer reviewedMills, Jessica; Bogenschneider, Karen – Family Relations, 2001
Examines the reliability and validity of the Youth Support Inventory, a tool designed for community coalitions to assess the availability of local resources and supports that previous research indicates are important for preventing adolescent alcohol and other drug use. (BF)
Descriptors: Adolescents, Community Services, Health Behavior, Health Promotion
Mooney, Paul; Epstein, Michael H.; Ryser, Gail; Pierce, Corey D. – Children & Schools, 2005
Three studies are reported addressing the test-retest reliability and convergent validity of the Behavioral and Emotional Rating Scale-Second Edition (BERS-2): Parent Rating Scale (PRS). In the first study, test-retest reliability was investigated over a one-week period to determine the stability of the BERS-2 PRS over time. Reliability…
Descriptors: Behavior Rating Scales, Test Validity, Test Reliability, Parents
Peer reviewedSarkisian, Catherine A.; Steers, W. Neil; Hays, Ron D.; Mangione, Carol M. – Gerontologist, 2005
Purpose: This study describes the development of a short version of the Expectations Regarding Aging Survey (ERA-38), a 38-item survey measuring expectations regarding aging. Design and Methods: In 1999, surveys containing the ERA-38 were mailed to 588 adults aged [greater than or equal to] 65 years who were recruited through physicians; 429…
Descriptors: Organizations (Groups), Older Adults, Psychometrics, Physical Health
Mallinckrodt, Brent; Wang, Chia-Chih – Journal of Counseling Psychology, 2004
Back-translation is typically used to verify semantic equivalence (SE) of a translated measure to the original scale. Although validity of the adapted scale depends fundamentally on SE, back-translation always involves subjective evaluations. This study developed "dual-language, split-half quantitative methods of verification to supplement…
Descriptors: Measures (Individuals), Semantics, Translation, Attachment Behavior
Campbell-Sills, Laura; Liverant, Gabrielle I.; Brown, Timothy A. – Psychological Assessment, 2004
The latent structure, reliability, and validity of the Behavioral Inhibition/Behavioral Activation Scales (BIS/BAS; C. L. Carver & T. L. White, 1994) were examined in a large sample of outpatients (N = 1,825) with anxiety and mood disorders. Four subsamples were used for exploratory and confirmatory factor analyses. In addition to generally…
Descriptors: Psychometrics, Test Reliability, Test Validity, Inhibition
Peer reviewedTaylor, Annette Kujawski – College Student Journal, 2005
This research examined 2 elements of multiple-choice test construction, balancing the key and optimal number of options. In Experiment 1 the 3 conditions included a balanced key, overrepresentation of a and b responses, and overrepresentation of c and d responses. The results showed that error-patterns were independent of the key, reflecting…
Descriptors: Comparative Analysis, Test Items, Multiple Choice Tests, Test Construction
Johnston, Brenda – Studies in Higher Education, 2004
The issue of arriving at agreement over outcomes in summative assessment of portfolios has been a major concern, given the complexity of the assessment task, the educational and political context, and the widespread and growing use of portfolios in higher education. This article examines research findings in this area. The discussion takes place…
Descriptors: Portfolios (Background Materials), Portfolio Assessment, Student Evaluation, Higher Education
Brookhart, Susan M. – New Directions for Teaching and Learning, 2004
Classroom assessment information should be the basis for important classroom processes and outcomes: students' study and work patterns, students' understanding of what they are learning, and teachers' instructional and grading decisions. Attention to principles of assessment quality, especially validity and reliability, increases confidence in the…
Descriptors: Student Evaluation, Academic Achievement, Formative Evaluation, Summative Evaluation
Roberts, Kim P.; Powell, Martine B. – Journal of Experimental Child Psychology, 2006
Participants (6- and 7-year-olds, "N" = 130) participated in classroom activities four times. Children were interviewed about the final occurrence (target event) either 1 week or 4 weeks later, during which half of the event items were described inaccurately. Half of these suggestions were consistent with the theme of the detail across…
Descriptors: Young Children, Class Activities, Memory, Reliability
Botzet, Andria, M.; Winters, Ken C.; Stinchfield, Randy – Journal of Child and Adolescent Substance Abuse, 2006
Although gender issues have been addressed in clinical drug abuse literature, very little research has focused on gender differences in terms of the psychometric properties of assessment instruments. If boys and girls interpret instruments differently, the accuracy of clinical evaluation, referral, and treatment decisions based on these measures…
Descriptors: Gender Differences, Adolescents, Drug Abuse, Psychometrics
Christensen, Andrew; Eldridge, Kathleen; Catta-Preta, Adriana Bokel; Lim, Veronica R.; Santagata, Rossella – Journal of Marriage and Family, 2006
In order to examine the cross-cultural consistency of several patterns of couple communication, 363 participants from four different countries (Brazil, Italy, Taiwan, and the United States) completed self-report measures about communication and satisfaction in their romantic relationships. Across countries, constructive communication was…
Descriptors: Foreign Countries, Gender Differences, Cross Cultural Studies, Reliability
Peer reviewedRibes, Emilio; Contreras, Sagrario; Martinez, Carlos; Doval, Eduardo; Viladrich, Carme – Psychological Record, 2005
Three experimental studies were carried out in order to find within-subject consistencies as well as individual differences in a concurrent choice situation involving risk-taking. Four subjects were exposed twice, with a 4-month delay, to a horse-race game and a stock-exchange game, in order to evaluate their choices for a conservative versus a…
Descriptors: Risk, Behavioral Science Research, Task Analysis, Reliability
Marwit, Samuel J.; Meuser, Thomas M. – Death Studies, 2005
This article describes the derivation of a short-form of the Marwit-Meuser Caregiver Grief Inventory (MM-CGI), an inventory designed to measure grief in caregivers of persons with progressive dementia. It presents initial reliability and validity data and describes ways to use the inventory both clinically and scientifically. The resulting MM-CGI…
Descriptors: Dementia, Patients, Caregivers, Test Reliability
McGrath, Robert E.; Pogge, David L.; Stokes, John M.; Cragnolino, Ana; Zaccario, Michele; Hayman, Judy; Piacentini, Teresa; Wayland-Smith, Douglas – Assessment, 2005
The extent to which the Comprehensive System for the Rorschach is reliably scored has been a topic of some controversy. Although several studies have concluded it can be scored reliably in research settings, little is known about its reliability in field settings. This study evaluated the reliability of both response-level codes and protocol-level…
Descriptors: Scoring, Patients, Adolescents, Reliability

Direct link
