Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Capraro, Robert M.; Capraro, Mary Margaret – Middle Grades Research Journal, 2009
This study examines two journals specific to the middle grades in which original quantitative empirical articles are published, Research in Middle Level Education and Middle Grades Research Journal, to determine what quantitative statistics are used, how they are used, and what study designs are used. Important for those who write for the…
Descriptors: Periodicals, Research Methodology, Social Science Research, Effect Size
Uzuntiryaki, Esen; Aydin, Yesim Capa – Research in Science Education, 2009
This study described the process of developing and validating the College Chemistry Self-Efficacy Scale (CCSS) that can be used to assess college students' beliefs in their ability to perform essential tasks in chemistry. In the first phase, data collected from 363 college students provided evidence for the validity and reliability of the new…
Descriptors: College Students, Self Efficacy, Chemistry, Measures (Individuals)
Bjornebekk, Gunnar – Educational and Psychological Measurement, 2009
The primary aim of this study was to examine the psychometric properties of the scores on a version for children of the Carver and White Behavioral Inhibition and Activation scales (the BIS-BAS scales). This involved administering the BIS-BAS scales, the Positive and Negative Affect Schedule, the Junior Eysenck Personality Questionnaire…
Descriptors: Measures (Individuals), Psychometrics, Grade 6, Test Validity
Ehrenreich, Jill T.; Micco, Jamie A.; Fisher, Paige H.; Warner, Carrie Masia – Child Psychiatry and Human Development, 2009
Objective: Research on child and adolescent anxiety disorders has seen a surge in investigations of parenting factors potentially associated with their etiology. However, many of the well-established parenting measures are limited by over-reliance on self-report or lengthy behavioral observation procedures. Such measures may not assess factors…
Descriptors: Test Validity, Child Rearing, Interrater Reliability, Adolescents
Kim, Bryan S. K.; Soliz, Alicia; Orellana, Blanca; Alamilla, Saul G. – Measurement and Evaluation in Counseling and Development, 2009
This article describes the development of the Latino/a Values Scale (35 items, 14 reverse-worded). Evidence of reliability and validity is presented on the basis of three studies. An examination of the factor structure of the items suggests the presence of the following dimensions: cultural pride, simpatia, familismo, and espiritismo. (Contains 4…
Descriptors: Hispanic Americans, Social Values, Measures (Individuals), Reliability
Kucuk, Funda; Walters, JoDee – ELT Journal, 2009
This article reports on a study of the validity and reliability of tests administered in an EFL university setting. The study addresses the question of how well face validity reflects more objective measures of the quality of a test, such as predictive validity and reliability. According to some researchers, face validity, defined as the surface…
Descriptors: Language Tests, Test Validity, Achievement Tests, English (Second Language)
Soreni, Noam; Crosbie, Jennifer; Ickowicz, Abel; Schachar, Russell – Journal of Attention Disorders, 2009
Objective: To measure test-retest reliability of the Stop-Signal Task (SST) and the Conners' Continuous Performance Test (CPT) in children with ADHD. Methods: 12 children with ADHD (age 11.46 ± 1.66) participated in the study. Primary outcome measures were stop-signal reaction time (SSRT) for the SST and CPT's commission errors (%FP).…
Descriptors: Intervals, Reaction Time, Performance Tests, Attention Deficit Disorders
Yao, Grace; Wu, Chia-huei – Social Indicators Research, 2009
To facilitate comparison across cultures, the World Health Organization (WHO) has been developing a universal measure of quality of life (QOL) called the WHOQOL Questionnaire. This questionnaire contains 24 facets organized into six broad domains: physical, psychological, level of independence, social relationships, environment, and…
Descriptors: Social Indicators, Cross Cultural Studies, Quality of Life, Questionnaires
Shahvali, M.; Poursaeed, A.; Sharifzadeh, M. – Journal of Natural Resources and Life Sciences Education, 2009
This study investigated the effects of workshop and lecture methods on pastoralists' learning in Ilam Province, in western Iran. A quasi-experimental research method and a non-equivalent control group design were used. Sixty pastoralists participated in this study. An open-ended questionnaire was used as the instrument of the study and found to have…
Descriptors: Control Groups, Content Validity, Validity, Interrater Reliability
Rottinghaus, Patrick J. – Journal of Career Assessment, 2009
This article introduces the Kuder Skills Assessment-College and Adult version (KSA-CA; Rottinghaus, 2006), a new measure incorporating advances in the measurement of self-efficacy across 16 basic occupational domains (e.g., finance, information technology) and the six Kuder Clusters. Similar to the original development sample, all scales of the…
Descriptors: Self Efficacy, Measures (Individuals), College Students, Majors (Students)
Carpenter, Brian D.; Balsis, Steve; Otilingam, Poorni G.; Hanson, Priya K.; Gatz, Margaret – Gerontologist, 2009
Purpose: This study provides preliminary evidence for the acceptability, reliability, and validity of the new Alzheimer's Disease Knowledge Scale (ADKS), a content and psychometric update to the Alzheimer's Disease Knowledge Test. Design and Methods: Traditional scale development methods were used to generate items and evaluate their psychometric…
Descriptors: Alzheimers Disease, Caregivers, Risk, Patients
Lee-Ellis, Sunyoung – Language Testing, 2009
Despite the importance of having a reliable and valid measure of Second Language (L2) proficiency, L2 researchers of less commonly taught languages rarely have such a tool. Existing proficiency measures (e.g., DLPT, OPI) are often costly, labor-intensive, time-consuming, or unavailable to the public. With the intent to provide a practical and…
Descriptors: Uncommonly Taught Languages, Test Validity, Korean, Test Construction
Kang, Sonia K.; Chasteen, Alison L. – Gerontologist, 2009
Purpose: There is much evidence suggesting that older adults are often negatively affected by aging stereotypes; however, no method to identify individual differences in vulnerability to these effects has yet been developed. The purpose of this study was to develop a reliable and valid questionnaire to measure individual differences in the…
Descriptors: Construct Validity, Reliability, Questionnaires, Older Adults
Ridley, Charles R.; Shaw-Ridley, Mary – Counseling Psychologist, 2009
Clinical judgment is foundational to psychological practice. Accurate judgment forms the basis for establishing reasonable goals and selecting appropriate treatments, which in turn are essential in achieving positive therapeutic outcomes. Therefore, Spengler and colleagues' meta-analytic finding--clinical judgment accuracy improves marginally with…
Descriptors: Medical Evaluation, Clinical Experience, Inferences, Therapy
Jang, Yoonhee; Wixted, John T.; Huber, David E. – Journal of Experimental Psychology: General, 2009
The current study compared 3 models of recognition memory in their ability to generalize across yes/no and 2-alternative forced-choice (2AFC) testing. The unequal-variance signal-detection model assumes a continuous memory strength process. The dual-process signal-detection model adds a thresholdlike recollection process to a continuous…
Descriptors: Test Format, Familiarity, Testing, Criteria

Peer reviewed
