Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedKaufman, James C.; Gentile, Claudia A.; Baer, John – Gifted Child Quarterly, 2005
Little research has been conducted on how gifted novices compare to experts in their judgments of creative writing. If novices and experts assign similar ratings, it could be argued that gifted novices are able to offer their peers feedback of a similar quality to that provided by experts. Such a finding would support the use of collaborative…
Descriptors: Psychologists, Literary Genres, Interrater Reliability, Feedback
Raghavan, R.; Marshall, M.; Lockwood, A.; Duggan, L. – Journal of Intellectual Disability Research, 2004
People with learning disability (LD) experience a range of mental health problems. They are a complex population, whose needs are not well understood. This study focuses on the development of a systematic process of needs assessment for this population. The Cardinal Needs Schedule used in general psychiatry was adapted for people with learning…
Descriptors: Psychiatry, Needs Assessment, Mental Disorders, Interrater Reliability
Galea, Jennifer; Butler, Jenny; Iacono, Teresa; Leighton, Daniel – Journal of Intellectual and Developmental Disability, 2004
The aims of this study were to evaluate components of a new tool, the Assessment of Sexual Knowledge (ASK), and to use it to assess the sexual knowledge of adults with intellectual disability. The ASK consists of a Knowledge Section, an Attitudes Section, a Quick Knowledge Quiz and a Problematic Socio-Sexual Behaviours Checklist. A sample of 96…
Descriptors: Foreign Countries, Knowledge Level, Sexuality, Test Reliability
Yiu, Edwin M.-L.; Ng, Chi-Yan – Clinical Linguistics and Phonetics, 2004
One of the factors that affects the reliability of perceptual voice evaluation is the rating scale. Equal-appearing interval (EAI) and visual analogue (VA) scales are the two most common scales used and have attracted much attention in recent studies of perceptual voice evaluation. Available findings are contradictory, with one study finding the…
Descriptors: Test Reliability, Measurement Techniques, Rating Scales, Phonetics
Brookhart, Susan M. – Educational Measurement: Issues and Practice, 2005
A sample of 293 local district assessments used in the Nebraska STARS (School-based Teacher-led Assessment and Reporting System), 147 from 2004 district mathematics assessment portfolios and 146 from 2003 reading assessment portfolios, was scored with a rubric evaluating their quality. Scorers were Nebraska educators with background and training…
Descriptors: Portfolios (Background Materials), Scoring, Student Evaluation, Reliability
Kupermintz, Haggai – Journal of Educational Measurement, 2004
A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…
Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory
Penev, Spiridon; Raykov, Tenko – Multivariate Behavioral Research, 2006
A linear combination of a set of measures is often sought as an overall score summarizing subject performance. The weights in this composite can be selected to maximize its reliability or to maximize its validity, and the optimal choice of weights is in general not the same for these two optimality criteria. We explore several relationships…
Descriptors: Behavioral Science Research, Reliability, Validity, Evaluation Methods
van der Schaaf, Marieke; Stokking, Karel; Verloop, Nico – Studies in Educational Evaluation, 2005
Portfolios are frequently used to assess teachers' competences. In portfolio assessment, the issue of rater reliability is a notorious problem. To improve the quality of assessments insight into raters' judgment processes is crucial. Using a mixed quantitative and qualitative approach we studied cognitive processes underlying raters' judgments and…
Descriptors: Portfolios (Background Materials), Systems Approach, Cognitive Processes, Portfolio Assessment
Jobes, David A.; Nelson, Kathryn N.; Peterson, Erin M.; Pentiuc, Daniel; Downing, Vanessa; Francini, Kristen; Kiernan, Amy – Suicide and Life-Threatening Behavior, 2004
Given the incidence and seriousness of suicidality in clinical practice, the need for new and better ways to assess suicide risk is clear. While there are many published assessment instruments in the literature, survey data suggest that these measure are not widely used. One possible explanation is that current quantitatively developed assessment…
Descriptors: Patients, Research Methodology, Interrater Reliability, Suicide
Gray, Matt J.; Litz, Brett T.; Hsu, Julie L.; Lombardo, Thomas W. – Assessment, 2004
The Life Events Checklist (LEC), a measure of exposure to potentially traumatic events, was developed at the National Center for Posttraumatic Stress Disorder (PTSD) concurrently with the Clinician Administered PTSD Scale (CAPS) to facilitate the diagnosis of PTSD. Although the CAPS is recognized as the gold standard in PTSD symptom assessment,…
Descriptors: Psychometrics, Check Lists, Veterans, Posttraumatic Stress Disorder
Lowe, Patricia A.; Reynolds, Cecil R. – Educational and Psychological Measurement, 2004
Responses of 871 adults to the Adult Manifest Anxiety Scale-Adult version (AMAS-A) were factor analyzed using the method of principal axis factoring with promax rotation. Factor analysis yielded a four-factor solution: three anxiety factors (Worry/Oversensitivity, Stress, and Physiological Anxiety) and a Lie factor. The AMAS-A's three-factor…
Descriptors: Psychometrics, Validity, Factor Analysis, Anxiety
Hogan, Thomas P.; Agnello, Jessica – Educational and Psychological Measurement, 2004
This study investigates the current research practice concerning reporting measurement validity evidence based on a sample of 696 research reports listed in the American Psychological Association's Directory of Unpublished Experimental Mental Measures. Only 55% of the reports included any type of validity evidence. This was a substantially lower…
Descriptors: Psychology, Psychometrics, Test Validity, Psychological Testing
Lee, Kibeom; Ashton, Michael C. – Multivariate Behavioral Research, 2004
We introduce a personality inventory designed to measure six major dimensions of personality derived from lexical studies of personality structure. The HEXACO Personality Inventory (HEXACO-PI) consists of 24 facet-level personality trait scales that define the six personality factors named Honesty-Humility (H), Emotionality (E), Extraversion (X),…
Descriptors: Psychometrics, Personality Measures, Personality Assessment, Personality Traits
Strauss, Gregory P.; Allen, Daniel N.; Jorgensen, Melinda L.; Cramer, Stacey L. – Assessment, 2005
Previous studies have examined the reliability of scores derived from various Stroop tasks. However, few studies have compared reliability of more recently developed Stroop variants such as emotional Stroop tasks to standard versions of the Stroop. The current study developed four different single-stimulus Stroop tasks and compared test-retest…
Descriptors: Undergraduate Students, Test Reliability, Visual Perception, Comparative Analysis
Colombo, Lucia; Laudanna, Alessandro; De Martino, Maria; Brivio, Cristina – Brain and Language, 2004
In the present study we have investigated the acquisition of the past participle of Italian verbs of the second (including mostly irregular verbs) and third (including mostly regular verbs) conjugations in school age children, and with simulations with an artificial neural network. We aimed to verify the extent to which children are sensitive to…
Descriptors: Verbs, Morphemes, Italian, Children

Direct link
