Edwards, Wayne R.; Schleicher, Deidra J. – Journal of Educational Psychology, 2004
This study provides initial evidence for the criterion-related validity of tacit knowledge (TK) as an alternative measure for selecting psychology graduate students and adds insight to the construct of TK by evaluating its factor structure, assessing convergent relationships with other variables, and exploring alternative reasons for why TK…
Descriptors: Academic Achievement, Knowledge Level, Educational Psychology, Predictor Variables
Serlin, Ronald C.; Harwell, Michael R. – Psychological Methods, 2004
It is well-known that for normally distributed errors parametric tests are optimal statistically, but perhaps less well-known is that when normality does not hold, nonparametric tests frequently possess greater statistical power than parametric tests, while controlling Type I error rate. However, the use of nonparametric procedures has been…
Descriptors: Multiple Regression Analysis, Monte Carlo Methods, Nonparametric Statistics, Error Patterns
Mumma, Gregory H. – Psychological Assessment, 2004
This article describes a method for the intraindividual clinical validation of a cognitive case formulation (CCF) involving hypotheses about the patient's idiosyncratic cognitive schema (ICS). The two-stage approach begins by testing the convergent and discriminant validity of the hypothesized ICS against the individual's daily ratings of…
Descriptors: Factor Analysis, Anxiety, Test Validity, Factor Structure
Mantzicopoulos, Panayota; French, Brian F.; Maller, Susan J. – Child Development, 2004
Competing models of the factorial structure of the Pictorial Scale of Perceived Competence and Social Acceptance (PSPCSA) were tested for fit using multisample confirmatory factor analysis. The best fitting model was tested for invariance (a) across samples of middle-class (n=251) and economically disadvantaged (Head Start, n=117) kindergarten…
Descriptors: Disadvantaged Youth, Measures (Individuals), Economically Disadvantaged, Kindergarten
Scarborough, Janna L. – Professional School Counseling, 2005
Given that there is a continual need for school counselors to describe and account for what they do, it follows that an instrument that can be used to collect relevant process data is warranted. The importance of collecting process data describing school counselor practice is widely supported as a component of accountability. However, the lack of…
Descriptors: Test Validity, School Counseling, Rating Scales, School Counselors
Uhing, Brad M.; Mooney, Paul; Ryser, Gail R. – Journal of Emotional & Behavioral Disorders, 2005
The authors' studies evaluated the discriminative ability of the youth and parent forms of the "Behavioral and Emotional Rating Scale--Second Edition" (BERS-2; Epstein, 2004). The BERS-2 is a standardized rating scale system that assesses the emotional and behavioral strengths of children and youth. Separate studies compared the…
Descriptors: Parents, Rating Scales, Emotional Disturbances, Behavior Problems
Koul, Ravinder; Clariana, Roy B.; Salehi, Roya – Journal of Educational Computing Research, 2005
This article reports the results of an investigation of the convergent criterion-related validity of two computer-based tools for scoring concept maps and essays as part of the ongoing formative evaluation of these tools. In pairs, participants researched a science topic online and created a concept map of the topic. Later, participants…
Descriptors: Scoring, Essay Tests, Test Validity, Formative Evaluation
Kulm, Gerald; Dager Wilson, Linda; Kitchen, Richard – Educational Assessment, 2005
Alignment has taken on increased importance given the current high-stakes nature of assessment. To make well-informed decisions about student learning on the basis of test results, assessment items need to be well aligned with standards. Project 2061 of the American Association for the Advancement of Science (AAAS) has developed a procedure for…
Descriptors: Test Results, Test Validity, Evaluation Methods, Mathematics Instruction
Aschenbrand, Sasha G.; Angelosante, Aleta G.; Kendall, Philip C. – Journal of Clinical Child and Adolescent Psychology, 2005
This study investigated the utility of several scales of the Child Behavior Checklist (CBCL) when diagnosing anxiety disorders in youth. Participants were the mothers and fathers of 130 children (ages 7 to 14; M = 9.61 years, SD = 1.74; 69 boys, 61 girls) who were evaluated at a specialty mental health clinic (100 were referred for treatment; 30…
Descriptors: Child Behavior, Mothers, Fathers, Psychometrics
Vermeer, Adri; Lijnse, Margot; Lindhout, Marleen – European Journal of Special Needs Education, 2004
The results of a study examining the psychometric quality of a pictorial scale to measure perceived physical competence, perceived cognitive competence and perceived social acceptance by peers and caregivers in individuals with intellectual disabilities are reported. The scale was administered twice to 100 subjects. The stability of the scale…
Descriptors: Disabilities, Attitude Measures, Caregiver Attitudes, Foreign Countries
Ozanne, Julie L.; Adkins, Natalie Ross; Sandlin, Jennifer A. – Adult Education Quarterly: A Journal of Research and Theory, 2005
Little empirical evidence exists on how adult literacy learners act as consumers. Yet, adult literacy programs often employ a "functional" approach to consumer education and assume that adult learners are deficient in consumer skills. Data from a qualitative study of the consumer behaviors of adult literacy learners are used to explore how adult…
Descriptors: Writing Skills, Consumer Education, Adult Students, Adult Learning
McKevitt, Brian C.; Elliott, Stephen N. – Psychology in the Schools, 2005
Data were gathered from videotaped recordings of two preschool children engaged in unstructured free play over 12 days each. Observers coded behavior from the videotapes and completed a behavior rating scale for each child after every two observation sessions. Teachers also completed two behavior rating scales per child. Results indicated that at…
Descriptors: Social Behavior, Observation, Validity, Play
Marshall, Margarita B.; Bagby, R. Michael – Assessment, 2006
The incremental validity and clinical utility of the recently developed Minnesota Multiphasic Personality Inventory-2 (MMPI-2) Infrequency Posttraumatic Stress Disorder Scale (Fptsd) was examined in relation to the family of MMPI-2 F scales in distinguishing feigned post-traumatic stress disorder (PTSD) from disability claimants with PTSD.…
Descriptors: Posttraumatic Stress Disorder, Test Validity, Personality Measures, Effect Size
Schulz, E. Matthew; Betebenner, Damian; Ahn, Meeyeon – Journal of Educational Measurement, 2004
Whether hierarchical logistic regression can reduce the sample size requirement for estimating optimal cutoff scores in a course placement service where predictive validity is measured by a threshold utility function is explored. Data from courses with varying class size were randomly partitioned into two halves per course. Nonhierarchical and…
Descriptors: Class Size, Sample Size, Cutting Scores, Predictive Validity
Sofroniou, Nick; Kellaghan, Thomas – Journal of Educational Measurement, 2004
To examine the predictive utility of three scales provided in the released database of the Third International Mathematics and Science Study (TIMSS) (international plausible values, standardized percent correct score, and national Rasch score), information was obtained on the performance in state examinations in mathematics and science in 1996…
Descriptors: Foreign Countries, Predictive Validity, National Competency Tests, Mathematics Tests

