Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Zakaria, Effandi; Haron, Zolkepeli; Daud, Md Yusoff – Journal of Science and Mathematics Education in Southeast Asia, 2004
The Attitudes Toward Problem Solving Scale (ATPSS) has received limited attention concerning its reliability and validity with a Malaysian secondary education population. Developed by Charles, Lester & O'Daffer (1987), the instruments assessed attitudes toward problem solving in areas of Willingness to Engage in Problem Solving Activities,…
Descriptors: Self Esteem, Construct Validity, Reliability, Problem Solving
Using a Longitudinal Database to Assess the Validity of Preceptors' Ratings of Clerkship Performance
Ferguson, Kristi J.; Kreiter, Clarence D. – Advances in Health Sciences Education, 2004
Purpose: To examine the validity of using scores from a clinical evaluation form as an assessment of clinical competence. Method: Investigators collected a longitudinal clinical skills assessment database that included scores reflecting performance on standardized patient interactions, case-based learning performance, scores on multiple-choice…
Descriptors: Generalizability Theory, Medical Students, Validity, Program Effectiveness
Watkins, Marley W.; Edwards, Vicki A. – Journal of Psychoeducational Assessment, 2004
Considerable evidence suggests that phonemic awareness is associated with the development of skilled reading. Consequently, it is recommended that beginning readers be assessed to ensure adequate development of phonemic awareness skills. When choosing an assessment method, reliability and validity, ease of administration and scoring, and…
Descriptors: Reading Difficulties, Phonemics, Phonological Awareness, Phonemic Awareness
Swanson, Jewel – Canadian Journal of School Psychology, 2005
The Delis-Kaplan Executive Function System (D-KEFS; Delis, Kaplan, & Kramer, 2001a) is a set of standardized tests for comprehensively assessing higher-level cognitive functions, referred to as "executive functions," in both children and adults (aged 8 to 89). Executive functions draw on the individual's more fundamental or primary cognitive…
Descriptors: Cognitive Processes, Standardized Tests, Cognitive Tests, Children
Sudweeks, Richard R.; Reeve, Suzanne; Bradshaw, William S. – Assessing Writing, 2004
A pilot study was conducted to evaluate and improve the rating procedure proposed for use in a research effort designed to assess the essay writing ability of college sophomores. Generalizability theory and the Many-Facet Rasch Model were each used to (a) estimate potential sources of error in the rating, (b) to obtain reliability estimates, and…
Descriptors: Generalizability Theory, College Students, Writing Ability, Writing Evaluation
McVilly, Keith R.; Stancliffe, Roger J.; Parmenter, Trevor R.; Burton-Smith, Rosanne M. – Journal of Applied Research in Intellectual Disabilities, 2006
Background: This study explored "loneliness" as experienced by adults with intellectual disability, with "intermittent" to "limited" support needs. Method: A measure of loneliness was piloted, and qualitative techniques used to develop a greater understanding of the participants' experience. Results: The Loneliness Scale proved valid and reliable…
Descriptors: Mental Retardation, Quality of Life, Social Networks, Psychological Patterns
Ho, Debbie – Australian Review of Applied Linguistics, 2006
This paper explores the possibility of expanding the focus group interview into the field of English as a Second Language (ESL), where this research methodology is yet to be thoroughly explored. Specifically, it aims to challenge popular criticisms about the reliability and validity of the focus group as a qualitative research methodology. It does…
Descriptors: Qualitative Research, Research Methodology, Focus Groups, Social Sciences
Dorn, Charles M.; Sabol, F. Robert – Studies in Art Education: A Journal of Issues and Research in Art Education, 2006
This is a report of an experimental study that focused on adjudicating the art portfolios of secondary art students assessed in both actual and digital forms in order to determine whether art teachers evaluate actual works of art in students' portfolios differently than digital copies of them. The study participants included 178 students of 29…
Descriptors: Portfolio Assessment, Secondary School Students, Studio Art, Art Products
Sprinkle, Stephen D.; Lurie, Daphne; Insko, Stephanie L.; Atkinson, George; Jones, George L.; Logan, Arthur R.; Bissada, Nancy N. – Journal of Counseling Psychology, 2002
The criterion validity of the Beck Depression Inventory-II (BDI-II; A. T. Beck, R. A. Steer, & G. K. Brown, 1996) was investigated by pairing blind BDI-II administrations with the major depressive episode portion of the Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I; M. B. First, R. L. Spitzer, M. Gibbon, & J. B. W.…
Descriptors: Severity (of Disability), Guidance Centers, Predictive Validity, Cutting Scores
Liang, Xin – Evaluation and Research in Education, 2003
Multiple matrix sampling is a data collection technique that ensures accuracy and efficiency in group performance. It has been widely used in large-scale curriculum evaluation since the 1980s. However, the design does not always fully embrace the dynamics of local evaluation demands. The purpose of this study is to introduce a modified matrix…
Descriptors: Curriculum Evaluation, Item Sampling, Matrices, Statistical Studies
Guy, Laura S.; Douglas, Kevin S. – Psychological Assessment, 2006
The correspondence between the Hare Psychopathy Checklist: Screening Version (PCL:SV; S. D. Hart, D. N. Cox, & R. D. Hare, 1995) and the Hare Psychopathy Checklist-Revised (PCL-R; R. D. Hare, 1991, 2003) was examined in forensic (N = 175) and correctional (N = 188) samples. Intermeasure correlations for Total scores (0.95 forensic, 0.94…
Descriptors: Check Lists, Screening Tests, Models, Mental Disorders
Munroe, Arnold; Pearson, Carolyn – Educational and Psychological Measurement, 2006
Institutions of higher education want to diversify their learning climates, and many offer courses in multiculturalism, yet these courses still do not meet the needs of attitudinal change. A new instrument was developed, the Munroe Multicultural Attitude Scale Questionnaire (MASQUE), that was theoretically based in Banks's transformative approach,…
Descriptors: Higher Education, Colleges, Data Analysis, Test Reliability
Cutler, Lois J.; Kane, Rosalie A.; Degenholtz, Howard B.; Miller, Michael J.; Grant, Leslie – Gerontologist, 2006
Purpose: We developed and tested theoretically derived procedures to observe physical environments experienced by nursing home residents at three nested levels: their rooms, the nursing unit, and the overall facility. Illustrating with selected descriptive results, in this article we discuss the development of the approach. Design and Methods: On…
Descriptors: Physical Environment, Nursing Homes, Research Tools, Evaluation Methods
Hadley, Pamela A.; Short, Heather – Journal of Speech, Language, and Hearing Research, 2005
Purpose: The purpose of this study was to develop and evaluate measures reflecting the onset of tense marking for children between the ages of 2;0 (years;months) and 3;0. Method: The validity of 4 cumulative measures of tense marker emergence and productivity was evaluated relative to existing measures of early grammatical development in a sample…
Descriptors: Identification, Grammar, Language Impairments, Morphemes
Wilson, Sandip – Journal of Children's Literature, 2006
The accuracy of information in a children's nonfiction book is one criterion the seven-member Orbis Pictus Award Committee considers when selecting outstanding children's nonfiction books for the award. The charge of the committee is to consider other criteria as well, including the clarity and coherence of the book's organization, the extent to…
Descriptors: Recognition (Achievement), Credits, Nonfiction, Book Reviews

Direct link
Peer reviewed
