Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Dhami, Mandeep K. – Journal of Experimental Psychology: Applied, 2008
Beyond reasonable doubt represents a probability value that acts as the criterion for conviction in criminal trials. I introduce the membership function (MF) method as a new tool for measuring quantitative interpretations of reasonable doubt. Experiment 1 demonstrated that three different methods (i.e., direct rating, decision theory based, and…
Descriptors: Probability, Criminal Law, Court Litigation, Decision Making
Flanagan, Rosemary – Journal of Psychoeducational Assessment, 2008
This article provides a review of Roberts-2, an individually administered narrative measure published by Western Psychological Services. Roberts-2 is a substantial revision of the earlier version of this measure, the Roberts Apperception Test for Children (RATC; McArthur & Roberts, 1982). Roberts-2 is composed of 16 stimulus cards that direct…
Descriptors: Psychological Services, Psychologists, Test Reviews, Children
Kember, David; Leung, Doris Y. P. – Assessment & Evaluation in Higher Education, 2008
This article uses the case of designing a new course questionnaire to discuss the issues of validity, reliability and diagnostic power in good questionnaire design. Validity is often not well addressed in course questionnaire design as there are no straightforward tests that can be applied to an individual instrument. The authors propose the…
Descriptors: Qualitative Research, Course Evaluation, Test Validity, Questionnaires
Page, Randy; Montgomery, Katie; Ponder, Andrea; Richard, Amanda – American Journal of Health Education, 2008
Background: Despite recent and heightened concern about the marketing of food to children as a health issue, there is little previous research describing the product packaging characteristics of specific products intensely marketed to children. Purpose: In order to better understand food marketing tactics targeting children, the purpose of this…
Descriptors: Health Education, Marketing, Content Analysis, Public Policy
Wilson, Maja – Educational Leadership, 2008
Wilson asserts that the quest for absolute objectivity in scoring student writing--including the use of rubrics--creates harmful distance between reader and writer and ignores the unique, transactional characteristics of writing. She puts forth the view of Rosenblatt and other literacy theorists that meaning and value of texts are not rigidly…
Descriptors: Writing Evaluation, Scoring, Reader Text Relationship, Perspective Taking
Van Hasselt, Vincent B.; Sheehan, Donald C.; Malcolm, Abigail S.; Sellers, Alfred H.; Baker, Monty T.; Couwels, Judy – Behavior Modification, 2008
This study establishes the reliability and validity of the Law Enforcement Officer Stress Survey (LEOSS), a short early-warning stress-screening measure for law enforcement officers. The initial phase of LEOSS development employed the behavioral-analytic model to construct a 25-item instrument specifically geared toward evaluation of stress in…
Descriptors: Police, Validity, Law Enforcement, Psychometrics
Garb, Howard N. – American Psychologist, 2008
Comments on the original article "Plate tectonics in the classification of personality disorder: Shifting to a dimensional model," by T. A. Widiger and T. J. Trull. The purpose of this comment is to address (a) whether psychologists know how personality traits are currently assessed by clinicians and (b) the reliability and validity of those…
Descriptors: Personality Traits, Personality Problems, Psychologists, Diagnostic Tests
Ruiz, Mark A.; Poythress, Norman G.; Lilienfeld, Scott O.; Douglas, Kevin S. – Assessment, 2008
The authors examined the psychometric properties, factor structure, and construct validity of the Dissociative Experiences Scale (DES) in a large offender sample (N = 1,515). Although the DES is widely used with community and clinical samples, minimal work has examined offender samples. Participants were administered self-report and interview…
Descriptors: Factor Structure, Construct Validity, Correlation, Psychometrics
Burch, V. C.; Norman, G. R.; Schmidt, H. G.; van der Vleuten, C. P. M. – Advances in Health Sciences Education, 2008
High stakes postgraduate specialist certification examinations have considerable implications for the future careers of examinees. Medical colleges and professional boards have a social and professional responsibility to ensure their fitness for purpose. To date there is a paucity of published data about the reliability of specialist certification…
Descriptors: Generalizability Theory, Physicians, Foreign Countries, Specialists
Greatorex, Jackie; Suto, Irenka W. M. – Educational Research, 2008
Background: "Thinking aloud" is a well-established method of data collection in education, assessment, and other fields of research. However, while many researchers have reported their views on its usage, the first-hand experiences of research participants have received less attention. Purpose: The aim of this exploratory study was to…
Descriptors: Foreign Countries, Examiners, Protocol Analysis, Interrater Reliability
Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Briesch, Amy M.; Eckert, Tanya L. – Journal of Behavioral Education, 2008
More than ever, educators require assessment procedures and instrumentation that are technically adequate as well as efficient to guide data-based decision making. Thus, there is a need to understand perceptions of available tools, and the decisions made when using collected data, by the primary users of those data. In this paper, two studies that…
Descriptors: Report Cards, Observation, Formative Evaluation, School Psychologists
Yu, Kumlan; Lee, Sang Min; Nesbit, Elisabeth A. – Measurement and Evaluation in Counseling and Development, 2008
This article describes the development of the culturally valid Counselor Burnout Inventory. A multistage approach including item translation; item refinement; and evaluation of factorial validity, reliability, and score validity was used to test constructs and validation. Implications for practice and future research are discussed. (Contains 3…
Descriptors: Burnout, Cultural Relevance, Counselors, Measures (Individuals)
Versace, Francesco; Mazzetti, Michela; Codispoti, Maurizio – Assessment, 2008
The temporal stability of the effects induced by the Cued Reaction Time Task (CRTT) on the orienting of attention was assessed across four weekly sessions. Benefits, costs, and validity effects were computed for each session, and the correlation coefficients between each session were calculated (interindividual stability index). Intraindividual…
Descriptors: Cues, Reaction Time, Validity, Correlation
Ward, Dianne; Hales, Derek; Haverly, Katie; Marks, Julie; Benjamin, Sara; Ball, Sarah; Trost, Stewart – American Journal of Health Behavior, 2008
Objectives: To describe protocol and interobserver agreements of an instrument to evaluate nutrition and physical activity environments at child care. Methods: Interobserver data were collected from 9 child care centers, through direct observation and document review (17 observer pairs). Results: Mean agreement between observer pairs was 87.26%…
Descriptors: Physical Activity Level, Nutrition, Child Care Centers, Child Care
Lehto, Marybeth – Online Submission, 2009
The primary purpose of this study was to determine whether the data from the qualitative study fit Rasch model requirements for the definition of a measure, as well as to address concern in the extant literature regarding the appropriate number of items needed in analysis to assure unidimensionality. The self-report victimization scale was…
Descriptors: Qualitative Research, High School Students, Grade 9, Bullying

