Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewed. Rogers, James R.; DeShon, Richard P. – Suicide and Life-Threatening Behavior, 1992
Presents psychometric investigation of the eight-factor clinical model of the Suicide Opinion Questionnaire (SOQ) as representing the most appropriate interpretive model for the SOQ. Notes that factor-analytic and internal consistency reliability results failed to support hypothesized eight-factor model. Discusses alternative factor scheme and…
Descriptors: Factor Structure, Models, Opinions, Suicide
Harbour, Jerry L. – Performance and Instruction, 1993
Discussion of performance improvement focuses on work processes. Highlights include a definition of process; types of process steps, including operational and nonoperational; desired process characteristics, including high reliability and low variability; a comparison of two different processes; and suggestions for process improvement, including…
Descriptors: Comparative Analysis, Flow Charts, Job Analysis, Performance Factors
Peer reviewed. Beskow, Jan; And Others – Suicide and Life-Threatening Behavior, 1990
Discusses methodological and ethical issues pertaining to "psychological autopsy," an interview method for reconstruction of suicidal death through interviews with survivors, based on application of method to three studies of suicides and review of other investigations. Emphasizes consideration of integrity of deceased, integrity and health of…
Descriptors: Death, Ethics, Integrity, Interviews
Peer reviewed. Klecker, Beverly M.; Loadman, William E. – Educational and Psychological Measurement, 1998
The stability, reliability, and validity of scores on the subscales of the School Participant Empowerment Scale (P. Short and J. Rinehart, 1992) were studied with data from 4,091 Ohio classroom teachers. Confirmatory factor analysis did not confirm the subscales identified by the instrument developers. Exploratory factor analysis was used to…
Descriptors: Empowerment, Participative Decision Making, Reliability, Teachers
Peer reviewed. Kember, David; Jones, Alice; Loke, Alice; McKay, Jan; Sinclair, Kit; Tse, Harrison; Webb, Celia; Wong, Frances; Wong, Marian; Yeung, Ella – International Journal of Lifelong Education, 1999
A coding method for measuring reflective thinking in student journals was tested twice, demonstrating acceptable reliability among evaluators and supporting the precision of the guidelines for coding. Coding categories were as follows: habitual action, introspection, thoughtful action, content reflection, process reflection, content and process…
Descriptors: Adult Education, Coding, Evaluation Methods, Interrater Reliability
Peer reviewed. Berning, Lisa C.; Weed, Nathan C.; Aloia, Mark S. – Assessment, 1998
To examine the interrater reliability of the Ruff Figural Fluency Test (RFFT) (R. Ruff, 1988), 124 college students completed the measure and scored RFFT test protocols. Results indicated substantial interscorer reliability on the RFFT, particularly for number of unique designs. Reliability was lower for scoring perseverative errors and error…
Descriptors: College Students, Higher Education, Interrater Reliability, Scoring
Peer reviewed. Coenders, Germa; Saris, Willem E.; Batista-Foguet, Joan M.; Andreenkova, Anna – Structural Equation Modeling, 1999
Illustrates that sampling variance can be very large when a three-wave quasi simplex model is used to obtain reliability estimates. Also shows that, for the reliability parameter to be identified, the model assumes a Markov process. These problems are evaluated with both real and Monte Carlo data. (SLD)
Descriptors: Estimation (Mathematics), Markov Processes, Monte Carlo Methods, Reliability
Peer reviewed. Snowden, Lonnie R.; Hines, Alice M. – Journal of Black Psychology, 1999
Investigated an acculturation scale designed for use in the African-American population. Responses from more than 900 African Americans generally indicate an African-American orientation within the sample, although there are notable variations on all 10 scale items. Discusses evidence for scale reliability and validity. (SLD)
Descriptors: Acculturation, Adults, Blacks, Psychological Characteristics
Peer reviewed. Lee, Guemin; Frisbie, David A. – Applied Measurement in Education, 1999
Studied the appropriateness and implications of using a generalizability theory approach to estimating the reliability of scores from tests composed of testlets. Analyses of data from two national standardization samples suggest that manipulating the number of passages is a more productive way to obtain efficient measurement than manipulating the…
Descriptors: Generalizability Theory, Models, National Surveys, Reliability
Peer reviewed. Theron, C. C. – Educational and Psychological Measurement, 1999
Studies the effects of a joint correction for criterion unreliability and Case 1 selection on the parameters of the decision function in personnel selection. Illustrates the joint correction equations. (SLD)
Descriptors: Criteria, Decision Making, Equations (Mathematics), Personnel Selection
Peer reviewed. Jarjoura, David; Hartman-Stein, Paula; Speight, Joan; Reuter, Jeanette – Educational and Psychological Measurement, 1999
Examined the reliability and construct validity in an older adult population (n=149 older adults and their informants) of scores on the Behavioral Competence Inventory (BCI) (P. Hartman-Stein). Results indicate that scores on the BCI's seven scales show adequate internal consistencies and represent seven overlapping but distinct constructs in this…
Descriptors: Behavior Patterns, Competence, Construct Validity, Older Adults
Peer reviewed. Sanders, Piet F.; Verschoor, Alfred J. – Applied Psychological Measurement, 1998
Presents minimization and maximization models for parallel test construction under constraints. The minimization model constructs weakly and strongly parallel tests of minimum length, while the maximization model constructs weakly and strongly parallel tests with maximum test reliability. (Author/SLD)
Descriptors: Algorithms, Models, Reliability, Test Construction
Peer reviewed. Doble, Susan E.; Fisk, John D.; Lewis, Norma; Rockwood, Kenneth – Occupational Therapy Journal of Research, 1999
The findings of a study of 55 elderly adults support the test-retest reliability of the Assessment of Motor and Process Skills, illustrate the utility of alternative methods for examining the reliability of individual subjects' measures, and indicate that not all test-retest differences represent measurement error. (Author/JOW)
Descriptors: Error of Measurement, Older Adults, Psychomotor Skills, Test Reliability
Peer reviewed. Voorhees, Ellen M. – Information Processing & Management, 2000
Discusses the test collections developed in the TREC (Text REtrieval Conference) workshops for information retrieval research and describes a study by NIST (National Institute of Standards and Technology) that verified their reliability by investigating the effect changes in the relevance assessments have on the evaluation of retrieval results.…
Descriptors: Correlation, Information Retrieval, Relevance (Information Retrieval), Reliability
Peer reviewed. Caruso, John C. – Educational and Psychological Measurement, 2000
Performed a reliability generalization using 244 studies that used NEO personality scales. Reliability estimates were given in only 15% of these studies, and 44% made no mention of reliability at all. Results suggest that many researchers have an inadequate understanding of concepts of reliability. Results also suggest that NEO personality scales…
Descriptors: Estimation (Mathematics), Generalization, Personality Assessment, Personality Measures


