Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedBurton, Richard F. – Assessment & Evaluation in Higher Education, 2001
Describes four measures of test unreliability that quantify effects of question selection and guessing, both separately and together--three chosen for immediacy and one for greater mathematical elegance. Quantifies their dependence on test length and number of answer options per question. Concludes that many multiple choice tests are unreliable…
Descriptors: Guessing (Tests), Mathematical Models, Multiple Choice Tests, Objective Tests
Peer reviewedBurry-Stock, Judith A.; And Others – Educational and Psychological Measurement, 1996
It is argued that interrater agreement is a psychometric property which is theoretically different from classic reliability. Formulas are presented to illustrate a set of algebraically equivalent rater agreement indices that are intended to provide educational and psychological researchers with a practical way to establish a measure of rater…
Descriptors: Algebra, Educational Research, Interrater Reliability, Measures (Individuals)
Peer reviewedPoznanski, Joseph J.; McLennan, Jim – Journal of Counseling Psychology, 1995
Discusses issues in conceptualizing and measuring counselors' theoretical orientations to practice. Evaluates 15 instruments previously proposed as measures of counselors' and therapists' theoretical orientations, and examines psychometric properties and the utility of each instrument. Few instruments show evidence of reliability and even fewer…
Descriptors: Counseling Techniques, Counseling Theories, Higher Education, Measurement Techniques
Peer reviewedLuzzo, Darrell Anthony – Journal of Counseling & Development, 1996
Provides a thorough psychometric evaluation of the Career Decision-Making Self-Efficacy Scale. Provides a summary of the initial construction and development of the scale, followed by a comprehensive review of the results from various investigations on its reliability and validity. Concludes with ideas for additional research. (JPS)
Descriptors: Career Choice, Decision Making, Higher Education, Psychometrics
Peer reviewedGoldsmith, H. H. – Child Development, 1996
Data on 11 samples of 1,012 toddlers used to construct and validate the Toddler Behavior Assessment Questionnaire (TBAQ) revealed that the component of negative affectivity (anger proneness and fearfulness) were independent, and item analysis suggested that shyness and other fears were independent as well. (MDM)
Descriptors: Anger, Child Behavior, Fear, Personality
Peer reviewedFry, Richard P. W.; And Others – Child Abuse & Neglect: The International Journal, 1996
Comparison of 2 interviews, with either a male or female interviewer, of 56 women who reported a history of sexual abuse found approximately one-third of the incidents were reported in only 1 interview, with gender of interviewer making little apparent difference. Contrary to expectation, subjects appeared to be more forthcoming at the first…
Descriptors: Child Abuse, Disclosure, Females, Interviews
Peer reviewedAxelrod, Bradley N.; And Others – Psychological Assessment, 1996
The calculations of D. Schretlen, R. H. B. Benedict, and J. H. Bobholz for the reliabilities of a short form of the Wechsler Adult Intelligence Scale--Revised (WAIS-R) (1994) consistently overestimated the values. More accurate values are provided for the WAIS--R and a seven-subtest short form. (SLD)
Descriptors: Error Correction, Error of Measurement, Estimation (Mathematics), Intelligence Tests
Peer reviewedSherrill, Joel T.; And Others – Behavior Modification, 1996
The importance of discipline consistency in elementary school students (n=18) is examined by varying the probability of punishment and the nature of the discipline agent's response to manipulated transgressions. Discusses the importance of conceptualizing discipline consistency as a multivariate construct, and variables and parameters that…
Descriptors: Behavior, Behavior Change, Behavior Modification, Classroom Techniques
Peer reviewedMcCormick, Bryan P. – Therapeutic Recreation Journal, 2000
Reviews the rationale for and implications of case study research in therapeutic recreation, examining: what can be learned from studying a single case; issues of validity and reliability; ethical conduct of research; and the practice of case study research (case protocol, case selection, collecting data, analyzing and interpreting data, and…
Descriptors: Case Studies, Data Analysis, Data Collection, Data Interpretation
Peer reviewedBlinn-Pike, Lynn; Mingus, Suzanne – Journal of Adolescence, 2000
Surveys 105 adolescent mothers at approximately two months postpartum, using the Child Abuse Potential Inventory (CAP), to assess the reliability and the value of using this measure. The results determined that reliabilities were low for the CAP abuse scale. It concludes that further research is needed to understand the psychometric properties of…
Descriptors: Adolescents, Child Abuse, Early Parenthood, Infants
Peer reviewedWatkins, Marley W. – School Psychology Quarterly, 2000
Reviews the results of four studies included in this issue of "School Psychology Quarterly" which found all four cognitive profile reports lacking reliability, validity, or diagnostic utility. Argues that ipsative methods are inferior to normative methods in cognitive assessment. Recommends that psychologists eschew the application of…
Descriptors: Clinical Diagnosis, Cognitive Measurement, Intelligence Tests, Profiles
Peer reviewedPope, Raechele L.; Mueller, John A. – Journal of College Student Development, 2000
Discusses the development of the Multicultural Competence in Student Affairs-Preliminary 2 (MCSA-P2) Scale, an assessment tool to measure multicultural competence in a higher education context. Reports the results of two studies that investigated the validity and reliability of the MCSA-P2. Explores future research needs as well as applications…
Descriptors: Cultural Pluralism, Evaluation Methods, Higher Education, Measurement Techniques
Peer reviewedCoil, Carolyn – International Schools Journal, 2000
States that the Internet is one of the most commonly used methods of obtaining information for today's students. Provides guidelines for using the Internet as a reliable source of credible and accurate information. Lists nine general guidelines for reliability of e-mail sources, and 11 for web sources. (CW)
Descriptors: Educational Technology, Higher Education, Information Sources, Information Utilization
Peer reviewedJohnson, Robert L.; McDaniel, Fred, II; Willeke, Marjorie J. – American Journal of Evaluation, 2000
Studied the interrater reliability of a portfolio assessment used in a small-scale program evaluation. Investigated analytic, combined analytic, and holistic family literacy portfolios from an Even Start program. Results show that at least three raters are needed to obtain acceptable levels of reliability for holistic and individual analytic…
Descriptors: Family Literacy, Holistic Approach, Interrater Reliability, Portfolio Assessment
Peer reviewedAndre, Kate – Nurse Education Today, 2000
Although assessment of nursing students' clinical performance is difficult, objections to grading may be based on erroneous assumptions. A combination of criterion- and norm-referenced assessment can clarify minimum competency requirements and reward meritorious performance. (SK)
Descriptors: Clinical Experience, Competence, Criterion Referenced Tests, Foreign Countries


