Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedLunz, Mary E.; And Others – Applied Measurement in Education, 1990
An extension of the Rasch model is used to obtain objective measurements for examinations graded by judges. The model calibrates elements of each facet of the examination on a common log-linear scale. Real examination data illustrate the way correcting for judge severity improves fairness of examinee measures. (SLD)
Descriptors: Certification, Difficulty Level, Interrater Reliability, Judges
Peer reviewedEvans, Brian – Canadian Journal of Program Evaluation/La Revue canadienne d'evaluation de programme, 1995
The distinction between two models of reliability is clarified. Reliability may be conceived of and estimated from a true score model or from the perspective of sampling precision. Basic models are developed and illustrated for each approach using data from the author's work on measuring organizational climate. (SLD)
Descriptors: Data Analysis, Error of Measurement, Evaluators, Models
Peer reviewedBirchler, Gary R.; Fals-Stewart, William – Assessment, 1994
The Response to Conflict Scale, a 24-item measure of maladaptive responses to marital conflict, was evaluated psychometrically with 420 couples. The inventory showed high internal consistency, test-retest reliability, construct and discriminant validity, and classification efficiency. Clinical utility is discussed. (SLD)
Descriptors: Classification, Conflict, Construct Validity, Marital Instability
Peer reviewedDowling-Guyer, Seana; And Others – Assessment, 1994
Reliability and validity of the Risk Behavior Assessment, a questionnaire evaluating drug use and sexual human immunovirus risk behavior through self-reports, were studied with 218 drug users who also provided urine samples. Overall, self-reports of drug use and sexual behavior were reliable. (SLD)
Descriptors: Acquired Immune Deficiency Syndrome, Adults, Behavior Patterns, Drug Use
Peer reviewedLunz, Mary E.; And Others – Educational and Psychological Measurement, 1994
In a study involving eight judges, analysis with the FACETS model provides evidence that judges grade differently, whether or not scores correlate well. This outcome suggests that adjustments for differences among judges should be made before student measures are estimated to produce reproducible decisions. (SLD)
Descriptors: Correlation, Decision Making, Evaluation Methods, Evaluators
Peer reviewedEisenstadt, Toni Hembree; And Others – Child & Family Behavior Therapy, 1994
This study investigated interparental agreement of the Eyberg Child Behavior Inventory for 44 clinic-referred families. Mothers rated their children's disruptive behavior as more frequent and more problematic than did fathers. However, strong evidence for cross-informant reliability was obtained. For maternal vs. paternal reports, classification…
Descriptors: Behavior Problems, Behavior Rating Scales, Child Behavior, Fathers
Peer reviewedHuerta-Macias, Ana – TESOL Journal, 1995
Discusses the use of alternative assessment procedures in English-as-a-Second-Language classrooms, focusing on three issues: (1) definitions of alternative assessment; (2) issues related to validity, reliability, and objectivity that are often raised as objections to alternative assessment; and (3) the power of alternative assessment to provide…
Descriptors: Alternative Assessment, Definitions, English (Second Language), Evaluation Methods
Peer reviewedSmith, Gregory T.; McCarthy, Denis M. – Psychological Assessment, 1995
Instrument refinement refers to any set of procedures designed to improve an instrument's representation of a construct. Five objectives of instrument refinement are discussed, and instrument refinement practices are reviewed in a discussion of its role in the process of developing theory and sharpening construct definition. (SLD)
Descriptors: Clinical Diagnosis, Construct Validity, Definitions, Measures (Individuals)
Peer reviewedWehmeyer, Michael L.; Kelchner, Kathy – Career Development for Exceptional Individuals, 1995
This study assessed the validity and reliability of a modified, self-report version of the Autonomous Functioning Checklist for use with adults with mental retardation. Adolescents and adults (n=409) with mental retardation were interviewed. Results generally supported the instrument's validity and reliability and previous studies' findings that…
Descriptors: Adolescents, Adults, Check Lists, Mental Retardation
Peer reviewedEndler, Norman S.; Parker, James D. A. – Psychological Assessment, 1994
Four studies on the psychometric properties of the Coping Inventory for Stressful Situations (CISS), involving 682 adults and 1,592 college students, investigated factor structure and construct and content validities. Overall, results suggest that the CISS is a valid and reliable measure of basic coping styles. (SLD)
Descriptors: Adults, College Students, Construct Validity, Content Validity
Peer reviewedParatore, Jeanne R. – Topics in Language Disorders, 1995
This article provides a framework for portfolio assessment in which common benchmarks and rubrics provide explicit and shared criteria for judging both the collection of work in the portfolio and individual performance samples. Also addressed are efforts to achieve validity and reliability in teacher, student, and parent judgments while…
Descriptors: Elementary Secondary Education, Evaluation Criteria, Individualized Programs, Literacy
Peer reviewedCarlson, Robert E.; Smith-Howell, Deborah – Communication Education, 1995
Investigates reliability and validity of speech evaluation instruments used in public speaking classes. Finds that student speeches can be evaluated reliably and validly using any of a number of different evaluation forms that address the fundamental speech constructs of content and delivery. Finds that lack of extensive rater training and…
Descriptors: Communication Research, Evaluation Methods, Higher Education, Public Speaking
Peer reviewedCooper, Eileen – Journal of Creative Behavior, 1991
This paper critiques the following tests of creativity: (1) the Torrance Test of Creative Thinking; (2) the Creativity Assessment Packet; (3) subtests of the Structure of the Intellect Learning Abilities Test; (4) Thinking Creatively with Sounds and Words; (5) Thinking Creatively in Action and Movement; and (6) the Khatena-Torrance Creative…
Descriptors: Creativity, Creativity Tests, Divergent Thinking, Elementary Secondary Education
Peer reviewedRetlev, Ulla – Online Review, 1991
Examines users' requirements from online vendors, both from the point of view of an inexperienced user and an experienced information specialist. Criteria that users look for are discussed, including available information, price structure, a trial period, service and user support, training, document delivery, consistency, quality, screen design,…
Descriptors: Criteria, Information Scientists, Information Sources, Online Systems
Peer reviewedGiral, Angela; Taylor, Arlene G. – Library Resources and Technical Services, 1993
Examines the overlap of article coverage and the consistency of indexing between the "Avery Index to Architectural Periodicals" and the "Architectural Periodicals Index." The historical backgrounds of the two indexes are described, possibilities for collaboration between them are considered, and implications for users are…
Descriptors: Architecture, Comparative Analysis, Cooperation, Indexes


