Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Velazquez, Cesareo Morales – Computers in the Schools, 2008
Data from Mexico City, Mexico (N = 978) and from Texas, USA (N = 932) were used to test the predictive validity of the teacher professional development component of the Will, Skill, Tool Model of Technology Integration in a cross-cultural context. Structural equation modeling (SEM) was used to test the model. Analyses of these data yielded…
Descriptors: Structural Equation Models, Technology Integration, Predictive Validity, Foreign Countries
Peer reviewedLathrop, Richard G.; Williams, Janice E. – Educational and Psychological Measurement, 1987
A Monte Carlo study, involving 6,000 "computer subjects" and three raters, explored the reliability of the inverse screen test for cluster analysis. Results indicate that the inverse screen may be a useful and reliable cluster analytic technique for determining the number of true groups. (TJH)
Descriptors: Cluster Analysis, Computer Simulation, Interrater Reliability, Monte Carlo Methods
Peer reviewedMeier, Augustine; Boivin, Micheline – Journal of Consulting and Clinical Psychology, 1986
The Client Verbal Response Category System classifies client responses into Temporal, Directional and Experiential categories. The categories with their subcategories are defined, interjudge reliability data is presented, and the instrument's utility in psychotherapy process research is demonstrated. Initial results indicate that the instrument is…
Descriptors: Client Characteristics (Human Services), Interrater Reliability, Psychotherapy, Research Tools
Peer reviewedTowstopiat, Olga – Contemporary Educational Psychology, 1984
The present article reviews the procedures that have been developed for measuring the reliability of human observers' judgments when making direct observations of behavior. These include the percentage of agreement, Cohen's Kappa, phi, and univariate and multivariate agreement measures that are based on quasi-equiprobability and quasi-independence…
Descriptors: Interrater Reliability, Mathematical Models, Multivariate Analysis, Observation
Peer reviewedDeSanti, Roger J.; Sullivan, Vicki Gallo – Reading Psychology, 1984
Concludes that the Cloze Reading Inventory and its coding form can be reliably employed by a variety of teachers for a variety of grade levels and passages. (FL)
Descriptors: Cloze Procedure, Elementary Secondary Education, Interrater Reliability, Reading Comprehension
Peer reviewedJackson, E. A. – European Journal of Engineering Education, 1988
Investigates the marker-marker reliability of an examination for a third-year degree course in circuit theory. Reports that the coefficient of correlation between markers fell within the range 0.94 to 0.98. (YP)
Descriptors: College Science, Engineering Education, Essay Tests, Interrater Reliability
Peer reviewedSpaulding, Cheryl L. – Journal of Reading, 1989
Reviews "Written Language Assessment" (WLA), a new standardized test to evaluate children's and adolescents' written language competence by having students write essays instead of answer multiple choice questions. Finds problems with the WLA in terms of interrater reliability. (RS)
Descriptors: Elementary Secondary Education, Essay Tests, Interrater Reliability, Standardized Tests
Peer reviewedBakermans-Kranenburg, Marian J; van IJzendoorn, Marinus H. – Developmental Psychology, 1993
Examined the validity of the Adult Attachment Interview (AAI) measure by interviewing 83 mothers twice over 2 months, using different interviewers on each occasion. The results indicated that the reliability of the AAI classifications was quite high over time and across interviewers. The AAI classifications were independent of nonattachment…
Descriptors: Attachment Behavior, Examiners, Interrater Reliability, Mothers
Ottenbacher, Kenneth J.; Cusick, Anne – Journal of the Association for Persons with Severe Handicaps (JASH), 1991
The study, with 79 rehabilitation therapists evaluating 21 single-subject graphs, found that the low interrater agreement often associated with visual analysis of single-subject data may be improved by simple supplements (such as trend lines) to visually inspected charts. (Author/DB)
Descriptors: Case Studies, Data Analysis, Disabilities, Evaluation Methods
Bastick, Tony – 1999
Questionnaires often ask for estimates, and these estimates are given with different reliabilities. It is difficult to know the different reliabilities of single estimates and to take these into account in subsequent analyses. This paper contains a practical example to show that not taking the reliability of different responses into account can…
Descriptors: Questionnaires, Reliability, Responses
Peer reviewedAlsawalmeh, Yousef M.; Feldt, Leonard S. – Educational and Psychological Measurement, 1999
Develops a statistical test for the hypothesis that alpha'(1) =alpha'(2) when alpha'(1) is the Spearman-Brown extrapolated value of Cronbach's alpha reliability for test 1 and alpha'(2) is the unadjusted coefficient for test 2. The test is shown to exercise tight control of Type I error. (Author/SLD)
Descriptors: Reliability, Test Length
Peer reviewedWang, Tianyou – Applied Psychological Measurement, 1998
Derives equations for computing weights that maximize the reliability of a test with multiple parts using a congeneric model. Presents a direct derivation for the three-part case and a two-step derivation for the "n"-part case. Gives examples that show the computations and the usefulness of the equations. (SLD)
Descriptors: Equations (Mathematics), Reliability
Peer reviewedRaykov, Tenko – Multivariate Behavioral Research, 2000
Outlines a correlation structures modeling approach to the study of stability in reliability of multiple, repeatedly administered measures and illustrates the method on data from a fluid intelligence study (P. Bates and others, 1986). The method is also applicable when examining relationships between model parameters across all variables.…
Descriptors: Correlation, Models, Reliability
Peer reviewedMellenbergh, Gideon J. – Applied Psychological Measurement, 1999
Demonstrates that two aspects of precision, reliability and information, also apply to the simple gain score. Reliability applies to a population of examinees, and information applies to a given examinee. (SLD)
Descriptors: Achievement Gains, Reliability
Peer reviewedMadigan, Elizabeth A.; Fortinsky, Richard H. – Gerontologist, 2004
The Outcomes and Assessment Information Set (OASIS) is now used extensively for regulatory, reimbursement, research, and clinical purposes in home health care. However, little is known about the interrater reliability of OASIS items based on assessments from home-health-agency clinicians. Therefore, we evaluated OASIS item interrater reliability…
Descriptors: Patients, Interrater Reliability

Direct link
