Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Ahadi, Stephan A.; And Others – 1990
The reliability and validity of teacher ratings, the relationship between teacher ratings and principal self-reports of instructional leadership, and the degree to which they are influenced by demographic factors are examined in this study. Methodology involved completion of the Instructional Leadership Inventory, a self-report measure, by 81…
Descriptors: Educational Environment, Elementary Secondary Education, Institutional Characteristics, Instructional Leadership
Aydin, Selami – Turkish Online Journal of Educational Technology - TOJET, 2006
This research aimed to investigate the effect of computers on the test and inter-rater reliability of writing test scores of ESL learners. Writing samples of 20 pen-paper and 20 computer group students were scored in analytic scoring method by two scorers, and then the scores were analyzed in Alpha (Cronbach) model. The results showed that the…
Descriptors: Foreign Countries, College Students, Computer Assisted Testing, English (Second Language)
Bunch, Michael B.; Littlefair, Wendy – 1988
A total of 2,000 essays written by 1,000 students was submitted to generalizability analyses for domain-referenced tests. Each student had written one essay on each of two prompts representing two models of discourse. Each essay was read by six readers and judged on a scale of from 1 to 4. No reader read essays from both prompts. Reader agreement…
Descriptors: Cutting Scores, Essay Tests, Generalizability Theory, Interrater Reliability
Primoff, Ernest S. – 1971
This report shows how Beta weights for the J-Coefficient may be easily developed without a formal validity study, and indicates how indications of ability other than tests can be used to measure the same abilities that are measured by tests. See also TM 001 163-64,166 for further information on job elements (J-Scale) procedures. (Author/DLG)
Descriptors: Achievement Rating, Correlation, Evaluation Criteria, Occupational Tests
Love, Judith A.; And Others – 1977
Perhaps more than ever before, college teaching is being studied and evaluated. This paper describes the development of a simple descriptive instrument used to focus observers' classifications and ratings of college teachers' instructional behaviors as recorded on video tape. The need for such an instrument is reviewed, the methodology for testing…
Descriptors: Classroom Observation Techniques, College Instruction, Correlation, Factor Analysis
Gilbert, Sharon L. – 1997
This study examined whether variations in the Developmental Observation Checklist (DC) format influences congruence of scores among both parents and the child's teacher. The DC was varied by adding pictorial illustrations and examples and having three response categories instead of two. Results from 100 sets of participants were evaluated with…
Descriptors: Check Lists, Developmental Delays, Early Intervention, Fathers
Peer reviewedLee, Steven W.; And Others – Behavioral Disorders, 1994
The Child Behavior Checklist and related forms were completed for 171 boys referred for school-based assessment resulting from academic and/or behavioral problems. Adolescents consistently underreported behavioral problems relative to parents and teachers regardless of subsequent diagnosis. Implications of these discrepancies in school-based…
Descriptors: Adolescents, Behavior Problems, Disability Identification, Educational Diagnosis
Peer reviewedMcCrae, Robert R. – Multivariate Behavioral Research, 1993
To assess cross-observer agreement on personality profiles, an Index of Profile Agreement and an associated coefficient are proposed that take into account both the difference between the ratings and the extremes of their mean. Data from the Revised NEO Personality Inventory for 250 peer ratings/self-reports and 68 spouse ratings/self-reports…
Descriptors: Adults, Comparative Analysis, Equations (Mathematics), Evaluation Methods
Peer reviewedOren, Thomas A.; Ruhl, Kathy L. – Early Childhood Education Journal, 2000
Investigated the reliability and item appropriateness, as discerned by adults affiliated with an infant center, of the Caregiver-Environment Scale (CES). Found the CES to be an easy to use, reliable instrument for evaluation. (Author/SD)
Descriptors: Caregiver Child Relationship, Child Caregivers, Child Development, Day Care
Rockwell, Pam; Dunham, Mardis – Art Therapy: Journal of the American Art Therapy Association, 2006
This study explored the use of the Formal Elements Art Therapy Scale (FEATS) with a population of persons with a DSM-IV diagnosis of Substance Use Disorder who were court ordered for treatment. Two groups of adults (N = 40) were closely matched on age, gender, race, socioeconomic status and education level, and were administered the Person Picking…
Descriptors: Measures (Individuals), Interrater Reliability, Group Membership, Art Therapy
Haberman, Shelby J. – ETS Research Report Series, 2005
Some probabilistic illustrations of the reliability coefficient are provided to assist in interpretation of this measure. All explanations are derived under the assumption that the joint distribution of examinee scores from two parallel tests is well approximated by a bivariate normal distribution.
Descriptors: Probability, Reliability, Intervals, Computation
Froman, Richard L., Jr. – 1988
The reliability of a taxonomy of humor was tested in two studies. The first study involved rater identification of nine categories for humorous incidents excerpted from television comedy programs (wordplay, exaggeration/understatement, contrast, audience knowledge, aggression, emotion, taboo, pratfall/slapstick, and repetition). The second study,…
Descriptors: Classification, Humor, Interrater Reliability, Psychometrics
Brown, R. L. – 1987
This paper explores the use of K. G. Joreskog's (1970) congeneric modeling approach to reliability using censored quantitative variables. Two Monte Carlo studies were conducted. The first explored the robustness of Normal Theory Generalized Least-Squares (NTGLS) estimates for a single-factor congeneric model across several sample sizes…
Descriptors: Interrater Reliability, Monte Carlo Methods, Sample Size
Peer reviewedRothman, Carole – Psychology in the Schools, 1974
The purpose of this study is to determine (1) whether the previously observed vulnerability of WISC subtests to tester effects appeared under ordinary testing conditions, and (2) which subtests were most susceptible to these effects. Results support the presence of both general and differential vulnerability of subtests. (Author)
Descriptors: Examiners, School Psychologists, Statistical Bias, Test Reliability
Peer reviewedMash, Eric J.; Makohoniuk, George – Child Development, 1975
This study was designed to: (1) assess the influence of an instructional set given to an observer regarding the presence or absence of a predictable pattern in the observed interaction, (2) extend and replicate findings of a previous study of observer accuracy, and (3) identify some of the specific types of errors made by observers in coding…
Descriptors: Measurement Techniques, Observation, Performance, Predictor Variables


