Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedGordon, Roberta R. – Journal of Educational Research, 1988
Investigation into the most effective use of a kindergarten screening battery to predict second-grade reading and mathematics achievement found that a combination of 10 readiness subtests resulted in the same degree of accuracy as that obtained using the entire battery. However, neither version was accurate enough to be useful. (Author/CB)
Descriptors: Kindergarten, Mathematics Achievement, Predictive Validity, Primary Education
Peer reviewedWeiss, David J., Ed. – Applied Psychological Measurement, 1987
Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)
Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis
Peer reviewedAbrahams, Ruby; And Others – Evaluation Review, 1988
A methodology for developing clinical/research assessment tools, training interviewers, and continuously assessing interrater reliability is discussed. Data from a multisite national evaluation of long-term health care programs (i.e., the Social/Health Maintenance Organization (HMO) for elderly clients) are used. Focus is on providing research…
Descriptors: Clinical Diagnosis, Data Collection, Health Facilities, Health Programs
Peer reviewedMeetz, Harriet K.; And Others – Journal of Dental Education, 1988
Rating scales that were potentially useful for evaluating clinical performance were identified and were pretested for content and construct validity as well as interrater reliability. The range of interrater reliabilities was determined, and the construct and predictive validity of the rating using three classes of dental students was tested. (MLW)
Descriptors: Clinical Experience, Dental Schools, Dentistry, Higher Education
Peer reviewedFimian, Michael J. – Psychology in the Schools, 1988
Investigated internal consistency and split-half reliability of Teacher Stress Inventory (TSI) based on data provided by 3,478 teachers. Data indicated for both the strength and frequency dimensions that the TSI was highly reliable in terms of both its regular and short-form length, and that the two short forms were highly correlated with each…
Descriptors: College Faculty, Elementary Secondary Education, Higher Education, Special Education Teachers
Peer reviewedFitzgerald, Louise F.; And Others – Journal of Vocational Behavior, 1988
Describes development of Sexual Experiences Questionnaire to assess sexual harassment. Reports on results of psychometric analyses, application of inventory to two large public universities, and development of second form of the inventory designed for working women. Discusses results for large sample of academic, professional and semiprofessional,…
Descriptors: College Students, Employed Women, Higher Education, Sexual Harassment
Peer reviewedDengerink, Joan E.; Bean, Roxanne E. – Language, Speech, and Hearing Services in Schools, 1988
Author-supplied item labels for two common speech discrimination tests were compared with those given spontaneously by 40 children (median age 5:5). Agreement between subjects' and authors' labels was 76.3 percent on the Word Intelligibility by Picture Identification test and 75 percent on the Northwestern University Children's Perception of…
Descriptors: Auditory Discrimination, Expressive Language, Item Analysis, Language Handicaps
Peer reviewedJafarpur, Abdoljavad – System, 1988
Investigation of non-native English speakers' ratings of other non-native English learners' oral proficiency. Results indicate that the judges' ratings significantly differed, and the average of three judges' ratings was a better appraisal of the testee's true ability than that of any single rating or pair of ratings. (Author/CB)
Descriptors: English (Second Language), Evaluation Methods, Foreign Countries, Interrater Reliability
Peer reviewedHutton, Jerry B.; And Others – Psychology in the Schools, 1987
Special education, basic, and honors ninth-grade students (n=60) rated the severity of stress for each of the life events on the Source of Stress Inventory (Chandler, 1981). There was a significant positive relationship between the Chandler rankings (teachers and mental health workers) and the student rankings. (Author/NB)
Descriptors: Grade 9, Interrater Reliability, Mental Health, Secondary Education
Peer reviewedLuecht, Richard M. – Educational and Psychological Measurement, 1987
Test Pac, a test scoring and analysis computer program for moderate-sized sample designs using dichotomous response items, performs comprehensive item analyses and multiple reliability estimates. It also performs single-facet generalizability analysis of variance, single-parameter item response theory analyses, test score reporting, and computer…
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Item Analysis
Peer reviewedJones, Randy M.; Streitmatter, Janice L. – Adolescence, 1987
Examined Extended Objective Measure of Ego Identity Status for reliability and validity among 467 secondary school students. Results were supportive of appropriateness of all measures for the subjects. Analysis of reliability, validity, demographic characteristics, and psychosocial maturity yielded results which parallel theoretical framework and…
Descriptors: Adolescent Development, Adolescents, College Students, Secondary Education
Peer reviewedLustig, Myron W. – Small Group Behavior, 1987
Investigated reliability and dimensionality of Bales's Interpersonal Rating Forms (IRF) using volunteer subjects (N=266) enrolled in undergraduate communications course. Results documented shortcomings of IRF as a measuring instrument finding the subscales neither reliable nor dimensionally structured; only 2 of 18 items in each subscale are…
Descriptors: College Students, Group Behavior, Groups, Higher Education
Peer reviewedLooney, Marilyn A. – Research Quarterly for Exercise and Sport, 1987
The characteristics of three threshold loss agreement indices which reflect the agreement or consistency in assignment to mastery-nonmastery status are reviewed. These are proportion of agreement, coefficient kappa, and modified kappa. (Author/MT)
Descriptors: Confidence Testing, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education
Peer reviewedMadsen, Harold – CALICO Journal, 1986
Evaluates one of the first operational computerized-adaptive English-as-a-second-language tests in the United States, showing an overwhelmingly positive student reaction to the tests and higher effectiveness than conventional paper-and-pencil tests. (Author/CB)
Descriptors: Anxiety, Computer Assisted Testing, English (Second Language), Language Tests
Winston, Roger B., Jr.; Polkosnik, Mark C. – Journal of College Student Personnel, 1986
Summarizes reliability and validity studies reported about the Student Developmental Task Inventory, second edition (SDTI-2), an objective assessment instrument based on Chickering's theory of psychosocial development described in Education and Identity. Outlines other findings related to differences in psychosocial development. (Author/ABB)
Descriptors: College Students, Developmental Tasks, Higher Education, Self Concept


