Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Tonelson, Stephen W. – 1978
The purpose of the study was to assess the reliability and the validity of the Ski Hi Language Development Scale which was designed to determine the receptive and the expressive language levels of hearing impaired children from birth to age 5. The reliability of the instrument was estimated through: (1) internal consistency, (2) inter-rater…
Descriptors: Expressive Language, Hearing Impairments, Language Acquisition, Preschool Education
Hayford, Paul D.; Salter, Ruth – 1978
Reading comprehension involves a number of distinctly different intellectual skills that can be assessed if the proper techniques are employed. As part of a reading assessment system, two measures of literal comprehension were developed: the Literal Comprehension Details Test (LCDT) and the Paraphrase Reading Test (PRT). Both the LCDT and the PRT…
Descriptors: Measurement Techniques, Reading Comprehension, Reading Tests, Test Construction
Schwarz, J. Conrad – 1981
The construct validity of four measures of delay of gratification as indices of a stable personality disposition--that is, the disposition to adaptively delay gratification--is examined. The four measures (Choice-to-Delay, Wait-to-Accumulate, Wait-upon-Request, and Ratings by Knowledgeable Informants) are described and empirical evidence that…
Descriptors: Delay of Gratification, Measures (Individuals), Personality Traits, Preschool Children
Cureton, Edward E. – 1973
Presented are the methodology and results of an equipercentile equating study in which subtests of the following three editions of multiple aptitude test batteries, in widespread use in 1960, were equated to the tests of the Project TALENT test battery: Flanagan Aptitude Classification Tests (1957); Differential Aptitude Tests (1947) and; the…
Descriptors: Aptitude Tests, Equated Scores, Raw Scores, Secondary Education
Munoz-Colberg, Magda – 1977
The logical foundations of deduction and induction are outlined to form the rules for the construction of a set of tests of reasoning ability. Both deduction and induction involve the derivation of a conclusion from a set of premises. Deductive logic uses syllogisms and is abstract. Inductive logic is both empirical and abstract. Although…
Descriptors: Abstract Reasoning, Cognitive Tests, Deduction, Induction
Bachelor, Barry; And Others – 1980
Verbal responses to several tests of originality were subjectively rated for originality. The ratings were compared with the statistical frequency of these responses in two samples of test takers, 150 elementary school children and 60 college students. The elementary school children were administered three measures from the Wallach and Kogar test…
Descriptors: Creativity, Creativity Tests, Elementary Education, Higher Education
Ysseldyke, James E. – 1977
The author traces reasons to support his contention that the state of the art in assessing learning disabled students is not good. Among issues examined are the following: use of tests for purposes other than those for which they were intended; technical adequacy of currently used tests (standardization, reliability, validity); the use of deficit…
Descriptors: Evaluation Methods, Learning Disabilities, Student Evaluation, Test Bias
DAVIS, FREDERICK B. – 1967
A STUDY DESCRIBED AS THE FIRST APPLICATION OF CROSS-VALIDATED UNIQUENESS ANALYSIS TECHNIQUES WAS DESIGNED TO ELIMINATE THE EFFECTS OF IMPERFECTIONS IN A PRIOR FACTOR-ANALYTIC STUDY OF READING COMPREHENSION WHICH USED TESTS ESPECIALLY CONSTRUCTED TO MEASURE MENTAL SKILLS IN READING. A UNIQUENESS ANALYSIS BASED ON LARGE SAMPLES WAS USED TO OBTAIN…
Descriptors: Grade 12, Reading Comprehension, Reading Research, Reading Skills
Marcus, Robert F.; And Others – 1980
Time sampled observations of the cooperative behavior of two samples of 31 preschool children were analyzed for stability (that is, short term reliability of behavior) over a 2-month period using Cronbach's generalizability coefficient. Observations were made during free play periods on nursery school settings. The observation schedule required…
Descriptors: Behavioral Science Research, Cooperation, Observation, Play
Naccarato, Richard W.; Gillmore, Gerald M. – 1976
This paper involves an application of generalizability theory in assessing the dependability of a foreign language placement exam. The French Cloze test was administered to students within five levels of French classes and the results were scored by four different raters. Three specific generalizability coefficients are discussed along with…
Descriptors: College Students, French, Higher Education, Measurement Techniques
Mare, Robert D.; Mason, William M. – 1978
An important class of applications of measurement error or constrained factor analytic models consists of comparing models for several populations. In such cases, it is appropriate to make explicit statistical tests of model similarity across groups and to constrain some parameters of the models to be equal across groups using a priori substantive…
Descriptors: Factor Analysis, Goodness of Fit, Information Sources, Mathematical Models
Smith, Sandra E.; And Others – 1978
A correction of the standard F-ratio for unreliability of the dependent measure has recently been proposed by Winne; the rationale is analogous to that of correcting a correlation for attenuation. However, there are two problems associated with Winne's correction of which potential users should be aware. First, the corrected statistic, F*, has…
Descriptors: Analysis of Variance, Hypothesis Testing, Reliability, Research Problems
Bergquist, Constance C. – 1979
The reliability of evaluators' judgments was investigated, using a discrepancy evaluation model within a program evaluation context. Two research problems were addressed: the degree of evaluator reliability in detecting and rating problems in federal educational project planning, management, and evaluation; and the relationship between inter-judge…
Descriptors: Decision Making, Evaluation Criteria, Evaluation Methods, Evaluators
Webb, Jeaninne Nelson; Brown, Bob Burton – 1969
A study was designed to (1) compare two types of reliability in the observation of teachers' behavior, (2) explore the relationship between observer reliability and the validity of their systematic classroom observations, and (3) investigate the effects of training, observer beliefs, and the passage of time on reliability and validity estimates.…
Descriptors: Beliefs, Classroom Observation Techniques, Educational Experiments, Educational Researchers
Moyer, Judith E.; Fishbein, Ronald L. – 1977
The problem that this research addressed was one of decision making. Given three sets of criterion-referenced tests which were designed to be parallel in content, would a traditional reliability coefficient produce different decisions about the reliability of those tests than would kappa? The procedure used collected statewide results on 136 test…
Descriptors: Analysis of Variance, Comparative Analysis, Criterion Referenced Tests, Measurement Techniques


