Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Hoepfner, Ralph – 1970
The present evaluation attempts to simplify the task of incorporating the affective components into an existing system of goals for the school. The two major sources used in collecting the affective goals were existing personality tests and psychological theories of personality. After compiling the affective goals, the team developed their own…
Descriptors: Affective Behavior, Children, Evaluation, Measurement
Larkins, A. Guy; Shaver, James P. – 1967
There exist special problems in testing first-grade children. Orally administered yes-no tests reduce the problems found in the other types, but they have their own drawbacks. A solution to some of these drawbacks is the use of the matched-pair scoring technique. For each "yes" item on the test there is included a "reversed" or…
Descriptors: Achievement Tests, Economics, Grade 1, Primary Education
Comparison of Yes-No, Matched-Pairs, and All-No Scoring of a First-Grade Economics Achievement Test.
Larkins, A. Guy; Shaver, James P. – 1968
Developing practical achievement tests for use at the primary-grade level is a difficult task. Some problems encountered appear to be resolved by using verbally administered yes-no tests. But such tests are criticized as having a low reliability because they offer only two choices. Two modifications of the yes-no test have been proposed to…
Descriptors: Achievement Tests, Comparative Analysis, Primary Education, Test Construction
Rankin, Richard J.; Henderson, Ronald W. – 1969
The purpose of this paper is to evaluate the reliability of the Wechsler Preschool and Primary Scale of Intelligence and to measure whether this reliability is affected when subjects are from a disadvantaged group. The subjects were 25 male and 24 female 5 1/2-year-old poor Mexican-Americans. Generally, the Wechsler Preschool Scale showed high…
Descriptors: Cultural Influences, Disadvantaged, Intelligence Tests, Mexican Americans
Faizunisa, Ali; Costello, Joan – 1969
This study reports an attempt to improve the administration of the Peabody Picture Vocabulary Test (PPVT) by identifying and modifying aspects of the test which adversely affect disadvantaged preschoolers' performance. The resultant test was called the Modified Peabody Picture Vocabulary Test (M-PPVT). Two samples from the same lower class…
Descriptors: Disadvantaged, Improvement, Intelligence Tests, Pictorial Stimuli
Glasnapp, Douglas R. – 1967
The Metropolitan Readiness tests, first published in 1948 (forms R and S), were revised in 1966 (forms A and B). This study was instigated as a result of the charge that the revisions of the tests made them more difficult and more unfair to deprived children. Thirty-six Caucasian beginning first graders (divided evenly by high and low…
Descriptors: Comparative Analysis, Disadvantaged, Evaluation, Predictive Measurement
Unks, Nancy J. – 1967
The testing sub-program is designed to provide the diagnostic instruments necessary to measure pupil progress through the Individually Prescribed Instruction (IPI) curricula. Its objectives are to provide information about pupils which teachers can use to direct each child's individual learning program, to provide the measurements necessary for…
Descriptors: Individualized Instruction, Program Evaluation, Test Construction, Test Interpretation
Kane, Michael T.; Moloney, James M. – 1974
Gilman and Ferry have shown that when the student's score on a multiple choice test is the total number of responses necessary to get all items correct, substantial increases in reliability can occur. In contrast, similar procedures giving partial credit on multiple choice items have resulted in relatively small gains in reliability. The analysis…
Descriptors: Feedback, Guessing (Tests), Multiple Choice Tests, Response Style (Tests)
Finkel, A.; Norman, G. R. – 1973
Two modes of evaluation are compared: the summary evaluation by supervisors performed at six-month intervals, and the technique of direct observation of a clinical encounter through one-way glass. The sample consists of 17 residents in pediatrics who were evaluated, using both methods, over an eight-month interval. The analysis of data indicates…
Descriptors: Clinical Experience, Comparative Analysis, Evaluation Methods, Formative Evaluation
Ekstrom, Ruth B.; And Others – 1974
This report is part of a general study of Reference Measures for Cognitive and Noncognitive Factors. The specific activity that is being reported is the development of "factor-referenced" tests or "marker" tests for several cognitive factors related to divergent production (i.e., ability to produce a variety of words, phrases,…
Descriptors: Cognitive Ability, Cognitive Processes, Creativity, Divergent Thinking
Schwartz, Howard P. – 1974
Distinction between norm referenced and criterion referenced tests are explored in relationship to underlying philosophy and intent. In considering the use of a criterion referenced test for instructional purposes, consideration is given to: specification of objectives, item content and selection, reliability, and needs assessment. (Author)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Educational Assessment, Educational Needs
Diederich, Paul B. – 1973
Written by an ex-Latin teacher, short-cuts to analyzing test results for the non-mathematical teacher are provided. Discussions are given of item analysis (item analysis by a show of hands, standards for test items: success, standards for test items: discrimination, and the second stage of item analysis. The standard error is then presented (the…
Descriptors: Correlation, Error of Measurement, Guides, Item Analysis
Doppelt, Jerome E. – Test Service Bulletin, 1956
The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…
Descriptors: Bulletins, Error of Measurement, Measurement Techniques, Reliability
PDF pending restorationTaylor, Anne P.; Helmstadter, G. C. – 1971
A pair comparison scale for measuring aesthetic judgment which could be used with four and five year old children was developed by having art experts independently judge for "aesthetic quality" color slides representing a variety of stimuli on an eleven-point successive category scale. The scale was administered to forty children on two…
Descriptors: Art Appreciation, Childhood Attitudes, Childhood Interests, Pictorial Stimuli
Koos, Eugenia M. – 1970
Problems in assessing the validity and reliability of the Mid-Continent Regional Educational Laboratory (McREL) tests of the inquiry skills of biology students are discussed by reference to the first trial version of the first Explorations in Biology (EIB) booklets. Since students learn during the two parts of the test, coefficients of stability…
Descriptors: Biology, Evaluation, Inquiry, Measurement


