Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Faizunisa, Ali; Costello, Joan – 1969
This study reports an attempt to improve the administration of the Peabody Picture Vocabulary Test (PPVT) by identifying and modifying aspects of the test which adversely affect disadvantaged preschoolers' performance. The resultant test was called the Modified Peabody Picture Vocabulary Test (M-PPVT). Two samples from the same lower class…
Descriptors: Disadvantaged, Improvement, Intelligence Tests, Pictorial Stimuli
Glasnapp, Douglas R. – 1967
The Metropolitan Readiness tests, first published in 1948 (forms R and S), were revised in 1966 (forms A and B). This study was instigated as a result of the charge that the revisions of the tests made them more difficult and more unfair to deprived children. Thirty-six Caucasian beginning first graders (divided evenly by high and low…
Descriptors: Comparative Analysis, Disadvantaged, Evaluation, Predictive Measurement
Unks, Nancy J. – 1967
The testing sub-program is designed to provide the diagnostic instruments necessary to measure pupil progress through the Individually Prescribed Instruction (IPI) curricula. Its objectives are to provide information about pupils which teachers can use to direct each child's individual learning program, to provide the measurements necessary for…
Descriptors: Individualized Instruction, Program Evaluation, Test Construction, Test Interpretation
Kane, Michael T.; Moloney, James M. – 1974
Gilman and Ferry have shown that when the student's score on a multiple choice test is the total number of responses necessary to get all items correct, substantial increases in reliability can occur. In contrast, similar procedures giving partial credit on multiple choice items have resulted in relatively small gains in reliability. The analysis…
Descriptors: Feedback, Guessing (Tests), Multiple Choice Tests, Response Style (Tests)
Ekstrom, Ruth B.; And Others – 1974
This report is part of a general study of Reference Measures for Cognitive and Noncognitive Factors. The specific activity that is being reported is the development of "factor-referenced" tests or "marker" tests for several cognitive factors related to divergent production (i.e., ability to produce a variety of words, phrases,…
Descriptors: Cognitive Ability, Cognitive Processes, Creativity, Divergent Thinking
Schwartz, Howard P. – 1974
Distinctions between norm-referenced and criterion-referenced tests are explored in relation to their underlying philosophy and intent. In considering the use of a criterion-referenced test for instructional purposes, attention is given to: specification of objectives, item content and selection, reliability, and needs assessment. (Author)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Educational Assessment, Educational Needs
Diederich, Paul B. – 1973
Short-cuts to analyzing test results, written by an ex-Latin teacher, are provided for the non-mathematical teacher. Discussions are given of item analysis (item analysis by a show of hands, standards for test items: success, standards for test items: discrimination, and the second stage of item analysis). The standard error is then presented (the…
Descriptors: Correlation, Error of Measurement, Guides, Item Analysis
Taylor, Anne P.; Helmstadter, G. C. – 1971
A pair comparison scale for measuring aesthetic judgment that could be used with four- and five-year-old children was developed by having art experts independently judge color slides representing a variety of stimuli for "aesthetic quality" on an eleven-point successive category scale. The scale was administered to forty children on two…
Descriptors: Art Appreciation, Childhood Attitudes, Childhood Interests, Pictorial Stimuli
Koos, Eugenia M. – 1970
Problems in assessing the validity and reliability of the Mid-Continent Regional Educational Laboratory (McREL) tests of the inquiry skills of biology students are discussed by reference to the first trial version of the first Explorations in Biology (EIB) booklets. Since students learn during the two parts of the test, coefficients of stability…
Descriptors: Biology, Evaluation, Inquiry, Measurement
Thomas, Charles R. – 1971
One hundred and fifteen first graders were randomly assigned to experimental and control groups. Experimental pupils used the Visual Tracking program, and the control pupils participated in directed listening activities in separate rooms. The teachers followed a weekly rotating schedule in supervising the groups. After 12 weeks of training,…
Descriptors: Eye Movements, Grade 1, Sex Differences, Silent Reading
Schmeiser, Cynthia Board; Whitney, Douglas R. – 1973
Violations of four selected principles of writing multiple-choice items were introduced into an undergraduate religion course mid-term examination. Three of the flaws significantly increased test difficulty. KR-20 values were lower for all of the tests containing the flawed items than for the "good" versions of the items but significantly so…
Descriptors: Item Analysis, Multiple Choice Tests, Research Reports, Test Construction
Young, Jon I. – 1972
Some theoretical concerns for competency-based evaluation instruments are discussed, and means of examining these instruments for validity and reliability are presented. The areas of concern include descriptions of the behavior, the level of response, and the nature of the evaluation. Two different types of instruments are examined to determine…
Descriptors: Evaluation Methods, Measurement Instruments, Models, Performance Tests
Modu, Christopher C. – 1972
The contribution of a 20-minute essay question, given as part of the one-hour achievement test in American History and Social Studies, to the pool of information available on a candidate from an all-objective examination of the College Board Admissions Testing Program is presented in this report. The study limits itself to a consideration of the…
Descriptors: Academic Achievement, American History, Essay Tests, Objective Tests
Wilmoth, Gregory H.; McFarland, Sam G. – 1976
Kohlberg's Moral Judgment Scale, Gilligan, et al.'s Sexual Moral Judgment Scale, Maitland and Goldman's Objective Moral Judgment Scale, and Hogan's Maturity of Moral Judgment Scale were examined for reliability and inter-scale relationships. All measures except the Objective Moral Judgment Scale had good reliabilities. The obtained relations…
Descriptors: Adults, Comparative Analysis, Correlation, Moral Development
Schlenker, Richard M.
Sixty-nine students in grades 9, 10, and 11 were tested with three of Viktor Lowenfeld's visual-haptic tests in an attempt to ascertain whether students at these levels segregated in a fashion similar to Lowenfeld's sample. Respondents were spread over the visual-haptic continuum as Lowenfeld suggested they should be. However, a large and…
Descriptors: Aptitude Tests, Perception Tests, Scoring, Secondary Education


