Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewed: Gardner, P. L.; Taylor, S. M. – British Journal of Educational Psychology, 1980
Transmission-interpretation (T-I) has been conceptualized as an important dimension of teacher verbal behavior. This study measured teacher T-I through a Likert scale completed by students. The T-I scale (a copy is appended) correlated 0.83 with the Student Perception of Teacher Style scale, measuring teacher directiveness. (Author/SJL)
Descriptors: Classroom Communication, Rating Scales, Student Evaluation of Teacher Performance, Teacher Characteristics
Cervero, Ronald M. – Adult Education, 1980
Researchers reanalyzed the original Adult Performance Level Test survey data for the test's validity and reliability, and they concluded that (1) the test is not content valid because it assumes that functional competence can be logically defined and adult success accurately measured; and (2) the test is a valid measure of verbal, writing, and…
Descriptors: Adult Basic Education, Basic Skills, Factor Analysis, Functional Literacy
Peer reviewed: Bohning, Gerry – Psychology in the Schools, 1980
An item analysis profile sheet to accompany the Slosson Intelligence Test (SIT) is helpful in providing a functional test interpretation. The lack of recorded technical and statistical information is a serious concern. Without such information, a practitioner could not use the Item Analysis of SIT with confidence. (Author)
Descriptors: Children, Educational Diagnosis, Elementary Secondary Education, Intelligence Tests
Peer reviewed: Richmond, Bert O.; Horn, William R. – Psychology in the Schools, 1980
Describes a new instrument designed for brief administration, to be educationally relevant, and to measure five domains of adaptive behavior: language development, independent functioning, family role performance, economic-vocational activity, and socialization. An initial study indicates the instrument has high reliability. (Author)
Descriptors: Adjustment (to Environment), Behavior Rating Scales, Children, Diagnostic Tests
Peer reviewed: Reynolds, Cecil R.; And Others – Psychology in the Schools, 1980
Contrary to findings with older children, no sex differences occurred in scoring on the anxiety scale. Kindergarten children generally scored higher on the anxiety scale than did older children. Lie scale scores were comparable to those of other primary grade children. (Author)
Descriptors: Anxiety, Educational Diagnosis, Elementary Education, Emotional Problems
Peer reviewed: Forsyth, Robert A.; Spratt, Kevin F. – Journal of Educational Measurement, 1980
The effects of two item formats on item difficulty and item discrimination indices for mathematics problem solving multiple-choice tests were investigated. One format required identifying the proper "set-up" for the item; the other format required complete solving of the item. (Author/JKS)
Descriptors: Difficulty Level, Junior High Schools, Multiple Choice Tests, Problem Solving
Peer reviewed: Lowman, Joseph – Journal of Personality Assessment, 1980
Three studies demonstrate that the Inventory of Family Feelings, a measure of family affective structure, has high reliability and construct and concurrent validity. It is appropriate for affective comparisons by age, sex, and ordinal position of children and for measuring change after family or marital therapy, or after predictable stress…
Descriptors: Adjustment (to Environment), Affective Measures, Family Problems, Family Relationship
Peer reviewed: Kozeki, Bela – High School Journal, 1980
The Junior Index of Motivation (JIM) Scale was translated and administered to 363 Hungarian students, aged 11-14. Teacher ratings of individuals' motivation and data on achievement, creativity, and intelligence were also collected. Results confirmed the JIM Scale's reliability and showed academic motivation correlates similar to those found by…
Descriptors: Correlation, Cross Cultural Studies, Junior High School Students, Junior High Schools
Peer reviewed: Page, Roger; Bode, James – Educational and Psychological Measurement, 1980
The Ethical Reasoning Inventory (ERI) is an objective test derived from Kohlberg's Moral Judgment Interview. It correlated higher with Kohlberg, and has higher internal consistency, than the Defining Issues Test and the Moral Judgment Scale. (CP)
Descriptors: Abstract Reasoning, Higher Education, Item Analysis, Moral Issues
Froehlich, Loren H.; Jepson, David A. – Measurement and Evaluation in Guidance, 1980
The Self-Perception Inventory (SPI) was administered to a group of students in junior high and later when they were in senior high. SPI scores seem to be rather unstable, reflect little change, and have limited usefulness for evaluation purposes. (Author)
Descriptors: High School Students, Junior High School Students, Secondary Education, Self Concept
Peer reviewed: Mishra, Shitala P. – Psychology in the Schools, 1981
A study with Mexican-American children showed the internal consistency reliability coefficients for the Wide Range Achievement Test (WRAT) were high and comparable to those reported in the WRAT manual. A high relationship was found between WRAT and Metropolitan Achievement Test scores. WRAT meets reliability and validity requirements with Mexican…
Descriptors: Achievement Tests, Cultural Influences, Elementary School Students, Intermediate Grades
Peer reviewed: Brinker, Richard P.; Goldbart, Juliet – British Journal of Psychology, 1981
Social and communicative behavior of 28 preschoolers, some developmentally delayed, was classified under various conditions by four observers. Inter-observer agreements from observations of developmentally delayed and normal children were compared. No significant differences were found. Results are discussed in terms of reliability problems in…
Descriptors: Behavior Rating Scales, Child Language, Classroom Observation Techniques, Communication Research
Peer reviewed: Ward, James Gordon – Peabody Journal of Education, 1981
Teachers need valid information to judge the types of programs, instruction, and colleges best suited to students. Teachers appear to support the use of standardized tests to provide some of that information. Abolishing such tests may lead to dependence on more subjective measures, resulting in inequities in placement and selection. (FG)
Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, Standardized Tests
Tosti, Donald T. – Training and Development Journal, 1979
Explores need for development of validation and certification procedures. Performance measures used for these procedures must come from on-job experiences. Validity of measures depends on similarity of participant's performance to job exercises. Outlines how to develop reliable job performance measures necessary for job certification and…
Descriptors: Certification, Guidelines, Opinions, Performance Criteria
Peer reviewed: Yelvington, James Yowell; Brady, Raymond G. – Community/Junior College Research Quarterly, 1979
Assesses the applicability of corrective feedback (CF) testing, which allows multiple attempts to respond to a test item, to the community college classroom. Compares CF testing to single answer testing, especially with regard to reliability, equitability, and effect on student motivation. (DD)
Descriptors: Community Colleges, Educational Testing, Feedback, Multiple Choice Tests


