[Peer reviewed] Bruininks, Robert H.; Lucker, William G. – Journal of Reading Behavior, 1970
Descriptors: Disadvantaged Youth, Economically Disadvantaged, Intelligence Tests, Longitudinal Studies
Brigham, Bruce W. – J Reading Spec, 1970
Descriptors: Measurement Instruments, Performance Factors, Reading Achievement, Reading Tests
Dulin, Kenneth La Marr – J Reading, 1969
Descriptors: Abstract Reasoning, Context Clues, Grade 10, Middle Class
[Peer reviewed] Preece, Peter F. W. – Educational and Psychological Measurement, 1982
The validity of various reliability-corrected procedures for adjusting for initial differences between groups in uncontrolled studies is established for subjects exhibiting linear fan-spread growth. The results are then extended to a nonlinear model of growth. (Author)
Descriptors: Achievement Gains, Analysis of Covariance, Error of Measurement, Hypothesis Testing
[Peer reviewed] Reece, Mary J.; Gable, Robert K. – Educational and Psychological Measurement, 1982
A 10-item Attitudes Toward Computers instrument was developed to measure students' attitudes toward the use of computers. A factorial validity study revealed one identifiable factor dimension, entitled General Attitude Toward Computers, with an estimated alpha internal consistency reliability of .87. (Author/PN)
Descriptors: Attitude Measures, Computer Assisted Instruction, Computers, Factor Structure
[Peer reviewed] Zughoul, Muhammad R.; Kambal, M. Osman – International Review of Applied Linguistics in Language Teaching, 1983
Based on the responses of 50 ESL instructors to a composition-scoring exercise, a detailed method of scoring compositions was developed that divides the writing into basic components (structure, content, vocabulary, organization, and mechanics) and provides a scoring mechanism for each component for each of three competency levels. (MSE)
Descriptors: English (Second Language), Evaluation Criteria, Evaluation Methods, Measurement Techniques
[Peer reviewed] Piercy, Fred P.; And Others – Journal of Marital and Family Therapy, 1983
Reports the development of a scale for evaluating family therapist skills. All items discriminated significantly between videotaped segments of effective and ineffective family therapist skills and between experienced and inexperienced family therapists. Interrater reliability and internal consistency of the categories were acceptable. The scale…
Descriptors: Counseling Techniques, Counselor Evaluation, Counselor Performance, Counselor Training
[Peer reviewed] Carp, Frances M.; Carp, Abraham – Journal of Gerontology, 1983
Studied instruments that measure well-being, life satisfaction, and morale for structural stability across age and gender in two samples of adults and older adults. Analysis showed four factors defining the dimensions underlying these measures, and these factors were constant across age and gender. (WAS)
Descriptors: Adults, Affective Measures, Age Differences, Factor Structure
Ibe, Milagros D. – Journal of Science and Mathematics Education in Southeast Asia, 1983
This study investigated the effects of specific criteria for marking a test on its reliability and validity. Eight algebra word problems were administered to grade 10 students. The objectivity of scoring criteria improved the reliability of the test, but did not affect its validity. (MNS)
Descriptors: Algebra, Educational Research, Grade 10, Mathematics Instruction
[Peer reviewed] Aiken, Lewis R. – Educational and Psychological Measurement, 1983
Each of six forms of a 10-item teacher evaluation rating scale, having two to seven response categories per form, was administered to over 100 college students. Means of item responses and item variances increased with the number of response categories. Internal consistency of total scores did not change systematically. (Author/PN)
Descriptors: College Students, Higher Education, Item Analysis, Rating Scales
[Peer reviewed] Hunt, D. Daniel; And Others – Journal of Medical Education, 1982
The Confidence in Interviewing Scale, developed to test medical students' interviewing confidence by asking for students' estimations of their ability to handle challenging situations, was tested at the University of Washington. Evidence of both reliability and validity was found, but comparison with an external observer's ratings is advocated. (MSE)
Descriptors: Higher Education, Interviews, Measurement Techniques, Medical Case Histories
Peek, George S. – Improving College and University Teaching, 1982
An experimental program in jury grading of freshman composition at Arkansas State University, abandoned after charges of racial bias and infringement on academic freedom, is described and evaluated based on comparison of jury grades and standardized test scores. The consensus of instructors is that the method was objective and consistent. (MSE)
Descriptors: Academic Freedom, College Freshmen, College Instruction, English Instruction
[Peer reviewed] Amberg, Jay – American Scholar, 1982
The fact that the Scholastic Aptitude Test (SAT) is susceptible to coaching does not mean it is a poor test. The abilities measured by it are acquired, apart from test-wiseness. Even though some uses of the scores in admissions may be discriminatory, the test itself is fair, uniform, and judiciously administered. (MSE)
Descriptors: Admission Criteria, Advance Organizers, College Entrance Examinations, Higher Education
[Peer reviewed] Vockell, Edward L. – Science Education, 1982
A test for measuring attitudes toward animal life (Fireman Test) was developed. The test was determined to be reliable, valid, free from social-response biases, easy to administer and score, and potentially useful in ascertaining whether changes in attitudes have occurred as a result of some planned intervention. (Author/JN)
Descriptors: Animals, Attitude Measures, Elementary Education, Elementary School Science
Kronowitz, Ellen; Finney, Victoria – California Journal of Teacher Education, 1983
Elementary school pupils judged their student teachers' performances in the areas of planning, instructional skill, evaluation, and behavior, and in classroom organization and control. Their evaluations were compared with adult observers' ratings. Results indicate that elementary school students can assess performance and discriminate among…
Descriptors: Elementary Education, Elementary School Students, Evaluation Criteria, Evaluation Methods


