Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedAnd Others; Mann, Irene T. – Applied Psychological Measurement, 1979
Several methodological problems (particularly the assumed bipolarity of scales, instructions regarding use of the midpoint, and concept-scale interaction) which may contribute to a lack of precision in the semantic differential technique were investigated. Results generally supported the use of the semantic differential. (Author/JKS)
Descriptors: Analysis of Variance, Computer Assisted Testing, Higher Education, Rating Scales
Spille, Henry; And Others – New Directions for Experiential Learning, 1980
Five contributors identify essential means for the college to assure high standards, quality control, and consistency in assessing the prior learning of adults. Types of learning include work or military experience and life experience; assessment methods include examination, portfolio examination, and competence-based programs. (Author/MSE)
Descriptors: Academic Standards, Adult Education, College Credits, Educational Quality
Peer reviewedBoyd, Marcia A.; And Others – Journal of Dental Education, 1980
The Dental Aptitude Test (DAT) in Canada and the Dental Admission Test (DAT) in the United States are discussed. A history of the DAT, its composition, underlying concepts, and the problems of validation and guidelines for its uses are presented. A review of the literature is provided. (Author/MLW)
Descriptors: Admission (School), Admission Criteria, Aptitude Tests, College Entrance Examinations
Peer reviewedVandivier, Phillip L.; Vandivier, Stella Sue – Educational Forum, 1979
Arguments and prejudices against the use of individually administered intelligence tests are considered and compared with possible values that may be obtained. Cautions about test score interpretation are discussed. Implications of abolishing intelligence testing are considered and recommendations for effective testing policies are presented. (CTM)
Descriptors: Academic Achievement, Diagnostic Tests, Elementary Secondary Education, Intelligence
Peer reviewedHarasym, P. H.; And Others – Evaluation and the Health Professions, 1980
Coded, as opposed to free response items, in a multiple choice physiology test had a cueing effect which raised students' scores, especially for lower achievers. Reliability of coded items was also lower. Item format and scoring method had an effect on test results. (GDC)
Descriptors: Achievement Tests, Comparative Testing, Cues, Higher Education
Peer reviewedAlbanese, Mark A. – Journal of Medical Education, 1979
Results of a study involving pathology students suggest that there is significant cluing in multiple-true-false test questions that use secondary responses to represent combinations of the primary response (e.g., "Mark B if only 1 and 3 are correct"). Thus test scores are artificially inflated and test reliability is lowered. (JMD)
Descriptors: Allied Health Occupations Education, Cues, Higher Education, Medical Education
Ojo, Folayan – Bulletin of the Association of African Universities, 1976
The reliability of Nigeria's entry qualification examinations as a predictor of success at the university level is examined. Results indicate a positive correlation in the science-based fields and very low predictability in the social sciences. (JMF)
Descriptors: Academic Achievement, Academic Standards, Admission Criteria, African Culture
Tinari, Frank D. – Improving College and University Teaching, 1979
Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)
Descriptors: College Instruction, Computer Programs, Discriminant Analysis, Economics Education
Peer reviewedGreene, Roger L.; And Others – Journal of Personality Assessment, 1979
Students' ability to validate results of their psychological tests was examined. Seniors and graduate students could reliably select their profiles from the California Psychological Inventory (CPI), while college sophomores could not. College sophomores could select their Differential Aptitudes Test (DAT) profiles more confidently than their CPI…
Descriptors: Academic Aptitude, Age Differences, Aptitude Tests, Graduate Students
Peer reviewedLinn, Marcia C.; Rice, Marian – Journal of Educational Measurement, 1979
The Springs task, an individually administered measure of ability to criticize and control experiments, is described. The task has characteristics similar to Inhelder and Piaget's Bending Rods task; it yields scores on naming variables, controlling variables, and analyzing experiments. (See also RIE: ED 163 092.) (JKS)
Descriptors: Cognitive Processes, Cognitive Tests, Critical Thinking, Developmental Stages
Peer reviewedSeidman, Edward; And Others – Journal of Educational Psychology, 1979
The development and validation of three instruments for the multidimensional assessment of a child's classroom behavior are described. The multidimensional nature, internal consistency, and test-retest properties of scales depicting teacher-, peer-, and self-rated behavior are explicated. Principal-components analyses demonstrate the convergent…
Descriptors: Behavior Rating Scales, Factor Structure, Peer Evaluation, Primary Education
Peer reviewedHosley, Deborah; Meredith, Keith – TESOL Quarterly, 1979
This study provides validity information for the Test of English as a Foreign Language (TOEFL) by examining some of its inter- and intra-test correlates. (CFM)
Descriptors: English (Second Language), Factor Analysis, Foreign Students, Higher Education
Peer reviewedLynch, Brian K. – Language Testing, 1997
Addresses the question of whether any test can be defended as ethical or moral. Defines ethicality in terms of issues such as harm, consent, confidentiality of data, and fairness and presents frameworks for determining equity of educational opportunity. An assessment project in Australia is examined in relation to these concerns. (24 references)…
Descriptors: Elementary Education, Elementary School Students, Equal Education, Ethics
Drabenstott, Karen M.; Weller, Marjorie S. – Proceedings of the ASIS Annual Meeting, 1996
Describes the comparative approach to system evaluation used in a project which administered an online retrieval test to an experimental online catalog to produce data for evaluating effectiveness of a new subject access design. Discusses efforts used to ensure data reliability, and strategies the researchers can use to improve on the data…
Descriptors: Comparative Analysis, Computer Assisted Testing, Computer System Design, Data Analysis
Peer reviewedOlney, Cynthia; Grande, Steve – Michigan Journal of Community Service Learning, 1995
Describes the Scale of Service Learning Involvement, developed to validate a service-learning model of developmental processes experienced by students engaged in community volunteer work, from sporadic involvement to internalization of social responsibility, and to assess student outcomes. Reliability, concurrent validity, and contrasting group…
Descriptors: Attitude Change, Citizenship Responsibility, Higher Education, Measurement Techniques


