Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedAlbanese, Mark A. – Journal of Medical Education, 1979
Results of a study involving pathology students suggest that there is significant cluing in multiple-true-false test questions that use secondary responses to represent combinations of the primary response (e.g., "Mark B if only 1 and 3 are correct"). Thus test scores are artificially inflated and test reliability is lowered. (JMD)
Descriptors: Allied Health Occupations Education, Cues, Higher Education, Medical Education
Ojo, Folayan – Bulletin of the Association of African Universities, 1976
The reliability of Nigeria's entry qualification examinations as a predictor of success at the university level is examined. Results indicate a positive correlation in the science-based fields and very low predictability in the social sciences. (JMF)
Descriptors: Academic Achievement, Academic Standards, Admission Criteria, African Culture
Tinari, Frank D. – Improving College and University Teaching, 1979
Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)
Descriptors: College Instruction, Computer Programs, Discriminant Analysis, Economics Education
Peer reviewedGreene, Roger L.; And Others – Journal of Personality Assessment, 1979
Students' ability to validate results of their psychological tests was examined. Seniors and graduate students could reliably select their profiles from the California Psychological Inventory (CPI), while college sophomores could not. College sophomores could select their Differential Aptitudes Test (DAT) profiles more confidently than their CPI…
Descriptors: Academic Aptitude, Age Differences, Aptitude Tests, Graduate Students
Peer reviewedLinn, Marcia C.; Rice, Marian – Journal of Educational Measurement, 1979
The Springs task, an individually administered measure of ability to criticize and control experiments, is described. The task has characteristics similar to Inhelder and Piaget's Bending Rods task; it yields scores on naming variables, controlling variables, and analyzing experiments. (See also RIE: ED 163 092.) (JKS)
Descriptors: Cognitive Processes, Cognitive Tests, Critical Thinking, Developmental Stages
Peer reviewedSeidman, Edward; And Others – Journal of Educational Psychology, 1979
The development and validation of three instruments for the multidimensional assessment of a child's classroom behavior are described. The multidimensional nature, internal consistency, and test-retest properties of scales depicting teacher-, peer-, and self-rated behavior are explicated. Principal-components analyses demonstrate the convergent…
Descriptors: Behavior Rating Scales, Factor Structure, Peer Evaluation, Primary Education
Peer reviewedHosley, Deborah; Meredith, Keith – TESOL Quarterly, 1979
This study provides validity information for the Test of English as a Foreign Language (TOEFL) by examining some of its inter- and intra-test correlates. (CFM)
Descriptors: English (Second Language), Factor Analysis, Foreign Students, Higher Education
Peer reviewedLynch, Brian K. – Language Testing, 1997
Addresses the question of whether any test can be defended as ethical or moral. Defines ethicality in terms of issues such as harm, consent, confidentiality of data, and fairness and presents frameworks for determining equity of educational opportunity. An assessment project in Australia is examined in relation to these concerns. (24 references)…
Descriptors: Elementary Education, Elementary School Students, Equal Education, Ethics
Drabenstott, Karen M.; Weller, Marjorie S. – Proceedings of the ASIS Annual Meeting, 1996
Describes the comparative approach to system evaluation used in a project which administered an online retrieval test to an experimental online catalog to produce data for evaluating effectiveness of a new subject access design. Discusses efforts used to ensure data reliability, and strategies the researchers can use to improve on the data…
Descriptors: Comparative Analysis, Computer Assisted Testing, Computer System Design, Data Analysis
Peer reviewedOlney, Cynthia; Grande, Steve – Michigan Journal of Community Service Learning, 1995
Describes the Scale of Service Learning Involvement, developed to validate a service-learning model of developmental processes experienced by students engaged in community volunteer work, from sporadic involvement to internalization of social responsibility, and to assess student outcomes. Reliability, concurrent validity, and contrasting group…
Descriptors: Attitude Change, Citizenship Responsibility, Higher Education, Measurement Techniques
Peer reviewedPomplun, Mark; Omar, Md Hafidz – Educational and Psychological Measurement, 1997
Four threats to validity of an alternative objective test item format, the multiple-mark format, were studied with data from a state-mandated assessment with about 30,000 students at each of three grade levels. Reliability and validity coefficients show that the format has promise as an objective format that can be aligned with new curriculum…
Descriptors: Curriculum Development, Elementary School Students, Elementary Secondary Education, Objective Tests
Peer reviewedGreaves, Daryl; Poole, Charles – Journal of Intellectual and Developmental Disability, 1996
Fifty-five mothers of young children with Down syndrome completed the Parenting Stress Index. Low internal reliability coefficient for the Adaptability/Plasticity Child Domain subscale was found. Factor analysis of the scale found that the Adaptability/Plasticity factor did not perform as a unidimensional structure, but provided information on…
Descriptors: Adaptive Behavior (of Disabled), Adjustment (to Environment), Behavior Patterns, Coping
Peer reviewedOno, Yoshiro – Research in Developmental Disabilities, 1996
Evaluation of the factor validity and reliability of the Aberrant Behavior Checklist (Japanese version) with 322 subjects (mean age 30) with moderate to profound mental retardation found most items loading on the same factors as in the original factor solution, high coefficient alphas across 5 subscales, high test-retest reliability, and…
Descriptors: Behavior Problems, Behavior Rating Scales, Check Lists, Factor Analysis
Peer reviewedFunderburk, Beverly W.; Eyberg, Sheila M.; Rich, Brendan A.; Behar, Lenore – Early Education and Development, 2003
Examined psychometric properties of the Eyberg Child Behavior Inventory, Preschool Behavior Questionnaire (PBQ) (Parent and Teacher versions), and Sutter-Eyberg Student Behavior Inventory (SESBI) with a sample of 2- to 6-year-olds. Established internal consistency for SESBI and the PBQ-Teacher. Obtained evidence for long-term stability on the…
Descriptors: Behavior Rating Scales, Construct Validity, Early Childhood Education, Measures (Individuals)
Peer reviewedNaizer, Gilbert L. – Action in Teacher Education, 1997
This study evaluated the validity and reliability of performance portfolios in a preservice elementary mathematics/science methods class, assessing students' domain-strategic and general-learning strategic knowledge. Results supported performance portfolios as a valid method of assessing desired abilities of preservice teachers that can be…
Descriptors: Elementary Education, Evaluation Methods, Higher Education, Mathematics Education


