Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedBennett, Randy Elliot; And Others – Special Services in the Schools, 1988
A study of Scholastic Aptitude Test scores for nine groups of students with disabilities taking special test administrations found differences in score levels among disability groups but no significant differences of measurement precision and no evidence of disadvantage for disabled students. (Author/MSE)
Descriptors: Adaptive Testing, College Entrance Examinations, Comparative Analysis, Disabilities
Peer reviewedBoud, David – Assessment and Evaluation in Higher Education, 1989
Use of student self-evaluation in grading is recommended, and several ways in which self-evaluation may be incorporated into the grading process are described, including grades justified and moderated by faculty and/or peers, criteria generated by peers, weighting, combining self-evaluation with demonstration of competence, and learning and grade…
Descriptors: College Students, Evaluation Methods, Grading, Higher Education
Peer reviewedCervantes, Richard C.; And Others – Hispanic Journal of Behavioral Sciences, 1990
Examines reliability and validity of two versions of Hispanic Stress Inventory (HSI), a new instrument to assess psychosocial stress among Hispanic adults. Subscale scores found to correlate highly with criterion measures of distress. Tests showed internal consistency and supported HSI reliability. Need for further evaluation discussed. (TES)
Descriptors: Adults, Factor Analysis, Hispanic Americans, Measurement Techniques
Peer reviewedLong, Edgar C. J. – Educational and Psychological Measurement, 1990
The development, preliminary assessment, and uses of two paper-and-pencil measures of dyadic perspective-taking are described. A literature review revealed 23 perspective-taking type items each for the Self Dyadic Perspective-Taking Scale and the Other Dyadic Perspective-Taking Scale. Results for 277 college students indicate that the tests are…
Descriptors: College Students, Comparative Testing, Higher Education, Perspective Taking
Peer reviewedSchaeffer, Nora Cate – Journal of Marriage and the Family, 1989
Examined, through analysis of questions about conflict between separated parents (N=327), three hypotheses about the relationship among frequency and intensity response questions. Found associations among related intensity items possibly stronger than those among related frequency items; associated intensity items and associated frequency items…
Descriptors: Divorce, Measurement Objectives, Measurement Techniques, Reliability
Peer reviewedBrown, Jonathan R. – Language, Speech, and Hearing Services in Schools, 1989
The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
Descriptors: Elementary Secondary Education, Error of Measurement, Scores, Standardized Tests
Scott, Randall L.; And Others – American Journal on Mental Retardation, 1989
The psychometric integrity of a short form of the Questionnaire on Resources and Stress was evaluated, based on data provided by 66 mothers and 66 fathers of 33 handicapped and 33 nonhandicapped preschool children. Alpha reliability, factor structure, and construct validity analyses indicated that the measure has reasonable psychometric integrity.…
Descriptors: Comparative Analysis, Disabilities, Evaluation Methods, Fathers
Peer reviewedSciarone, A. G.; Schoorl, J. J. – Language Learning, 1989
Presents findings from an experiment that sought to determine the minimal number of blanks required to ensure parallelism in cloze tests, differing only in the point at which deletion starts. Results showed the required minimum depended on the scoring methods used, with exact-word tests requiring about 100 blanks and acceptable-word tests…
Descriptors: Cloze Procedure, Dutch, Indonesian, Reading Tests
Peer reviewedWatson, J. Allen; And Others – Journal of Educational Technology Systems, 1989
Reports on a computer-based research model that was designed to test family process variables. Integration with an existing family decision-making process model is described, the microcomputer/mainframe system is explained, and system reliability and validity are discussed in relation to traditional process variable research methodologies. (29…
Descriptors: Computer Networks, Computer Oriented Programs, Decision Making, Family Life Education
Peer reviewedShepard, Lorrie A. – Educational Leadership, 1989
In today's political climate, standardized tests are inadequate and misleading as achievement measures. Educators should employ a variety of measures, improve standardized test content and format, and remove incentives for teaching to the test. Focusing on raising test scores distorts instruction and renders scores less credible. Includes 13…
Descriptors: Academic Achievement, Elementary Secondary Education, Politics of Education, Scores
Peer reviewedChusmir, Leonard H. – Psychology: A Journal of Human Behavior, 1988
Calculated Cronbach's alpha coefficients for five recent studies (N=1,723) which used the Manifest Needs Questionnaire (MNQ) to measure needs for achievement, autonomy, affiliation, and dominance. Results showed acceptable levels of internal consistency for achievement and dominance needs. Found support for autonomy and affiliation needs with some…
Descriptors: Achievement Need, Affiliation Need, Individual Needs, Meta Analysis
Peer reviewedNewsham, Gwen S. – Canadian Modern Language Review, 1989
Information in published articles on communicative testing is examined and discussed from the point of view of a classroom teacher. The administrability, reliability, and validity of communicative language testing are highlighted. (MSE)
Descriptors: Communicative Competence (Languages), Language Tests, Research Utilization, Scholarly Journals
Peer reviewedvan den Bergh, Huub; Eiting, Mindert H. – Journal of Educational Measurement, 1989
A method of assessing rater reliability via a design of overlapping rater teams is presented. Covariances or correlations of ratings can be analyzed with LISREL models. Models in which the rater reliabilities are congeneric, tau-equivalent, or parallel can be tested. Two examples based on essay ratings are presented. (TJH)
Descriptors: Analysis of Covariance, Computer Simulation, Correlation, Elementary Secondary Education
Peer reviewedPowers, Donald E.; And Others – Journal of Educational Measurement, 1994
The effects on essay scores of intermingling handwritten and word-processed student essays were studied with 32 students who produced handwritten and word-processed essays. Essays were converted to the other format and rescored. Results reveal higher average scores for handwritten essays. Implications for scoring are considered. (SLD)
Descriptors: College Students, Computer Uses in Education, Essays, Handwriting
Peer reviewedJensen, Arthur R.; Weng, Li-Jen – Intelligence, 1994
The stability of psychometric "g," the general factor of intelligence, is investigated in simulated correlation matrices and in typical empirical data from a large battery of mental tests. "G" is robust and almost invariant across methods of analysis. A reasonable strategy for estimating "g" is suggested. (SLD)
Descriptors: Correlation, Estimation (Mathematics), Factor Analysis, Intelligence


