Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedCampbell, N. Jo; And Others – Elementary School Guidance and Counseling, 1985
Two levels of the Students' Attitudes Toward the Mentally Handicapped (SAMH) inventory were developed and administered to fourth and seventh graders (N=306). Results suggested that internal reliability estimates of responses on subscales and total scale were acceptable; the SAMH was useful in assessing student attitudes toward mentally handicapped…
Descriptors: Attitudes toward Disabilities, Childhood Attitudes, Intermediate Grades, Junior High Schools
Peer reviewedAllen, Dennis L. – Journal of Medical Education, 1985
A current study of major changes in the behavioral science curriculum in a family practice residency is reported. The survey instrument appears to be highly reliable for assessing graduates' perspectives on specific aspects of residency training in the behavioral sciences. (MLW)
Descriptors: Behavioral Sciences, Curriculum Development, Curriculum Evaluation, Family Practice (Medicine)
Peer reviewedJohnson, E. G. – Assessment and Evaluation in Higher Education, 1985
The 15 components used for assessment in an introductory psychology course were examined to determine their relative value as predictors of overall performance. Multiple-choice tests were consistently found to be more effective than essay examinations, but arguments are presented for the retention of essays, with suggestions for their improvement.…
Descriptors: Foreign Countries, Higher Education, Introductory Courses, Predictor Variables
Peer reviewedPerkins, Kyle; Miller, Leah D. – Language Testing, 1984
Describes a study which submitted data from a multiple-choice English as a second language reading comprehension test to classical test theory item analysis and latent trait measurement. The purpose was to identify weak items and to compare the number of weak items indicated by the two different approaches. (SED)
Descriptors: English (Second Language), Language Tests, Latent Trait Theory, Reading Comprehension
Peer reviewedCameron, L. A.; Heywood, J. – College Teaching, 1985
A study of the comparability of results of two techniques of testing, the traditional approach and that of giving the students the questions before the examination, revealed little difference in the two approaches' results, and supports the use of "prior notice" to reduce test anxiety. (MSE)
Descriptors: College Instruction, Comparative Analysis, Higher Education, Preservice Teacher Education
Peer reviewedCook, Desmond L.; Loadman, William E. – Educational and Psychological Measurement, 1984
A new instrument designed to assess perceptions and attitudes toward proposal development and funding was administered to 416 American Educational Research Association members. The instrument demonstrated good scaling properties and high internal consistency. Discriminations beyond the chance level were obtained on both sex and proposal…
Descriptors: Adults, Attitude Measures, Attitudes, Educational Researchers
Peer reviewedRush, R. Timothy – Reading Teacher, 1985
Discusses the characteristics of three popular readability formulas: the Dale-Chall, the Fry Graph, and the Spache. Describes text based and reader/text based alternatives. Offers appropriate applications of each form of assessment. (FL)
Descriptors: Computer Assisted Testing, Elementary Education, Evaluation Methods, Readability
Peer reviewedMcCarthy, Constance – College and Research Libraries, 1986
This essay addresses issue of uniformity or consistency in application of subject headings--as distinct from consistency among headings--to books on any given topic. Discussion covers purpose of subject headings or descriptors, reliability of choices made by catalogers when assigning subject headings, and card and online catalogs. (18 references)…
Descriptors: Cataloging, Databases, Indexing, Information Retrieval
Peer reviewedPowers, Stephen; Gose, Kenneth F. – Educational and Psychological Measurement, 1986
The Maslach Burnout Inventory was administered to 72 upper-level and graduate students. Item-responses were intercorrelated and subjected to factor analyses. Support was obtained for the three hypothesized factors: emotional exhaustion, depersonalization, and personal accomplishment. (Author/GDC)
Descriptors: Affective Measures, Attitude Measures, Burnout, Factor Structure
Peer reviewedIrvine, Jacqueline Jordan – Journal of Educational Psychology, 1986
Students' initiating behaviors, teachers' verbal feedback, and students' available response opportunities were studied in 63 classrooms in relation to student race, sex, and grade level, using a modified Brophy-Good Observation System. Results indicated that male students initiate more positive and negative interactions with teachers than do…
Descriptors: Analysis of Variance, Elementary Education, Feedback, Interrater Reliability
Peer reviewedSchaeffer, Gary A.; And Others – Evaluation Review, 1986
The reliability of criterion-referenced tests (CRTs) used in health program evaluation can be conceptualized in different ways. Formulas are presented for estimating appropriate standard error of measurement (SEM) for CRTs. The SEM can be used in computing confidence intervals for domain score estimates and for a cut-score. (Author/LMO)
Descriptors: Accountability, Criterion Referenced Tests, Cutting Scores, Error of Measurement
Peer reviewedDaRosa, Debra A.; And Others – Evaluation and Program Planning, 1985
Medical students' performance on simulated patient practical examinations were compared to faculty ratings of the surgical clerks and awards of honor ratings. Faculty ratings were correlated with objective measures of the criteria on the exams. The performance of honor, versus non-honor students was higher on the practical exam. (Author/GDC)
Descriptors: Correlation, Evaluation Criteria, Evaluation Methods, Higher Education
Peer reviewedMillman, Jason; And Others – Research in Higher Education, 1983
Two studies examining the effect of grade inflation on the piling up of grades in fewer grade categories and on the reliability of grade point averages found that reliability suffered significantly only in graduate study in which almost all grades were in two categories, A and B. Some other small effects were found in different rating scales. (MSE)
Descriptors: Comparative Analysis, Evaluation Criteria, Grade Inflation, Grade Point Average
Peer reviewedRogosa, David R.; Willett, John B. – Journal of Educational Measurement, 1983
Demonstrating good reliability for the difference score in measurement, the results of this study indicate that the difference score is often highly reliable when the correlation between true change and true initial status is nonnegative. In general, when individual differences in true change are appreciable, the difference score shows strong…
Descriptors: Achievement Gains, Error of Measurement, Individual Differences, Measurement Techniques
Peer reviewedPitishkin-Potanich, V. – Higher Education in Europe, 1983
An experiment testing the reliability of student self-evaluation is reported, and issues in using grades as incentives and in measuring academic achievement objectively are discussed, emphasizing the qualitative aspects of achievement as well as the quantitative. (MSE)
Descriptors: Academic Achievement, College Students, Foreign Countries, Grades (Scholastic)


