Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedMatthews, Margaret – ELT Journal, 1990
Discusses problems with the current trend in using behavior trait-based criteria to assess English-as-a-Second-Language productivity skills, and describes alternatives to such testing that involve the matching of linguistic tasks against nonlinguistic criteria. (Author/CB)
Descriptors: Communicative Competence (Languages), English (Second Language), Evaluation Criteria, Language Proficiency
Peer reviewedBurchard, Kenneth W.; Rowland-Morin, Pamela A. – Academic Medicine, 1990
Development of a behavioral test of surgeons' interpersonal skills involved applying a rating scale to videotape recordings of physician interactions with a gall bladder patient, at three intervals. Quality of communication was found to vary over the intervals. The instrument used was more sensitive to variation than a comparison scale.…
Descriptors: Evaluation Methods, Higher Education, Interpersonal Competence, Measurement Techniques
Peer reviewedCrowley, Mary L. – Journal for Research in Mathematics Education, 1990
Provides an alternative analysis of the reliability associated with the van Hiele Geometry Test based on the assumption that the norm-referenced reliability coefficients provided by the developers were inappropriate. Discusses the agreement coefficient and the kappa coefficient. (YP)
Descriptors: Criterion Referenced Tests, Geometric Concepts, Geometry, Mathematical Concepts
Peer reviewedAngoff, William H. – Applied Measurement in Education, 1988
Suggestions are provided for future research in item bias detection, reduction of essay-reader variation in setting cut-score levels, and limitations of equating theory. (TJH)
Descriptors: College Entrance Examinations, Cutting Scores, Equated Scores, Essay Tests
Peer reviewedAndrews, Jac; Janzen, Henry – Psychology in the Schools, 1988
Presents globally oriented scoring sheet, reference guide, and rating scale for facilitating clinical hypotheses from children's Kinetic School Drawings (KSDs) and further empirical evaluations of KSD technique. Provides information on instrument construction and preliminary findings in terms of procedures' reliability and discriminant validity.…
Descriptors: Elementary Secondary Education, Evaluation Methods, Foreign Countries, Freehand Drawing
Seldin, Peter – AGB Reports, 1988
Ten guidelines for successful administrator evaluation programs based on current research and interviews with campus administrators are presented. They concern the evaluation's purpose and approach and the role of the administrator in the process. (MSE)
Descriptors: Administrator Evaluation, Administrator Role, College Administration, Cost Effectiveness
Abedi, Jamal; Bruno, James – Journal of Computer-Based Instruction, 1989
Reports the results of several test-reliability experiments which compared a modified confidence weighted-admissible probability measurement (MCW-APM) with conventional forced choice or binary type (R-W) test scoring methods. Psychometric properties using G theory and conventional correlational methods are examined, and their implications for…
Descriptors: Ability Grouping, Analysis of Variance, Computer Assisted Testing, Correlation
Peer reviewedMeier, Scott T. – Computers in Human Behavior, 1988
Description of the development of a theoretically based instrument--the Computer Aversion Scale (CAVS)--to measure negative reactions to computers, focuses on the use of microcomputers in the mental health field. Previous efforts to assess computer anxiety are reviewed, and studies testing the reliability and validity of the CAVS are described.…
Descriptors: Computer Assisted Testing, Concurrent Validity, Correlation, Factor Analysis
Peer reviewedGothberg, Helen M.; Aleamoni, Lawrence M. – Journal of Education for Library and Information Science, 1988
Describes an objective test used as the comprehensive examination in a graduate library school and discusses its advantages over essay tests. The topics covered include test construction, the use of item analysis for scoring and test revision, and student reactions to the objective test. (1 reference) (CLB)
Descriptors: Case Studies, Graduate Study, Graduation Requirements, Higher Education
Peer reviewedBrown, James Dean – Language Testing, 1988
The reliability and validity of a cloze procedure used as an English-as-a-second-language (ESL) test in China were improved by applying traditional item analysis and selection techniques. The 'best' test items were chosen on the basis of item facility and discrimination indices, and were administered as a 'tailored cloze.' 29 references listed.…
Descriptors: Adaptive Testing, Cloze Procedure, English (Second Language), Foreign Countries
Peer reviewedKluever, Raymond C.; And Others – Journal of Educational Computing Research, 1994
Describes a study that investigated the reliability, factorial validity, and fit to a unidimensional model of the Computer Attitude Scale based on pretests and posttests from 265 teachers who participated in training on classroom applications of computer hardware and software. Discussion includes computer anxiety, efficiency, liking, and…
Descriptors: Computer Anxiety, Computer Assisted Instruction, Courseware, Elementary Secondary Education
Peer reviewedWiecha, John M.; And Others – Journal of Community Health, 1994
Vietnamese high school students completed a food frequency questionnaire (FFQ) and completed daily diet reports for seven weeks. Data from the FFQ were compared to the food reports. The results indicated a few simple FFQ items, particularly for indicator foods such as rice, were reliable for dietary assessment for that population. (SM)
Descriptors: Asian Americans, Cultural Influences, Dietetics, Eating Habits
Peer reviewedGredler, Margaret E. – Studies in Educational Evaluation, 1995
Different meanings of portfolio assessment are reviewed, and potential applications to program evaluation are explored. At present, portfolio assessments are not recommended as the primary source of evidence about the attainment of program goals in evaluations that compare curricula or programs because of the lack of validity and reliability…
Descriptors: Alternative Assessment, Comparative Analysis, Curriculum, Educational Assessment
Peer reviewedSevery, Lawrence J.; And Others – NACADA Journal, 1994
This study compared two instruments for evaluating the performance of individual college-level academic advisors: (1) a student form assessing abilities in listening and other counseling skills, knowledge, and technology use; and (2) a professional scale assessing task competence, other-orientation, and professional networking. Both showed high…
Descriptors: Academic Advising, Comparative Analysis, Competence, Construct Validity
Vane-Tempest, Stewart – Information Management & Technology, 1995
Presents a guide for selecting an optional disc system. Highlights include storage hierarchy; standards; data life cycles; security; implementing an optical jukebox system; optimizing the system; performance; quality and reliability; software; cost of online versus near-line; and growing opportunities. Sidebars provide additional information on…
Descriptors: Computer Security, Computer Selection, Computer System Design, Data


