Publication Date
| In 2026 | 1 |
| Since 2025 | 168 |
| Since 2022 (last 5 years) | 1021 |
| Since 2017 (last 10 years) | 2336 |
| Since 2007 (last 20 years) | 6522 |
Descriptor
| Reliability | 9761 |
| Validity | 3866 |
| Foreign Countries | 2823 |
| Measures (Individuals) | 1892 |
| Correlation | 1522 |
| Factor Analysis | 1460 |
| Statistical Analysis | 1278 |
| Questionnaires | 1084 |
| Scores | 1064 |
| Student Attitudes | 1034 |
| Psychometrics | 979 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 181 |
| Practitioners | 101 |
| Teachers | 61 |
| Administrators | 42 |
| Policymakers | 33 |
| Students | 21 |
| Counselors | 10 |
| Media Staff | 5 |
| Community | 1 |
| Parents | 1 |
| Support Staff | 1 |
| More ▼ | |
Location
| Turkey | 454 |
| Australia | 155 |
| Canada | 144 |
| China | 127 |
| United States | 127 |
| Taiwan | 107 |
| United Kingdom | 100 |
| Nigeria | 98 |
| California | 95 |
| Netherlands | 91 |
| Indonesia | 86 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 2 |
Lauricella, Sharon; Kay, Robin – Australasian Journal of Educational Technology, 2010
Considerable research has been conducted examining the use of laptops in higher education, however, a reliable and valid scale to assess in-class use of laptops has yet to be developed. The purpose of the following study was to develop and evaluate the "Laptop Effectiveness Scale" (LES). The scale consisted of four constructs: academic…
Descriptors: Feedback (Response), Higher Education, Construct Validity, Measures (Individuals)
Tummons, Jonathan – Assessment & Evaluation in Higher Education, 2010
This paper forms part of an exploration of assessment on one part-time higher education (HE) course: an in-service, professional qualification for teachers and trainers in the learning and skills sector which is delivered on a franchise basis across a network of further education colleges in the north of England. This paper proposes that the…
Descriptors: Foreign Countries, Portfolios (Background Materials), Portfolio Assessment, Validity
Lynch, William W. – 1976
Guidelines for the use of observation instruments in pre-and in-service teachers of the handicapped are provided. It is explained that guidelines have been developed according to the following five principles: (1) the observation instrument must be relevant (including empirical, normative, and interpretative relevance); (2) methods of gathering…
Descriptors: Classroom Observation Techniques, Cost Effectiveness, Guidelines, Handicapped Children
PDF pending restorationKane, Michael T.; Moloney, James M. – 1976
The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests
Peer reviewedKlein, Stephen P.; Stecher, Brian M.; Shavelson, Richard J.; McCaffrey, Daniel; Ormseth, Tor; Bell, Robert M.; Comfort, Kathy; Othman, Abdul R. – Applied Measurement in Education, 1998
Two studies involving 368 elementary and high school students and 29 readers were conducted to investigate reader consistency, score reliability, and reader time requirements of three hands-on science performance tasks. Holistic scores were as reliable as analytic scores, and there was a high correlation between them after they were disattenuated…
Descriptors: Elementary School Students, Elementary Secondary Education, Hands on Science, High School Students
Love, Angela; And Others – 1996
The development of a coding scheme to identify the function of each conversational turn within episodes of conflict in a peer tutoring setting is described, and the scheme, based on Cohen's kappa analysis, is presented. Although 15 codes were developed for the initial effort, 7 codes were finally used to reflect each utterance as: (1) agreement;…
Descriptors: Coding, Correlation, Peer Teaching, Reliability
Kissel, Mary Ann – 1970
The problem of this study was to determine whether Method A is a more efficient observational method for obtaining activity type behaviors in an individualized classroom than Method B. Method A requires the observer to record the activities of the entire class at given intervals while Method B requires only the activities of selected individuals…
Descriptors: Classroom Observation Techniques, Individualized Instruction, Individualized Programs, Reliability
Hayes, Robert B. – 1968
This paper reports results of efforts over a 7-year period (1960-67) to determine if the Hayes Pupil-Teacher Reaction Scale is a reliable, valid unidimensional instrument which may be used to measure the attitude of students toward the teaching effectiveness of their teachers. Criteria used were 1) each respondent's total score describes with at…
Descriptors: Measurement Instruments, Reliability, Student Attitudes, Teacher Evaluation
Whalen, Thomas E. – 1971
Smith (1969) reported the results of an instrument for measuring teacher judgment of written composition. His test was first administered to a group of "experts" whose ratings were in high agreement. Then the test was given to a sample of over 200 teachers and lay readers. Among Smith's conclusions was that over half of the teachers have judgment…
Descriptors: Essay Tests, Reliability, Scoring, Test Validity
Peer reviewedLarrabee, Marva J.; Froehle, Thomas C. – Counselor Education and Supervision, 1979
Demonstrates that differences occur in role fidelity and in the performance consistency of a coached client over a series of simulated interviews. Illustrates that such differences can be quantitatively described, and that the results of the frequency tabulation procedure are affected by the training of raters in component observation. (Author)
Descriptors: Modeling (Psychology), Observation, Performance Factors, Reliability
Peer reviewedShowalter, Stuart W. – Journalism Quarterly, 1978
Reports that the "Readers' Guide to Periodical Literature" provides quick access to popular magazine content, although the titles are not drawn randomly from a universe of publications; that the indexers take an inclusive approach to cataloging; and that the indexers demonstrate high reliability in locating and cataloging full-length…
Descriptors: Cataloging, Indexes, Indexing, Periodicals
Reliability and Mean Length of Utterance as a Function of Sample Size in Early Language Development.
Peer reviewedRondal, J. A.; DeFays, D. – Journal of Genetic Psychology, 1978
Recommends criteria for determining adequate sample size for the use of Mean Length of Utterance (MLU) as an indicator of early language development. (BD)
Descriptors: Infants, Language Acquisition, Reliability, Research Criteria
Zuravin, Susan J.; And Others – Child Abuse and Neglect: The International Journal, 1987
Anonymous reports (n=155) of child physical abuse in Baltimore (MD) were compared with reports made by professionals (n=588) and nonprofessionals (n=262) in terms of substantiation rate, seriousness of substantiated incidents, and severity of allegations. While anonymous reports were more likely to be unfounded, those that were substantiated were…
Descriptors: Child Abuse, Comparative Analysis, Professional Personnel, Reliability
Peer reviewedCooper, Merri-Ann; Fiske, Donald W. – Educational and Psychological Measurement, 1976
Construct validity patterns of test-criteria and item-criteria correlations are shown to be inconsistent across samples. The results of an investigation of construct validity patterns on two published personality scales is presented. (JKS)
Descriptors: Correlation, Item Analysis, Personality Measures, Reliability
Iramaneerat, Cherdsak; Myford, Carol M.; Yudkowsky, Rachel – Online Submission, 2006
An Objective Structured Clinical Examination (OSCE) is an assessment approach employed in medical education, in which residents rotate through multiple stations of standardized clinical tasks to evaluate their clinical competence. Because items used to evaluate residents' performance in each OSCE station are linked to the same task and are rated…
Descriptors: Scoring, Reliability, Rating Scales, Medical Education

Direct link
