Publication Date
| In 2026 | 1 |
| Since 2025 | 166 |
| Since 2022 (last 5 years) | 1019 |
| Since 2017 (last 10 years) | 2334 |
| Since 2007 (last 20 years) | 6520 |
Descriptor
| Reliability | 9759 |
| Validity | 3866 |
| Foreign Countries | 2823 |
| Measures (Individuals) | 1892 |
| Correlation | 1522 |
| Factor Analysis | 1460 |
| Statistical Analysis | 1278 |
| Questionnaires | 1084 |
| Scores | 1064 |
| Student Attitudes | 1034 |
| Psychometrics | 979 |
Audience
| Researchers | 181 |
| Practitioners | 101 |
| Teachers | 61 |
| Administrators | 42 |
| Policymakers | 33 |
| Students | 21 |
| Counselors | 10 |
| Media Staff | 5 |
| Community | 1 |
| Parents | 1 |
| Support Staff | 1 |
Location
| Turkey | 454 |
| Australia | 155 |
| Canada | 144 |
| China | 127 |
| United States | 127 |
| Taiwan | 107 |
| United Kingdom | 100 |
| Nigeria | 98 |
| California | 95 |
| Netherlands | 91 |
| Indonesia | 86 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 2 |
Peer reviewed: Diserens, Deborah; And Others – Journal of Medical Education, 1986
A computer program developed at the University of Pennsylvania School of Medicine presents simulated patient cases and then scores participants' clinical problem-solving in the cases by comparing their performances with those of faculty members. The validity and reliability of this evaluation system were investigated. (Author/MLW)
Descriptors: Clinical Diagnosis, Evaluation Methods, Graduate Medical Students, Higher Education
Peer reviewed: Edwards, Dee; Williams, David – British Journal of Educational Technology, 1985
Discusses process of continuous assessment that involves many tutors grading papers at Great Britain's Open University and the problem of grade unreliability within such a system. A longitudinal experiment involving grading of papers in one course by all tutors and comparing their grades to determine grading reliability is described. (MBR)
Descriptors: College Faculty, Correspondence Study, Distance Education, Evaluation Methods
Peer reviewed: Jensen, Arthur R. – Intelligence, 1985
Borkowski and Krause (1983) concluded that the locus of black-white intelligence differences lies in metaprocesses, not elementary cognitive processes. However, some variables were difference scores with unacceptably low reliability. Magnitude comparisons of racial differences give a different picture of results; comparable differences in measures…
Descriptors: Black Students, Cognitive Measurement, Cognitive Processes, Correlation
Peer reviewed: Smith, Stephen R.; Paulen, Leslie J. – Journal of Medical Education, 1984
A total of 120 medical schools responded to a survey about their use of written student evaluations of faculty, anonymity policy, and use of the evaluations in faculty promotion, tenure, and salary decisions. Most currently use a system of regular, anonymous evaluation by students, and its continuation is recommended. (MSE)
Descriptors: Confidentiality, Higher Education, Medical School Faculty, Medical Schools
Karkee, Thakur; Lewis, Dan M.; Barton, Karen; Haug, Carolyn – 2003
This study aimed to determine the degree to which the inclusion of accommodated students with disabilities in the calibration sample affects the characteristics of item parameters and the test results. Investigated were effects on test reliability, item fit to the applicable item response theory (IRT) model, item parameter estimates, and students'…
Descriptors: Academic Accommodations (Disabilities), Disabilities, Elementary School Students, Intermediate Grades
Selfa, Lance A.; Suter, Natalie; Myers, Sharon; Koch, Shaun; Johnson, Robert A.; Zahs, Daniel A.; Kuhr, Brian D.; Abraham, Sameer Y.; Zimbler, Linda J. – 1997
The 1988 National Survey of Postsecondary Faculty (NSOPF-88), later named the National Study of Postsecondary Faculty, was the first comprehensive study of higher education instructional faculty conducted by the National Center for Education Statistics since 1963. This report provides a description of the 1993 NSOPF and the data generated by its…
Descriptors: College Faculty, Colleges, Data Analysis, Data Collection
Dugoni, Bernard; Lee, Lisa; Tourangeau, Roger – 1997
During round 16 of the National Longitudinal Survey of Youth (NLSY), 900 NLSY sample members were randomly assigned to be interviewed about the period since their round 14 interview. Their responses were compared to those of approximately 8,000 NLSY sample members who were assigned to be interviewed about the 1-year period since their round 15…
Descriptors: Comparative Analysis, Data Collection, Employment Level, Interviews
Merriam, Sharan B. – 1998
This book offers a resource guide for qualitative researchers in education, discussing data collection techniques, data analysis, reporting, and the issues of validity, reliability, and ethics. Part 1 reviews the nature and design of qualitative research; it discusses various types of qualitative research (including case studies), and how to…
Descriptors: Case Studies, Data Analysis, Data Collection, Evaluation Methods
Royce, Daniel – 1994
Reinterviews were conducted to measure the response variance of selected questions from the 1991 Schools and Staffing Survey (SASS) administrator, school, and teacher questionnaires. Response variance measures one component of the nonsampling error in the data collected by a question, and it indicates how consistently respondents answer questions…
Descriptors: Administrators, Data Collection, Educational Research, Elementary Secondary Education
Quinn, Thomas James – 1998
An instrument was developed to measure perceived levels of anxiety of students enrolled in a resident outdoor adventure education course, and to confirm four underlying factors that contribute to anxiety in such settings. These factors are level of control, program inadequacies, personal inadequacies, and level of comfort. A 53-item Outdoor…
Descriptors: Adventure Education, Affective Measures, Anxiety, Higher Education
Arce-Ferrer, Alvaro J.; Cisneros-Cohernour, Edith J. – 2001
This paper summarizes main findings from a two-step investigation of the translation of a psychological scale from English into Spanish. The overall purpose of the study was to document the effects of tailoring a scale with etic items (i.e., culturally general items) and emic items (i.e., culture specific items) on the quality of the information.…
Descriptors: Culture Fair Tests, English, Foreign Countries, High School Students
Bastick, Tony – 1999
A method for measuring the contribution an individual makes to group work is described, and its use is supported through a study of 57 university students aged from 20 to 46 years working in 8 groups of 4 to 10 members each. The method recognizes that the most valid sources of information on the contribution of each individual to the group work…
Descriptors: Accountability, College Students, Cooperative Learning, Criteria
VanLehn, Kurt – 2001
Olae is a computer system for assessing student knowledge of physics, and Newtonian mechanics in particular, using performance data collected while students solve complex problems. Although originally designed as a stand-alone system, it has also been used as part of the Andes intelligent tutoring system. Like many other performance assessment…
Descriptors: Bayesian Statistics, Computer Assisted Testing, Intelligent Tutoring Systems, Knowledge Level
Pascoe, Donna; Halpin, Glennelle – 2001
This review covers the test components of validity, reliability, job-relatedness, and test bias in relation to teacher licensing examinations and the legal decisions that have affected policy in this area. The literature provides a history of court decisions and legal rulings that have shaped policy, test design, and test use. The important…
Descriptors: Court Litigation, Elementary Secondary Education, Job Skills, Legal Problems
Yen, Shu Jing; Bene, Nancy; Huynh, Huynh – 2000
Content integration in performance assessment involves mixing different areas of knowledge in one assessment. In this type of testing situation, assessment tasks are designed to measure the ability of students to solve problems by applying their knowledge and skills in multiple content areas. This study examined the effect of integrated science…
Descriptors: Elementary Secondary Education, Integrated Activities, Performance Based Assessment, Reading Achievement


