Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Cho, Moon-Heum; Jonassen, David – Educational Psychology, 2009
Two studies focusing on the development and validation of the Online Self-Regulated Learning Inventory (OSRLI) were conducted. The OSRLI is a self-report instrument assessing the human interaction dimension of online self-regulated learning. It consists of an affect/motivation scale and an interaction strategies scale. In Study 1, exploratory…
Descriptors: Writing Strategies, Self Efficacy, Online Courses, Learning Strategies
Ertmer, Peggy A.; Stepich, Donald A.; Flanagan, Sara; Kocaman-Karoglu, Aslihan; Reiner, Christian; Reyes, Lisette; Santone, Adam L.; Ushigusa, Shigetake – Performance Improvement Quarterly, 2009
This exploratory study examined differences in the problem representations of a case-based situation by expert and novice instructional designers. The experts and half of the novices (control group) received identical directions for case analysis, while the other novices (treatment group) received additional guidelines recommending analysis…
Descriptors: Control Groups, Instructional Design, Problem Solving, Case Studies
Albiero, Paolo; Matricardi, Giada; Speltri, Daniela; Toso, Diana – Journal of Adolescence, 2009
The present study examined the validity of the Basic Empathy Scale (BES) [Jolliffe, D., & Farrington, D. P. (2006a). Development and validation of the Basic Empathy Scale. "Journal of Adolescence," 29, 589-611; Jolliffe, D., & Farrington, D. P. (2006b). Examining the relationship between low empathy and bullying. "Aggressive…
Descriptors: Prosocial Behavior, Aggression, Factor Structure, Adolescents
Claessen, Mary; Heath, Steve; Fletcher, Janet; Hogben, John; Leitao, Suze – International Journal of Language & Communication Disorders, 2009
Background: There is a great deal of evidence to support the robust relationship between phonological awareness and literacy development. Researchers are beginning to understand the relationship between the accuracy and distinctiveness of stored phonological representations and performance on phonological awareness tasks. However, many of the…
Descriptors: Reaction Time, Phonological Awareness, Validity, Goal Orientation
Zullig, Keith J.; Huebner, E. Scott; Patton, Jon M.; Murray, Karen A. – American Journal of Health Behavior, 2009
Objectives: To investigate the psychometric properties of the BMSLSS-College among 723 college students. Methods: Internal consistency estimates explored scale reliability, factor analysis explored construct validity, and known-groups validity was assessed using the National College Youth Risk Behavior Survey and Harvard School of Public Health…
Descriptors: Life Satisfaction, Public Health, Quality of Life, Construct Validity
Egbochuku, E. O.; Aihie, N. O. – Journal of Instructional Psychology, 2009
The study focused on the influence of peer group counselling and school influence on the self-concept of adolescents' in Nigerian secondary schools. Sixty-eight Senior Secondary School II students from three schools--a boys' school, a girls' school and a co-educational school in Benin City participated in the study. A pre-test, post-test control…
Descriptors: Control Groups, Research Design, Student Attitudes, Females
Wentworth, Nancy; Erickson, Lynnette B.; Lawrence, Barbara; Popham, J. Aaron; Korth, Byran – Studies in Educational Evaluation, 2009
The Clinical Practice Assessment System (CPAS), developed in response to teacher preparation program accreditation requirements, represents a paradigm shift of one university toward data-based decision-making in teacher education programs. The new assessment system is a scale aligned with INTASC Standards, which allows for observation and…
Descriptors: Preservice Teacher Education, Data, Decision Making, Student Teacher Evaluation
Vassar, Matt; Hale, William – Journal of Interpersonal Violence, 2009
Empirical research on anger and hostility has pervaded the academic literature for more than 50 years. Accurate measurement of anger/hostility and subsequent interpretation of results requires that the instruments yield strong psychometric properties. For consistent measurement, reliability estimates must be calculated with each administration,…
Descriptors: Research Methodology, Psychometrics, Psychological Patterns, Affective Behavior
Arnold, Margery E. – 1996
It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…
Descriptors: Estimation (Mathematics), Generalizability Theory, Heuristics, Interrater Reliability
Kang, Namjun – 1987
If content analysis is to satisfy the requirement of objectivity, measures and procedures must be reliable. Reliability is usually measured by the proportion of agreement of all categories identically coded by different coders. For such data to be empirically meaningful, a high degree of inter-coder reliability must be demonstrated. Researchers in…
Descriptors: Content Analysis, Interrater Reliability, Measurement Techniques, Media Research
Halpin, Gerald; And Others – 1986
Based upon the assumption that the process of peer review of publications and research is flawed, interrater reliability of reviews of 188 research proposals submitted for funding at a major university was studied. The eight dimensions rated were: (1) significance of the research; (2) clarity and reasonableness of the objectives; (3)…
Descriptors: College Faculty, Evaluation Criteria, Evaluators, Grants
Peer reviewedWaugh, R. P. – Journal of School Psychology, 1975
Discusses rationale, reliability, factorial validity, and educational usefulness of the Illinois Test of Psycholinguistic Abilities. The I.T.P.A. is a serious attempt to identify specific sensory processing abilities. (Author)
Descriptors: Evaluation, Factor Analysis, Measurement Techniques, Reliability
Peer reviewedPowers, Stephen; And Others – Educational and Psychological Measurement, 1985
Results of an administration of the Language Proficiency Measure indicated that the interrater reliability was adequate, internal-consistency reliability estimates were high, concurrent validity coefficients were adequate, and the classification validity was acceptable. (Author/LMO)
Descriptors: Elementary Education, Interrater Reliability, Language Proficiency, Language Tests
Peer reviewedWeinrott, Mark R.; Jones, Richard R. – Child Development, 1984
Examines the tendency of observers to make less reliable recordings of behavorial events when a calibrating observer is absent. Using four different multicategory systems, 26 experienced observers coded 200 hours of videotaped family interactions. Concludes that observers lapse into a less attentive "set" prior to coding without a…
Descriptors: Adults, Behavior Patterns, Behavior Rating Scales, Family (Sociological Unit)
Peer reviewedHenk, William A.; Selders, Mary L. – Reading Teacher, 1984
Shows that synonymic scoring of cloze tests is highly variable--that the score seems to appear simply on who grades the test. (FL)
Descriptors: Cloze Procedure, Interrater Reliability, Reading Instruction, Reading Research

Direct link
