Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Caravolas, Marketa; Kessler, Brett; Hulme, Charles; Snowling, Margaret – Journal of Experimental Child Psychology, 2005
This study investigated children's sensitivity to spelling consistency, and lexical and sublexical (rime) frequency, and their use of explicitly learned canonical vowel graphemes in the early stages of learning to spell. Vowel spellings produced by 78 British children at the end of reception year (mean age 5 years, 7 months) and 6 months later in…
Descriptors: Graphemes, Vowels, Spelling, Child Psychology
Goldsborough, Reid – Black Issues in Higher Education, 2004
This article discusses PC reliability, one of the most pressing issues regarding computers. Nearly a quarter century after the introduction of the first IBM PC and the outset of the personal computer revolution, PCs have largely become commodities, with little differentiating one brand from another in terms of capability and performance. Most of…
Descriptors: Computers, Reliability, Computer Uses in Education, Computer Selection
Lecavalier, L.; Havercamp, S. M. – Journal of Intellectual Disability Research, 2004
Sensitivity theory proposes that there are wide individual differences in what motivates people with intellectual disability. The Reiss Profile MRDD is a rating scale that measures 15 fundamental motives. This study examined the internal consistency and interrater reliability of the 15 subscales as well as the validity of motivational profiles.…
Descriptors: Profiles, Caregivers, Validity, Rating Scales
Howell, Scott L. – New Directions for Teaching and Learning, 2004
Although instructional methods are moving in ever greater number to a multimedia base, testing is not. What principles should be considered in correcting this misalignment?
Descriptors: Multimedia Instruction, Teaching Methods, Test Validity, Test Reliability
Freedman, David A. – Evaluation Review, 2006
Experiments offer more reliable evidence on causation than observational studies, which is not to gainsay the contribution to knowledge from observation. Experiments should be analyzed as experiments, not as observational studies. A simple comparison of rates might be just the right tool, with little value added by "sophisticated" models. This…
Descriptors: Experiments, Control Groups, Inferences, Comparative Analysis
Krantz-Girod, Catherine; Bonvin, Raphael; Lanares, Jacques; Cueanot, Seagoleine; Feihl, Francois; Bosman, Fred; Waeber, Bernard – Assessment & Evaluation in Higher Education, 2004
The second preclinical year of the medical curriculum at the Medical Faculty of the University of Lausanne in Switzerland includes nine multidisciplinary organ-system-oriented modules consisting of lectures and problem-based-learning tutorials. This study reports the experience accumulated with the evaluation of lectures during the academic years…
Descriptors: Foreign Countries, Student Evaluation, Medical Students, Medical Education
Franklin, Anna – Journal of Experimental Child Psychology, 2006
Kowalski and Zimiles (2006) and O'Hanlon and Roberson (2006) address an age-old question: Why do children find it difficult to learn color terms? Here these articles are reflected on, providing a focused examination of the issues central to this question. First, the criteria by which children are said to find color naming difficult are considered.…
Descriptors: Children, Color, Test Validity, Test Reliability
Lucke, Joseph F. – Applied Psychological Measurement, 2005
Psychometric theory focuses primarily on tests that are homogeneous, measuring only one attribute of a psychosocial entity. However, the complexity of psychosocial behavior often requires tests that are heterogeneous, measuring more than one attribute. In this presentation, reliability and internal consistency are extended to heterogeneous tests…
Descriptors: Psychometrics, Item Response Theory, Test Reliability, Psychological Studies
Baer, Ruth A.; Smith, Gregory T.; Allen, Kristin B. – Assessment, 2004
A self-report inventory for the assessment of mindfulness skills was developed, and its psychometric characteristics and relationships with other constructs were examined. Participants included three samples of undergraduate students and a sample of outpatients with borderline personality disorder. Based on discussions of mindfulness in the…
Descriptors: Undergraduate Students, Psychometrics, Personality, Personality Problems
Bellamy, G. Thomas; Crawford, Lindy; Marshall, Laura Huber; Coulter, Gail A. – Educational Administration Quarterly, 2005
As public policies increasingly hold schools responsible for preventing school failure, experiences of other organizations that must operate with high reliability may be helpful. This article builds on previous studies of high reliability organizations to inquire how their strategies might inform efforts to improve reliability in loosely coupled…
Descriptors: Learning Problems, Identification, Reliability, Public Policy
Franklin, Christine A.; Mulekar, Madhuri S. – Teaching Statistics: An International Journal for Teachers, 2004
This article describes an activity through which students collect data and explore ways to display them through graphs and charts. It also motivates various summary measures for location, spread and shape. Finally, it gives an introduction to concepts of validity, reliability and unbiasedness.
Descriptors: Computation, Measurement, Prior Learning, Charts
Ryngala, Donna J.; Shields, Alan L.; Caruso, John C. – Educational and Psychological Measurement, 2005
A reliability generalization of the Revised Children's Manifest Anxiety Scale (RCMAS) was conducted using the normative sample. The RCMAS consists of a Total Anxiety scale as well as four subscales. Results suggest that the Total Anxiety scores are typically reliable (median across 48 samples = .81). Subscale scores were less reliable: The median…
Descriptors: Measures (Individuals), Test Reliability, Generalization, Anxiety
Peer reviewedOnwuegbuzie, Anthony J.; Roberts, J. Kyle; Daniel, Larry G. – Measurement and Evaluation in Counseling and Development, 2005
In this article, the authors (a) illustrate how displaying disattenuated correlation coefficients alongside their unadjusted counterparts will allow researchers to assess the impact of unreliability on bivariate relationships and (b) demonstrate how a proposed new "what if reliability" analysis can complement null hypothesis significance…
Descriptors: Correlation, Statistical Significance, Reliability, Error of Measurement
Bachman, Lyle F. – Language Assessment Quarterly, 2005
The fields of language testing and educational and psychological measurement have not, as yet, developed a set of principles and procedures for linking test scores and score-based inferences to test use and the consequences of test use. Although Messick (1989) discusses test use and consequences, his framework provides virtually no guidance on how…
Descriptors: Test Use, Testing, Language Tests, Validity
Tilley, Susan A.; Powick, Kelly D. – Canadian Journal of Education, 2004
In this article, we report on our qualitative study involving eight individuals hired to transcribe research tapes in university contexts. We consider issues of data analysis and data trustworthiness and the implications for both when transcription is assigned to someone other than the researcher. We explore the challenges transcribers faced…
Descriptors: Data Analysis, Research Methodology, Interviews, Qualitative Research

Direct link
