Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Chamba-Eras, Luis; Arruarte, Ana; Elorriaga, Jon A. – IEEE Transactions on Learning Technologies, 2023
In the context of virtual learning communities (VLCs), where the participants may not know each other, it is necessary to have a mechanism to help when deciding who to work with and what reliable contents and information sources are. This study aims to design a generic trust model, named T-VLC, applicable to VLCs, which can be adapted to different…
Descriptors: Communities of Practice, Electronic Learning, Trust (Psychology), Models
Kunar, Melina A.; Watson, Derrick G. – Cognitive Research: Principles and Implications, 2023
Computer-Aided Detection (CAD) has been proposed to help operators search for cancers in mammograms. Previous studies have found that although accurate CAD leads to an improvement in cancer detection, inaccurate CAD leads to an increase in both missed cancers and false alarms. This is known as the over-reliance effect. We investigated whether…
Descriptors: Assistive Technology, Computer Use, Clinical Diagnosis, Screening Tests
Bryce D. McLeod; Nicole Porter; Aaron Hogue; Emily M. Becker-Haimes; Amanda Jensen-Doss – Grantee Submission, 2023
Objective: The precise measurement of treatment fidelity (quantity and quality in the delivery of treatment strategies in an intervention) is essential for intervention development, evaluation, and implementation. Various informants are used in fidelity assessment (e.g., observers, practitioners [clinicians, teachers], clients), but these…
Descriptors: Measurement, Fidelity, Educational Research, Evidence Based Practice
Orhan, Ali – Journal of Psychoeducational Assessment, 2022
The aims of this reliability generalization study were to provide the overall alpha values of the California critical thinking disposition inventory (CCTDI) total score and subscales scores and investigate the characteristics of the studies that may be associated with the variability in the reliability values of the CCTDI total score and subscales…
Descriptors: Critical Thinking, Measures (Individuals), Test Reliability, Generalization
Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022
The limitations of Cohen's ? are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…
Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design
Ilhan Çiçek; Mete Sipahioglu; Ümit Dilekçi – Psychology in the Schools, 2026
This study aims to examine the mediating roles of resilience and occupational self-efficacy in the relationship between occupational stress and subjective well-being and to adapt the Teacher Occupational Self-Efficacy-Short Form (OSS-SF) to Turkish culture. Using a cross-sectional design, convenience sampling was employed to collect the data. The…
Descriptors: Self Efficacy, Stress Variables, Teaching (Occupation), Resilience (Psychology)
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Gunjawate, Dhanshree R.; Ravi, Rohit; Bhagavan, Srividya – Journal of Speech, Language, and Hearing Research, 2020
Purpose: The purpose of this study was to evaluate the reliability and validity of the Kannada version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Method: The Kannada version of CAPE-V comprises six phrases that are phonetically designed as per the CAPE-V requirements. Sixty-five (21 individuals with dysphonia and 44…
Descriptors: Test Reliability, Test Validity, Dravidian Languages, Voice Disorders
Jonathan Arthur Schmidt; Gisa Aschersleben; Anne Henning – International Journal of Behavioral Development, 2025
In this longitudinal study, we investigated the factor structure and stability of early-life temperament in a German sample, using three measures developed within Rothbart's psychobiological approach. Temperament was measured using the Infant Behavior Questionnaire Revised (IBQ-R) at the ages of 6 and 12 months, the Early Childhood Behavior…
Descriptors: Foreign Countries, Personality, Personality Measures, Infants
José Hernando Ávila-Toscano; Laura Isabel Rambal-Rivaldo; David Javier Fortich Pérez; Leonardo Vargas-Delgado – International Journal of Education in Mathematics, Science and Technology, 2025
Technological mediation has gained relevance in teaching mathematics. Its usefulness and impact depend, to a great extent, on how students approach the learning of the discipline. Two independent instrumental studies were conducted to analyze the psychometric properties of the Spanish version of the Mathematics and Technology Attitudes Scale…
Descriptors: Mathematics Instruction, Educational Technology, Technology Uses in Education, Psychometrics
Ali Özcan; Fatma Tezel Sahin – International Journal of Psychology and Educational Studies, 2025
The present study aims to adapt the Technoference in Parent-Child Relationships Scale (TPCRS) to Turkish culture by conducting validity and reliability analyses. The study group consists of the parents of 445 children between the ages of 3 and 6 attending preschool in the Denizli province. Expert opinions were consulted for the language validity…
Descriptors: Foreign Countries, Parent Child Relationship, Test Validity, Test Reliability
Behçet Oral; Nese Dokumaci-Sütçü – Journal of Theoretical Educational Science, 2025
The aim of this study is to develop a valid and reliable scale to measure the time traps teachers fall into during the teaching-learning process. The sample consists of 234 final-year students continuing their education at the Faculty of Education in the first implementation and 233 pedagogical formation students in the second implementation.…
Descriptors: Test Construction, Time Management, Test Validity, Test Reliability
Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
Siti Suprihatiningsih; Masriyah; Rooselyna Ekawati – Journal of Education and Learning (EduLearn), 2025
The knowledge of the materials to be taught to the students is the basic knowledge that preservice mathematics teachers should possess, as they need to prepare themselves for teaching. In order to research preservice teachers' understanding of the subject matter and teaching skils, valid and reliable test instruments are required. Knowledge of…
Descriptors: Preservice Teachers, Pedagogical Content Knowledge, Preservice Teacher Education, Mathematics Teachers
Chia-Lin Tsai; Stefanie Wind; Samantha Estrada – Measurement: Interdisciplinary Research and Perspectives, 2025
Researchers who work with ordinal rating scales sometimes encounter situations where the scale categories do not function in the intended or expected way. For example, participants' use of scale categories may result in an empirical difficulty ordering for the categories that does not match what was intended. Likewise, the level of distinction…
Descriptors: Rating Scales, Item Response Theory, Psychometrics, Self Efficacy

Peer reviewed
Direct link
