Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Sidney Newton; Rui Wang – Educational Studies, 2024
Notwithstanding the neuromyth controversy, the malleability of learning style preferences impacts the validity of the measurement instrument and the effectiveness of the associated model of learning. This study investigates the test-retest reliability and underlying dynamics of Kolb's Learning Style Inventory (KLSI). It surveys 245 college-level…
Descriptors: Cognitive Style, Preferences, Reliability, Validity
Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024
A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated on their reliability, and there is no standard practice to estimate it. In this article, we used three open data sets to explore an approach to…
Descriptors: Reliability, Reaction Time, Psychometrics, Criticism
Richard S. Balkin; Quentin Hunter; Bradley T. Erford – Measurement and Evaluation in Counseling and Development, 2024
We describe best practices in reporting reliability estimates in counseling research with consideration to precision, generalization, and diverse populations. We provide a historical context to reporting reliability estimates, the limitations of past practices, and new methods to address reliability generalization. We highlight best practices…
Descriptors: Best Practices, Reliability, Counseling, Research
Monica L. Coleman; Moira Ragan; Tahani Dari – Measurement and Evaluation in Counseling and Development, 2024
Intercoder reliability can increase trustworthiness, accuracy, rigor, collaboration, and power sharing in qualitative research. Though not every qualitative design can utilize intercoder reliability, this article highlights how positivist qualitative research, community-based participatory research, and participatory evaluation all strengthen when…
Descriptors: Interrater Reliability, Qualitative Research, Counseling, Research
Rizky Putri Amalia; Fitri Ariyanti Abidin; Fitriani Yustikasari Lubis; Hery Susanto – Cogent Education, 2024
This study aimed to adapt and validate the Parents Education Anxiety Questionnaire (PEAQ) for the Indonesian context. The sample included 222 parents of school-aged children, predominantly mothers (84.7%) and fathers (15.3%). The results indicated that the adapted questionnaire exhibited good reliability, with a Cronbach's alpha coefficient of…
Descriptors: Foreign Countries, Parent Attitudes, Anxiety, Questionnaires
Elizabeth J. Preas; Mary E. Halbur; Regina A. Carroll – Analysis of Verbal Behavior, 2024
Procedural fidelity refers to the degree to which procedures for an assessment or intervention (i.e., independent variables) are implemented consistent with the prescribed protocols. Procedural fidelity is an important factor in demonstrating the internal validity of an experiment and clinical treatments. Previous reviews evaluating the inclusion…
Descriptors: Verbal Communication, Behavioral Science Research, Periodicals, Fidelity
Russell P. Houpt; Kevin J. Grimm; Aaron T. McLaughlin; Daryl R. Van Tongeren – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Numerous methods exist to determine the optimal number of classes when using latent profile analysis (LPA), but none are consistently correct. Recently, the likelihood incremental percentage per parameter (LI3P) was proposed as a model effect-size measure. To evaluate the LI3P more thoroughly, we simulated 50,000 datasets, manipulating factors…
Descriptors: Structural Equation Models, Profiles, Sample Size, Evaluation Methods
Ilona Kocvarová; Jan Kalenda; Jitka Vaculíková; Zuzana Neupauer; Ruženka Šimonji Cernak; Anna Wloch – Higher Education Quarterly, 2024
The article focuses on adaptation and validation of the Academic Motivation Scale questionnaire (AMS-28) in higher education in four Eastern European countries: Czechia, Slovakia, Serbia, and Poland. The research was conducted with a total of 1711 respondents. We examined the construct validity of AMS-28 including measurement invariance and…
Descriptors: Foreign Countries, Learning Motivation, Measures (Individuals), Validity
Hüseyin Öztürk; Mustafa Karabulut; Mine Baydan-Aran; Suna Tokgöz-Yilmaz – Journal of Deaf Studies and Deaf Education, 2024
This methodological study aimed to assess the validity and reliability of the Turkish version of the Evaluation of the Impact of Hearing Loss in Adults (ERSA) questionnaire for individuals with treated hearing loss. The study involved 200 participants, and both exploratory factor analysis and confirmatory factor analysis were used to examine…
Descriptors: Turkish, Test Validity, Test Reliability, Hearing Impairments
Leah Ward; Kamila Polišenská; Colin Bannard – Journal of Speech, Language, and Hearing Research, 2024
Purpose: This systematic review and multilevel meta-analysis examines the accuracy of sentence repetition (SR) tasks in distinguishing between typically developing (TD) children and children with developmental language disorder (DLD). It explores variation in the way that SR tasks are administered and/or evaluated and examines whether variability…
Descriptors: Children, Language Impairments, Repetition, Sentences
McCluskey, Sydne – ProQuest LLC, 2023
Rater comparison analysis is commonly necessary in the social sciences. Conventional approaches to the problem generally focus on calculation of agreement statistics, which provide useful but incomplete information about rater agreement. Importantly, one-number agreement statistics give no indication regarding the nature of disagreements, nor do…
Descriptors: Bayesian Statistics, Structural Equation Models, Interrater Reliability, Beliefs
Luu, Kimberly; Sidhu, Ravi; Chadha, Neil K.; Eva, Kevin W. – Advances in Health Sciences Education, 2023
Clinical supervisors are known to assess trainee performance idiosyncratically, causing concern about the validity of their ratings. The literature on this issue relies heavily on retrospective collection of decisions, resulting in the risk of inaccurate information regarding what actually drives raters' perceptions. Capturing in-the-moment…
Descriptors: Clinical Experience, Practicum Supervision, Student Evaluation, Evaluation Methods
Egmose, Ida; Skou, Mia; Madsen, Eva Back; Stuart, Anne Christine; Krogh, Marianne Thode; Haase, Tina Wahl; Vaever, Mette Skovgaard – European Journal of Developmental Psychology, 2023
Mind-mindedness (MM) refers to the parent's ability to treat the child as an individual with a mind of his or her own. Studies have found representational and interactional MM to predict child development, but more research is needed on the validity of representational MM in parents of infants. Therefore, we examine the reliability and validity of…
Descriptors: Individualism, Mothers, Infants, Foreign Countries
Feldberg, Zachary R. – ProQuest LLC, 2023
Cognitive diagnostic models (CDMs) provide pedagogically relevant information in the form of a student profile of multiple binary categorizations of students into mastery or nonmastery statuses on latent traits called attributes. Federal educational accountability requires accountability measures to designate students into one of at least three…
Descriptors: Accountability, Standards, Cutting Scores, Models
Tavares, Walter; Kinnear, Benjamin; Schumacher, Daniel J.; Forte, Milena – Advances in Health Sciences Education, 2023
In this perspective, the authors critically examine "rater training" as it has been conceptualized and used in medical education. By "rater training," they mean the educational events intended to "improve" rater performance and contributions during assessment events. Historically, rater training programs have focused…
Descriptors: Medical Education, Interrater Reliability, Evaluation Methods, Training

Peer reviewed
Direct link
