Publication Date
| In 2026 | 0 |
| Since 2025 | 621 |
| Since 2022 (last 5 years) | 3121 |
| Since 2017 (last 10 years) | 7362 |
| Since 2007 (last 20 years) | 15000 |
Descriptor
| Test Reliability | 15006 |
| Test Validity | 10245 |
| Reliability | 9748 |
| Foreign Countries | 7119 |
| Test Construction | 4807 |
| Validity | 4189 |
| Measures (Individuals) | 3872 |
| Factor Analysis | 3820 |
| Psychometrics | 3513 |
| Interrater Reliability | 3117 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1319 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 249 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Na Liu; Mingren Zhao; Rui Jin – Psychology in the Schools, 2025
Empirical research has extensively used the reading strategy scales to explore English reading strategies within the Chinese context. However, there is a lack of studies examining the reading strategies used by Chinese students in their native language (L1). Considering the differences between Chinese logographic characters and alphabetic English…
Descriptors: Reading Strategies, Rating Scales, Chinese, Native Language
Hae Sun Jung; Haein Lee; Keon Chul Park – SAGE Open, 2025
This study investigates user experience (UX) priorities in early childhood education applications by analyzing Korean-language user reviews using Bidirectional Encoder Representations from Transformers topic modeling (BERTopic). Eighteen latent topics were extracted and systematically mapped to the eight software quality characteristics defined by…
Descriptors: Early Childhood Education, Computer Uses in Education, Computer Software, Usability
Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025
This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction
Özlem Dönmez; Ozana Ural – International Journal of Early Childhood, 2025
The main aim of this study is to develop a valid and reliable child-family interaction scale to assess child-family interactions from the eyes of children, and then to examine interaction behaviors according to different demographic variables. Also, the positive and negative interaction behaviors according to children's expressions are examined.…
Descriptors: Measures (Individuals), Test Construction, Test Validity, Test Reliability
Patrick H. M. Sins; Lida T. Klaver; Jaap de Brouwer; Tessa H. S. Eysink; Alieke M. van Dijk – Psychology in the Schools, 2025
Supporting students' self-regulated learning (SRL) is essential in education, yet most interventions adopt a one-size-fits-all approach, overlooking individual differences in students' ability to engage in SRL. Tailored instruction requires reliable assessment tools that distinguish between students' "availability" (knowledge) and…
Descriptors: Elementary School Students, Independent Study, Learning Strategies, Inquiry
Gresham, Frank; Elliott, Stephen; Metallo, Sarah; Byrd, Shelby; Wilson, Elizabeth; Erickson, Megan; Cassidy, Kaitlin; Altman, Robert – Assessment for Effective Intervention, 2020
This study described the development of the "Social Skills Improvement System Social Emotional Learning Edition Rating Forms" (SSIS SEL RF) for teachers, parents, and students. This new multirater assessment is a reconfiguration of the SSIS Rating Scales items inspired by the CASEL Social Emotional Competency framework. The internal…
Descriptors: Interpersonal Competence, Rating Scales, Psychometrics, Social Development
Chen, Zhen; Fang, Rui; Zhang, Yi; Ge, Pingjiang; Zhuang, Peiyun; Chou, Adriana; Jiang, Jack – Journal of Speech, Language, and Hearing Research, 2018
Purpose: The purpose of this study is to develop the Mandarin version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) and evaluate its reliability compared with the Grade, Roughness, Breathiness, Asthenia, Strain (GRBAS). Method: The Mandarin version of the CAPE-V tool was translated from the validated English version with…
Descriptors: Voice Disorders, Diagnostic Tests, Mandarin Chinese, Test Reliability
Guo, Cuihua; Luo, Meifang; Wang, Xuxiang; Huang, Saijun; Meng, Zhaoxue; Shao, Jie; Zhang, Xuan; Shao, Zhi; Wu, Jieling; Robins, Diana L.; Jing, Jin – Journal of Autism and Developmental Disorders, 2019
Although early detection of autism facilitates intervention, early detection strategies are not yet widespread in China. To improve the situation, the Chinese version of the Modified Checklist for Autism in Toddlers, Revised with Follow-Up (M-CHAT-R/F) was validated. The sample included 7928 toddlers, aged 16 to 30 months, screened during their…
Descriptors: Check Lists, Autism, Pervasive Developmental Disorders, Toddlers
Kramer, Robin S. S.; Jones, Alex L.; Gous, Georgina – Applied Cognitive Psychology, 2021
Deciding whether two different face photographs or voice samples are from the same person represent fundamental challenges within applied settings. To date, most research has focussed on average performance in these tests, failing to consider individual differences and within-person consistency in responses. Here, participants completed the same…
Descriptors: Individual Differences, Accuracy, Reliability, Correlation
Hartstein, Bonnie; Yackel, Edward – Learning Organization, 2021
Purpose: This study aims to describe how the Army and the Army Medical Department matured as a learning organization (LO) during the period after the 2014 Military Health System Review through the incorporation of changes aimed at improving patient safety, data transparency and becoming a high-reliability organization (HRO). This study explores…
Descriptors: Armed Forces, Medical Services, Organizational Learning, Organizational Change
Lewis, Carly A.; Myers, Carl L. – Contemporary School Psychology, 2021
Behavior rating scales are frequently used to assess social-emotional behaviors of children. While broadband behavior rating scales often measure similarly named constructs, it is unclear how consistently different instruments measure those constructs. Head Start teachers completed the preschool versions of the Behavior Assessment System for…
Descriptors: Preschool Teachers, Interrater Reliability, Child Behavior, Behavior Rating Scales
Schüler, Anne; Merkt, Martin – Journal of Computer Assisted Learning, 2021
In two experiments, the multimedia contradiction paradigm was used to investigate whether learners map information conveyed through the audio and the picture track of a video. In Experiment 1 (N = 85), the information conveyed through the audio track and the picture track was always consistent (control group) or was made inconsistent by changing…
Descriptors: Video Technology, Cognitive Processes, Multimedia Materials, Eye Movements
McNulty, Richard J.; Floyd, Randy G. – Psychology in the Schools, 2021
This study examined the factor structure of the Detroit Tests of Learning Abilities, Fifth Edition (DTLA-5) using principal axis factoring, multiple factor extraction criteria, and the Schmid-Leiman orthogonalization procedures not utilized by test publishers. Results suggest that the publisher's six-factor structure model was over factored.…
Descriptors: Aptitude Tests, Cognitive Ability, Factor Structure, Factor Analysis
Watts, Field M.; Finkenstaedt-Quinn, Solaire A. – Chemistry Education Research and Practice, 2021
The tradition of qualitative research drives much of chemistry education research activity. When performing qualitative studies, researchers must demonstrate the trustworthiness of their analysis so researchers and practitioners consuming their work can understand if and how the presented research claims and conclusions might be transferable to…
Descriptors: Qualitative Research, Educational Research, Research Methodology, Chemistry
Belur, Jyoti; Tompson, Lisa; Thornton, Amy; Simon, Miranda – Sociological Methods & Research, 2021
A methodologically sound systematic review is characterized by transparency, replicability, and a clear inclusion criterion. However, little attention has been paid to reporting the details of interrater reliability (IRR) when multiple coders are used to make decisions at various points in the screening and data extraction stages of a study. Prior…
Descriptors: Interrater Reliability, Decision Making, Accuracy, Coding

Peer reviewed
Direct link
