Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Serap Keles; Seyda Özcan – Journal of Teacher Education and Educators, 2025
The purpose of this study is to adapt the "Classroom Management Self-Efficacy Instrument (CMSEI)," developed by Slater and Main (2020), into Turkish and to test its validity and reliability. In line with the scale adaptation process, initial translation studies were conducted, followed by construct validity testing using…
Descriptors: Classroom Techniques, Self Efficacy, Test Validity, Test Reliability
Hongliang Guo; Hongqin Chai; Rui Xue; Lei Yao – SAGE Open, 2025
This study aims to develop a validated scale to measure the sports moral character (SMC) development of Chinese primary and secondary school learners. The primary and secondary indicators in the scale were based on the structural framework of physical education virtues in the "Physical Education and Health Curriculum Standards for Compulsory…
Descriptors: Test Construction, Moral Values, Elementary Secondary Education, Physical Education
Yuyu Jiang; Hua Chen – SAGE Open, 2025
This study aimed to develop and validate an analytical rating scale specifically designed to assess the lexical proficiency of Chinese college students in Academic English speaking tasks. A multi-layer construct of lexical proficiency was first developed and operationalized into an eight-dimension rating scale, including word diversity, word…
Descriptors: Test Validity, Rating Scales, Academic Language, Speech
Hüseyin Ataseven; Ömay Çokluk-Bökeoglu; Fazilet Tasdemir – Journal of Theoretical Educational Science, 2025
This study investigates the reliability and consistency of a custom GPT-based scoring system in comparison to trained human raters, focusing on B1-level opinion paragraphs written by English preparatory students. Addressing the limited evidence on how AI scoring systems align with human evaluations in foreign language contexts, the study provides…
Descriptors: Artificial Intelligence, Technology Uses in Education, Writing Skills, Student Evaluation
Filiz Arzu Yalin; Ahmet Özbay; Safak Oguz – European Journal of Education, 2025
This study developed and validated a Decision-Making Skill Test (DMST) for Turkish adolescents to address the lack of culturally appropriate assessment tools for multi-criteria decision-making skills. A cross-sectional design was employed with 427 participants aged 11-17 years from diverse socioeconomic backgrounds across Turkey. Following…
Descriptors: Test Construction, Test Validity, Student Evaluation, Decision Making
Andrea Gjorevski; Mimi Li; Troy L. Cox – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2025
Open access to novel AI tools offers unprecedented opportunities for human-AI collaboration in writing instruction and assessment. While research on using generative AI tools like ChatGPT in these contexts is emerging, more is needed to understand their effectiveness as Automated Writing Evaluation (AWE) tools. This study explores the potential of…
Descriptors: Artificial Intelligence, Criterion Referenced Tests, Essay Tests, Automation
Ryan, Joseph J.; Gontkovsky, Samuel T. – Journal of Psychoeducational Assessment, 2021
We analyzed data from the WASI-II manual to determine discrepancy score reliabilities of the Verbal Comprehension (VCI) and Perceptual Reasoning (PRI) indexes and the four subtests in the child and adult standardization samples. Reliabilities of the VCI-PRI discrepancy scores range from 0.78 to 0.86 for children and 0.82 to 0.89 for adults and…
Descriptors: Intelligence Tests, Test Reliability, Scores, Children
Foster, Robert C. – Educational and Psychological Measurement, 2021
This article presents some equivalent forms of the common Kuder-Richardson Formula 21 and 20 estimators for nondichotomous data belonging to certain other exponential families, such as Poisson count data, exponential data, or geometric counts of trials until failure. Using the generalized framework of Foster (2020), an equation for the reliability…
Descriptors: Test Reliability, Data, Computation, Mathematical Formulas
Hu, Kaiyan; Zhao, Li; Zhou, Qi; Mei, Fan; Gao, Qianqian; Chen, Fei; Jiang, Mengyao; Zhao, Bing; Zhang, Weiyi; Kwong, Joey S. W.; Ma, Yuxia; Mou, Chenghua; Ma, Bin – Research Synthesis Methods, 2021
Authors should give careful consideration to the study eligibility criteria of systematic reviews (SRs) and adhere to them after the review protocol is developed, to reduce the possibility of manipulation of inclusion. Our aim was to investigate the prevalence of differences in study eligibility criteria between non-Cochrane SRs and their pre-registered…
Descriptors: Eligibility, Reliability, Criteria, Literature Reviews
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2021
The population discrepancy between unstandardized and standardized reliability of homogeneous multicomponent measuring instruments is examined. Within a latent variable modeling framework, it is shown that the standardized reliability coefficient for unidimensional scales can be markedly higher than the corresponding unstandardized reliability…
Descriptors: Test Reliability, Computation, Measures (Individuals), Research Problems
Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024
Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…
Descriptors: Influences, Models, Measurement Techniques, Reliability
Cassandra Alighieri; Silke Meerschaert; Kristiane Van Lierde – Journal of Speech, Language, and Hearing Research, 2024
Purpose: This study compared the interrater reliability of adult naïve listeners' perceptual assessments of different speech variables in children with a cleft palate with or without a cleft lip (CP ± L). In addition, the study investigated whether the listeners were able to perceive differences in these speech variables before and after speech…
Descriptors: Adults, Listening Skills, Speech Therapy, Congenital Impairments
Sean N. Weeks; Tyler L. Renshaw; Allysia A. Rainey; Aubrey Hiatt – Journal of Emotional and Behavioral Disorders, 2024
Internalizing and externalizing problems are common targets for school mental health screening. Prior research supports the interpretation of scores from the Youth Internalizing Problems Screener (YIPS) and the Youth Externalizing Problems Screener (YEPS), which were developed separately yet intended as companion measures. We extended previous…
Descriptors: Adolescents, Screening Tests, Behavior Problems, Mental Health
Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024
Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…
Descriptors: Foreign Countries, Students, English Language Learners, Speech
Duong Thi Ngoc Ngan; Maria Hercz – Asia-Pacific Education Researcher, 2024
As there is a paucity of instruments investigating a hybrid teaching conception, the current study is part of an attempt to fill this gap. The subjects in the study were 310 university instructors in the Socialist Republic of Viet Nam (Vietnam). The survey was implemented with the use of Cognitive Constructivism-oriented Teaching…
Descriptors: Blended Learning, Faculty, Teaching Methods, Foreign Countries

