Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Zirou Lin; Hanbing Yan; Li Zhao – Journal of Computer Assisted Learning, 2024
Background: Peer assessment has played an important role in large-scale online learning, as it helps promote the effectiveness of learners' online learning. However, with the emergence of numerical grades and textual feedback generated by peers, it is necessary to detect the reliability of the large amount of peer assessment data, and then develop…
Descriptors: Peer Evaluation, Automation, Grading, Models
Darmawan Muttaqin – Journal of Psychoeducational Assessment, 2024
The Vocational Identity Status Assessment (VISA) is one of the instruments that can be used to assess vocational identity. Conceptually, VISA consists of six sub-dimensions and has been validated using factor analysis. This study provides a factor structure test of the Indonesian version of VISA using the exploratory structural equation modeling…
Descriptors: Foreign Countries, Structural Equation Models, Vocational Interests, Occupational Tests
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Amal Abdullah Alibrahim – South African Journal of Education, 2024
After ChatGPT was released late in 2022, many arguments about its accuracy and use in education arose. In this article, I seek to provide evidence of the accuracy and validity of ChatGPT's responses to users' queries in education by applying a systematic review methodology to analyse publications in specific databases following PRISMA guidelines…
Descriptors: Artificial Intelligence, Technology Uses in Education, Reliability, Natural Language Processing
Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024
This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…
Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory
Ann C. Jolly; Kristen D. Beach; Heather H. Aiken; Steven J. Amendum – Grantee Submission, 2024
The field of education relies heavily on instructional coaches to build teacher capacity in the implementation of evidence-based practices (EBPs). Although observation tools are commonly used to measure the fidelity of implementation by teachers, fewer tools are available to identify specific coaching behaviors used during in situ coaching…
Descriptors: Coaching (Performance), Observation, Research Tools, Reliability
Ann C. Jolly; Kristen D. Beach; Heather H. Aiken; Steven J. Amendum – Journal of Educational and Psychological Consultation, 2024
The field of education relies heavily on instructional coaches to build teacher capacity in the implementation of evidence-based practices (EBPs). Although observation tools are commonly used to measure the fidelity of implementation by teachers, fewer tools are available to identify specific coaching behaviors used during in situ coaching…
Descriptors: Coaching (Performance), Observation, Research Tools, Reliability
Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…
Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability
Tagavi, Daina M.; Dick, Catherine C.; Attar, Shana M.; Ibanez, Lisa V.; Stone, Wendy L. – Autism: The International Journal of Research and Practice, 2023
This study examined the feasibility of implementing the Screening Tool for Autism in Toddlers, an interactive Level-2 screen for autism spectrum disorder, within Part C Early Intervention settings. Participants included 69 Early Intervention providers (M age = 43.3 years, 93.7% females, 92.4% Whites) from nine programs who attended a one-day…
Descriptors: Autism Spectrum Disorders, Toddlers, Early Intervention, Diagnostic Tests
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Ntumi, Simon; Agbenyo, Sheilla; Bulala, Tapela – Shanlax International Journal of Education, 2023
There is no need or point to testing of knowledge, attributes, traits, behaviours or abilities of an individual if information obtained from the test is inaccurate. However, by and large, it seems the estimation of psychometric properties of test items in classroomshas been completely ignored otherwise dying slowly in most testing environments. In…
Descriptors: Psychometrics, Accuracy, Test Validity, Factor Analysis
Tsangaridou, Niki; Charalambous, Charalambos Y. – Quest, 2023
Focusing on systematic observation, one of the most potent methods of studying teaching quality, represents one of the numerous contributions of Daryl Siedentop to the profession. While he had a clear focus on issues of validity and reliability concerning systematic observation, over the past decades, attention to such issues appears to have…
Descriptors: Physical Education Teachers, Observation, Validity, Reliability
Sella-Weiss, Oshrat – International Journal of Language & Communication Disorders, 2023
Background: Quantitative measures can increase precision in describing swallowing function, improve interrater and test-retest reliability, and advance clinical decision-making. The Test of Mastication and Swallowing Solids (TOMASS) and the Timed Water Swallow Test (TWST) are functional tests for swallowing that provide quantitative results. Aims:…
Descriptors: Human Body, Motor Reactions, Tests, Test Reliability
Poulsen, Mads; Juul, Holger; Elbro, Carsten – Annals of Dyslexia, 2023
Different definitions and tests of dyslexia can cause unfairness and make life difficult for people with dyslexia as well as for the professionals. In 2012, the Danish government decided to support the fight against dyslexia. The government issued a public tender for the development of "a standardized, electronically administered test of…
Descriptors: Dyslexia, National Competency Tests, Foreign Countries, Test Construction
Moore, C. Missy; Crawford, Carey C.; Tertichny, Alissa – Measurement and Evaluation in Counseling and Development, 2023
We examined dimensionality and temporal stability of the Interpersonal Stress Scale-Counselor (ISS-C) scores in a sample of professional counselors (n = 518). Confirmatory factor analyses provided support for a four-factor model previously identified through exploratory factor analysis and a bifactor model. Using a randomized test-retest, temporal…
Descriptors: Counselors, Interpersonal Relationship, Stress Variables, Measures (Individuals)

Peer reviewed
Direct link
