Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024
The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…
Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias
Juliana Reyes-Martin; David Simó-Pinatella; Ana Andrés – Journal of Applied Research in Intellectual Disabilities, 2025
Background: Behavioural problems in individuals with intellectual disabilities have a negative impact on them. Limited assessment measures exist in Spain. This study aimed to validate the Behavior Problems Inventory--Short Form (BPI-S) in the Spanish population by examining its psychometric properties and factorial structures. Method: This study…
Descriptors: Foreign Countries, Behavior Problems, Students with Disabilities, Intellectual Disability
Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024
Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, relying either on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…
Descriptors: Testing, Academic Ability, Time on Task, Correlation
Kelvin Terrell Pompey – ProQuest LLC, 2021
Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…
Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation
Venkatraman, Yamini; Mahalingam, Shenbagavalli; Boominathan, Prakash – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) is a standardized instrument for assessing voice quality. It has been translated and culturally adapted into several languages. This study aimed to develop and validate a Tamil version of CAPE-V through auditory perceptual evaluation of remotely…
Descriptors: Sentences, Dravidian Languages, Acoustics, Auditory Perception
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Sanja Lestarevic; Marko Kalanj; Luka Milutinovic; Roberto Grujicic; Jelena Vasic; Jovana Maslak; Marija Mitkovic-Voncina; Natasa Ljubomirovic; Milica Pejovic-Milovancevic – Journal of Autism and Developmental Disorders, 2024
We aimed to evaluate the internal consistency of the Stanford Social Dimensions Scale (SSDS) translated into Serbian and to test it against the Strengths and Difficulties Questionnaire (SDQ). The sample consisted of 200 patients (32% ASD) of the Institute of Mental Health in Belgrade, Serbia (68 females, 132 males, M[subscript age]=9.61, SD[subscript…
Descriptors: Foreign Countries, Questionnaires, Translation, Test Reliability
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: The confirmatory factor analysis (CFA) model is a commonly adopted framework for estimating and testing a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Li, Minzi; Zhang, Xian – Language Testing, 2021
This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…
Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports
Muhammed Tayyib Kadak; Nihal Serdengeçti; Meryem Seçen Yazici; Tuncay Sandikçi; Aybike Aydin; Zehra Koyuncu; Yavuz Meral; Abas Hasimoglu; Yasin Çaliskan; Gizem Bayraktar; Elif Can Öztürk; Mehmet Enes Gökler; Roula Choueiri; Mahmut Cem Tarakçioglu – Autism: The International Journal of Research and Practice, 2024
This study aims to investigate the validation of the Rapid Interactive Screening Test for Autism in Toddlers (RITA-T) in Turkish toddlers between 18 and 36 months of age. Children aged 18-36 months were referred to the department of child psychiatry for concerns of autism spectrum disorder, language disorder, developmental delay, and typically…
Descriptors: Foreign Countries, Turkish, Screening Tests, Autism Spectrum Disorders
Yunus Emre Tütüneken; Yasemin Buran Çirak; Kübra Kardes; Burcu Isikci; Ramazan Binbuga; Emre Çetrefli; Mehmet Sarili; Recep Tayyip Öz – Measurement in Physical Education and Exercise Science, 2024
The aim of the study was to investigate the reliability and validity of the timed up & go (TUG) test and the 30-s sit-to-stand (30-s STS) test performed via tele-assessment in ambulatory patients with stroke. Sixty-one patients with chronic stroke were included. For reliability, test-retest and inter-rater reliability were determined. For…
Descriptors: Test Reliability, Test Validity, Telecommunications, Health Services
Lewis, Samala B. – ProQuest LLC, 2023
This dissertation measures the multicultural teaching competency (MTC) and the frequency at which science educators use culturally relevant educational practices (CREPs). This study's mixed-method, convergent design is grounded in critical theory, and the MTC and culturally relevant education (CRE) framework. Findings suggest that the CREPs-F…
Descriptors: Cultural Pluralism, Culturally Relevant Education, Teacher Competencies, Science Teachers
Enrico Gandolfi; Richard E. Ferdig – Educational Technology Research and Development, 2025
Augmented Reality (AR) is increasingly being adopted in education to foster engagement and interest in a variety of subjects and content areas. However, there is a scarcity of instruments to measure the instructional impact of this innovation. This article addresses this gap in two unique ways. First, it presents validation results of the…
Descriptors: Simulated Environment, Measures (Individuals), Rating Scales, Item Response Theory
Cátia Marques; Íris M. Oliveira; Jaisso Vautero; Ana Daniela Silva – International Journal for Educational and Vocational Guidance, 2024
This study examined the psychometric properties of the Career Adapt-Abilities Scale in a Lebanese sample. The study includes 236 Lebanese citizens (54.2% women; M[subscript age] = 30.14). Confirmatory factor analyses indicated that a hierarchical model yielded a good fit, with the CAAS measuring four distinct dimensions that can be combined in a…
Descriptors: Psychometrics, Career Development, Factor Analysis, Goodness of Fit