Publication Date
In 2025 | 15 |
Since 2024 | 62 |
Since 2021 (last 5 years) | 269 |
Since 2016 (last 10 years) | 1022 |
Since 2006 (last 20 years) | 2398 |
Descriptor
Correlation | 3031 |
Reliability | 1517 |
Test Reliability | 1234 |
Foreign Countries | 1140 |
Factor Analysis | 844 |
Test Validity | 803 |
Measures (Individuals) | 761 |
Validity | 682 |
Statistical Analysis | 633 |
Scores | 535 |
Psychometrics | 529 |
More ▼ |
Source
Author
Tsai, Chin-Chung | 10 |
Zimmerman, Donald W. | 10 |
Gill, Brian | 9 |
Kilgus, Stephen P. | 8 |
Attali, Yigal | 7 |
Linn, Robert L. | 7 |
Lipscomb, Stephen | 6 |
Liu, Ou Lydia | 6 |
Lowe, Patricia A. | 6 |
Mendoza, Jorge L. | 6 |
Raykov, Tenko | 6 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 39 |
Teachers | 11 |
Practitioners | 10 |
Counselors | 4 |
Students | 3 |
Administrators | 2 |
Policymakers | 2 |
Parents | 1 |
Location
Turkey | 227 |
China | 68 |
Canada | 63 |
Taiwan | 51 |
Netherlands | 47 |
Australia | 43 |
Hong Kong | 42 |
United Kingdom | 36 |
California | 34 |
Nigeria | 33 |
South Korea | 32 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 7 |
Individuals with Disabilities… | 4 |
Americans with Disabilities… | 1 |
Every Student Succeeds Act… | 1 |
Rehabilitation Act 1973… | 1 |
United Nations Convention on… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 2 |
Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024
The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…
Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias
Juliana Reyes-Martin; David Simó-Pinatella; Ana Andrés – Journal of Applied Research in Intellectual Disabilities, 2025
Background: Behavioural problems in individuals with intellectual disabilities have a negative impact on them. Limited assessment measures exist in Spain. This study aimed to validate the Behavior Problems Inventory--Short Form (BPI-S) in the Spanish population by examining its psychometric properties and factorial structures. Method: This study…
Descriptors: Foreign Countries, Behavior Problems, Students with Disabilities, Intellectual Disability
Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024
Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…
Descriptors: Testing, Academic Ability, Time on Task, Correlation
Kelvin Terrell Pompey – ProQuest LLC, 2021
Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…
Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation
Venkatraman, Yamini; Mahalingam, Shenbagavalli; Boominathan, Prakash – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) is a standardized instrument used in voice assessment to assess voice quality. It has been translated and culturally adapted in several languages. This study aimed at developing and validating a Tamil version of CAPE-V through auditory perceptual evaluation of remotely…
Descriptors: Sentences, Dravidian Languages, Acoustics, Auditory Perception
John Gero; Julie Milovanovic – Creativity Research Journal, 2024
In this paper, we explore measurements of design creativity through metrics related to the processes used in designing and relate them to the metrics used in psychology for idea creativity, ie, novelty and fluency. Our goal was to test the reliability of psychometric measures of creativity to assess creativity in team design. We studied 19 teams…
Descriptors: Correlation, Creativity, Psychology, Psychometrics
Pin, Tamis W.; So, Vincent K. K.; Siu, Cynthia S. H.; Yip, Sheila S. N.; Cheung, Stella See-wing; Kan, Jenny Yim-mui – Journal of Autism and Developmental Disorders, 2021
To examine reliability and validity of the new Social Motor Function Classification System for Children with Autism Spectrum Disorders (SMFCS-ASD). The SMFCS-ASD reliability was examined on 25 children (62.4 months SD 7.8) with ASD among six physical therapists. The validity study involved 1001 children (57.0 months, SD 9.9) with ASD using the…
Descriptors: Autism, Pervasive Developmental Disorders, Children, Classification
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Sanja Lestarevic; Marko Kalanj; Luka Milutinovic; Roberto Grujicic; Jelena Vasic; Jovana Maslak; Marija Mitkovic-Voncina; Natasa Ljubomirovic; Milica Pejovic-Milovancevic – Journal of Autism and Developmental Disorders, 2024
We aimed to evaluate the internal consistency of Stanford Social Dimensions Scale (SSDS) translated to Serbian and to test it against the Strengths and Difficulties Questionnaire (SDQ). The sample consisted of 200 patients (32% ASD) of the Institute of Mental Health in Belgrade, Serbia (68 females, 132 males, M[subscript age]=9.61, SD[subscript…
Descriptors: Foreign Countries, Questionnaires, Translation, Test Reliability
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Dankiw, Kylie A.; Baldock, Katherine L.; Kumar, Saravana; Tsiros, Margarita D. – Australasian Journal of Early Childhood, 2021
Identifying and describing children's play behaviours is an important component of evaluating child development. The Behaviour Mapping Schedule is a direct observational tool which aims to describe and quantify children's play behaviours but is yet to undergo reliability testing. This study aimed to determine the intra- and inter-rater reliability…
Descriptors: Interrater Reliability, Classification, Child Behavior, Play
Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…
Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024
The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…
Descriptors: Accuracy, Reliability, Computational Linguistics, Standards
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis