Publication Date
In 2025 | 19 |
Since 2024 | 55 |
Since 2021 (last 5 years) | 210 |
Since 2016 (last 10 years) | 537 |
Since 2006 (last 20 years) | 1006 |
Descriptor
Scores | 1538 |
Test Validity | 1538 |
Test Reliability | 578 |
Foreign Countries | 355 |
Correlation | 311 |
Test Construction | 267 |
Psychometrics | 243 |
Factor Analysis | 185 |
Test Items | 177 |
Language Tests | 169 |
Comparative Analysis | 157 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 31 |
Practitioners | 22 |
Teachers | 6 |
Parents | 5 |
Policymakers | 5 |
Administrators | 3 |
Community | 3 |
Students | 2 |
Counselors | 1 |
Location
Turkey | 40 |
Canada | 28 |
China | 28 |
United Kingdom | 20 |
United States | 18 |
Australia | 16 |
Netherlands | 16 |
Florida | 15 |
Japan | 14 |
Texas | 14 |
Germany | 13 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Tiffany Wu; Christina Weiland; Meghan McCormick; JoAnn Hsueh; Catherine Snow; Jason Sachs – Grantee Submission, 2024
The Hearts and Flowers (H&F) task is a computerized executive functioning (EF) assessment that has been used to measure EF from early childhood to adulthood. It provides data on accuracy and reaction time (RT) across three different task blocks (hearts, flowers, and mixed). However, there is a lack of consensus in the field on how to score the…
Descriptors: Scoring, Executive Function, Kindergarten, Young Children
Zafer Ozen; Nielsen Pereira; Tugce Karatas; Hernán Castillo-Hermosilla; Yukiko Maeda – Gifted Child Quarterly, 2025
Cognitive Abilities Test (CogAT) is one of the most frequently used gifted identification tools. In this meta-analytic study, we investigated empirical evidence of the validity of CogAT, in relation to different types of instruments. After reviewing 1,480 studies, a total of 24 with 33 effect sizes were included in the meta-analysis. According to…
Descriptors: Test Validity, Cognitive Tests, Disability Identification, Scores
Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Chet Robie; Sabah Rasheed; Stephen D. Risavy; Piers Steel – International Journal of Testing, 2024
This meta-analysis examined the validity of an alternative to traditional assessments called the Wonderlic which is a brief measure of general mental ability. Our results showed significant, positive correlations between Wonderlic scores and academic performance in general ([r-bar] = 0.26), between Wonderlic scores and undergraduate GPA in…
Descriptors: Meta Analysis, Test Validity, Alternative Assessment, Scores
Meyer, J. Patrick; Hu, Ann; Li, Sylvia – NWEA, 2023
The Content Proximity Project was designed to improve the content validity of the MAP® Growth™ assessments while retaining the ability for the test to adapt off-grade and meet students wherever they are in their learning. Two main features of the project were the development of an enhanced item selection algorithm, and a spring pilot study…
Descriptors: Achievement Tests, Mathematics Achievement, Content Validity, Mathematics Tests
Johayra Bouza; Rebecca J. Bulotsky-Shearer; Krystal M. Bichay-Awadala; Jhonelle Bailey; Patricia Gaona; Lisa White; Veronica A. Fernandez – Psychology in the Schools, 2024
The purpose of this study was to validate the Spanish version of the Family Involvement Questionnaire-Short Form (FIQ-SF) for use with Spanish-speaking families of children enrolled in early childhood education programs. This study examined the factor structure of the FIQ-SF and established criterion validity for the resulting FIQ-SF dimension…
Descriptors: Family Involvement, Questionnaires, Early Childhood Education, Spanish
Huei-Wen Tsai; Ching-Ling Cheng – Journal of Psychoeducational Assessment, 2025
This study aimed to evaluate the psychometric properties and gather evidence supporting the validity of scores from a traditional Chinese version of the Claremont Purpose Scale (TC-CPS) among Taiwanese adolescents. The TC-CPS, measuring meaningfulness, goal directedness, and beyond-the-self orientation, was administered to 233 high school and 445…
Descriptors: Foreign Countries, Adolescents, Measures (Individuals), High School Students
Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023
Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…
Descriptors: Test Interpretation, Scores, Test Use, Test Validity
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Allison R. Lombardi; Graham G. Rifenbark; H. Jane Rogers; Hariharan Swaminathan; Ashley Taconet; Valerie L. Mazzotti; Mary E. Morningstar; Rongxiu Wu; Shannon Langdon – Career Development and Transition for Exceptional Individuals, 2023
The purpose of this study was to establish construct validity of a college and career readiness measure using a sample of youth with (n = 356) and without (n = 1,599) disabilities from five high schools across three U.S. states. We established content validity through expert item review, structural validity through initial field-testing, and…
Descriptors: Test Validity, Construct Validity, Adolescents, College Readiness
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
Aberdine R. Dwight; Amy M. Briesch; Jessica A. Hoffman; Christopher Rutt – Child & Youth Care Forum, 2024
Background: Although the Depression Anxiety Stress Scales, Short Form (DASS-21) was developed for adults, its authors noted no compelling reasons to not use the measure with youth as young as 12 years. Despite increasingly widespread use with youth, psychometric evidence in support of its use with this population needs to be investigated to fully…
Descriptors: Depression (Psychology), Measures (Individuals), Anxiety, Stress Variables
Jesus Alfonso D. Datu; Frank Fincham; Jet U. Buenconsejo – Journal of American College Health, 2024
The Caring for Bliss Scale (CBS) is a new measure that assesses an individuals' capacity to cultivate inner joy and happiness. Developed in the United States, its generalizability remains unknown in non-Western contexts. This research explored the scale's cross-national invariance among college students in the Philippines (n = 546) and the United…
Descriptors: Psychometrics, Likert Scales, Well Being, Beliefs
Youmi Suk – Asia Pacific Education Review, 2024
Regression discontinuity (RD) designs have gained significant popularity as a quasi-experimental device for evaluating education programs and policies. In this paper, we present a comprehensive review of RD designs, focusing on the continuity-based framework, the most widely adopted RD framework. We first review the fundamental aspects of RD…
Descriptors: Educational Research, Preschool Education, Regression (Statistics), Test Validity
Suzanna Dooley; Tammy Hopper; Rachael Doyle; Orla Gilheaney; Margaret Walshe – International Journal of Language & Communication Disorders, 2025
Background: Individuals with dementia have communication limitations resulting from cognitive impairments that define the syndrome. Whereas there are numerous cognitive assessments for individuals with dementia, there are far fewer communication assessments. The Profiling Communication Ability in Dementia (P-CAD) was developed to address this gap.…
Descriptors: Communication Skills, Communication Problems, Dementia, Intellectual Disability