Publication Date
In 2025 | 205 |
Since 2024 | 705 |
Since 2021 (last 5 years) | 2293 |
Since 2016 (last 10 years) | 4594 |
Since 2006 (last 20 years) | 6899 |
Descriptor
Test Reliability | 14762 |
Test Validity | 9771 |
Test Construction | 4248 |
Foreign Countries | 3657 |
Psychometrics | 2361 |
Factor Analysis | 2251 |
Measures (Individuals) | 1717 |
Evaluation Methods | 1401 |
Higher Education | 1384 |
Correlation | 1234 |
Questionnaires | 1228 |
Audience
Researchers | 452 |
Practitioners | 319 |
Teachers | 128 |
Administrators | 73 |
Policymakers | 33 |
Counselors | 31 |
Students | 17 |
Parents | 10 |
Community | 6 |
Support Staff | 5 |
Location
Turkey | 797 |
Australia | 236 |
Canada | 205 |
China | 195 |
Indonesia | 142 |
Spain | 124 |
United States | 121 |
United Kingdom | 117 |
Germany | 106 |
Taiwan | 103 |
Netherlands | 99 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 2 |
Does not meet standards | 1 |
Anna Cecilia McWhirter; Katherine A. Hails; David S. DeGarmo; Laura Lee McIntyre; S. Andrew Garbacz; Elizabeth A. Stormshak – Grantee Submission, 2024
Reliable and valid assessment of parenting and child behaviors is critical for clinicians and researchers alike, and observational measures of parenting behaviors are often considered the gold standard for assessing parenting and parent-child interaction quality. The current study sought to evaluate the reliability and validity of the Coder…
Descriptors: Questionnaires, Test Reliability, Test Validity, Kindergarten
Rommel AlAli; Yousef Wardat; Shoeb Saleh; Nedal Alshraifin – European Journal of STEM Education, 2024
Background: The STEM education system has garnered significant interest due to its innovative approach in creating a multidisciplinary and integrated knowledge structure that connects science, technology, engineering, and mathematics, and allows students to apply scientific knowledge in a comprehensive and holistic manner. Consequently, evaluating…
Descriptors: Teaching Methods, Gifted Education, Academically Gifted, Creative Teaching
Jennifer R. Banas; Sarah Gershon – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2024
Nationally recognized social justice standards guide educators in developing social justice education. Absent from the guidance are tools to conduct initial formative assessment or to measure the impact of related instruction. To fill that gap, an academic researcher and 10th-grade teacher used a 3-phased, 9-step process to develop, pilot test,…
Descriptors: Social Justice, Rating Scales, Test Construction, Self Evaluation (Individuals)
Sarah French; Ashton Dickerson; Raoul A. Mulder – Higher Education: The International Journal of Higher Education Research, 2024
High-stakes examinations enjoy widespread use as summative assessments in higher education. We review the arguments for and against their use, across seven common themes: memory recall and knowledge retention; student motivation and learning; authenticity and real-world relevance; validity and reliability; academic misconduct and contract…
Descriptors: High Stakes Tests, Program Effectiveness, Evidence Based Practice, Summative Evaluation
Nilda Hocaoglu; Gürbüz Ocak – International Journal of Contemporary Educational Research, 2024
Motivation is crucial in the pace and success of language learning, and the effect of motivation on language learning has been extensively studied. Many scales have been developed to measure the motivation levels of the students. However, there are a limited number of studies conducted to specifically measure the motivation towards English…
Descriptors: Test Construction, Test Validity, Student Motivation, English (Second Language)
Semma, Brandie; Henri, Maria; Luo, Wen; Thompson, Christopher G. – Journal of Psychoeducational Assessment, 2019
Meaning in life is a psychological construct linked to several subjective well-being indicators. One commonly used meaning in life measure is the Meaning in Life Questionnaire (MLQ), a 10-item self-report measure that assesses perceived presence of and search for meaning in life. Despite its extensive use, the variability of the questionnaire's…
Descriptors: Questionnaires, Test Reliability, Generalization, Psychological Patterns
Faran, Yifat; Zanbar, Lea – International Journal of Social Research Methodology, 2019
The present study is the first to examine empirically whether required fields in online surveys impair reliability and response pattern, as participants forced to respond to all items may provide arbitrary answers. Two hundred and thirteen participants completed a survey consisting of six questionnaires testing personal and social issues and…
Descriptors: Online Surveys, Test Reliability, Response Style (Tests), Questionnaires
Kondo, Kanako; Mizuta, Masanobu; Kawai, Yoshitaka; Sogami, Tohru; Fujimura, Shintaro; Kojima, Tsuyoshi; Abe, Chika; Tanaka, Ryo; Shiromoto, Osamu; Uozumi, Ryuji; Kishimoto, Yo; Tateya, Ichiro; Omori, Koichi; Haji, Tomoyuki – Journal of Speech, Language, and Hearing Research, 2021
Purpose: Auditory-perceptual evaluation is essential for the assessment of voice quality. The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) provides a standardized protocol and assessment form for clinicians to analyze the voice quality and has been adapted into several different languages. The aims of this study were to develop the…
Descriptors: Japanese, Test Validity, Test Reliability, Voice Disorders
Lenz, A. Stephen; Rocha, Lauran; Aras, Yahyahan – International Journal for the Advancement of Counselling, 2021
A systematic search was conducted to identify measures of school climate developed and reported between 1993 and 2017. We coded data related to participant and setting characteristics, qualities of measures, amounts of validity evidence, and degrees of reliability estimates. Results indicated 9 school climate measures featuring disparate…
Descriptors: Educational Environment, Evaluation, Literature Reviews, Test Construction
Cunha, Marina; Silva, Patrícia; Ferreira, Cláudia; Galhardo, Ana – Child & Youth Care Forum, 2021
Background: Shame, as a self-conscious, complex, and universal emotion, plays an important role in mental health. In adolescents, given their greater vulnerability to the development of psychological difficulties, the assessment of shame, in its various dimensions, is especially relevant. Objective: To adapt and validate the External and Internal…
Descriptors: Measures (Individuals), Psychological Patterns, Adolescents, Test Validity
Li, Nan; Fan, Weihua; Wiesner, Margit; Arbona, Consuelo; Hein, Sascha – Journal of Engineering Education, 2021
Background: Engineering identity is associated with students' academic success and retention in engineering programs. However, there is a lack of psychometrically evaluated measures for assessing engineering identity formation. Purpose: This cross-sectional study aimed to adapt the Utrecht-Management of Identity Commitments Scale (U-MICS) to…
Descriptors: Test Construction, Identification (Psychology), Engineering Education, Professional Identity
Takamatsu, Reina; Tsou, Yung-Ting; Kusumi, Takashi; Rieffe, Carolien – International Journal of Behavioral Development, 2021
Empathy is assumed to be a universal human motivation to act altruistically toward others. Developmental models of empathy explaining when and how children acquire the capacity to empathize have been proposed. However, the existing knowledge is largely built upon studies conducted in the Western context. To fill this gap, a cross-culturally…
Descriptors: Foreign Countries, Empathy, Questionnaires, Preschool Children
Jiang, Zhehan; Shi, Dexin; Distefano, Christine – Educational and Psychological Measurement, 2021
The costs of an objective structured clinical examination (OSCE) are of concern to health profession educators globally. As OSCEs are usually designed under a generalizability theory (G-theory) framework, this article proposes a machine-learning-based approach to optimize the costs, while maintaining the minimum required generalizability…
Descriptors: Artificial Intelligence, Generalizability Theory, Objective Tests, Foreign Countries
Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021
Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…
Descriptors: Rating Scales, Test Construction, Language Tests, Test Use
Thams, L.; Hvid, L. G.; Damsgaard, C. T.; Hansen, M. – Measurement in Physical Education and Exercise Science, 2021
We aimed to assess the test-retest reliability of five muscle strength and physical function tests in healthy children. Forty-one children (6-9 years) were tested three times 4-10 days apart. The test protocol included maximal isometric leg press, hand grip strength, squat jump, long jump, and a 30-sec sit-to-stand test (STST). When comparing…
Descriptors: Test Reliability, Muscular Strength, Physical Fitness, Children