Publication Date
| In 2026 | 0 |
| Since 2025 | 621 |
| Since 2022 (last 5 years) | 3121 |
| Since 2017 (last 10 years) | 7362 |
| Since 2007 (last 20 years) | 15000 |
Descriptor
| Test Reliability | 15006 |
| Test Validity | 10245 |
| Reliability | 9748 |
| Foreign Countries | 7119 |
| Test Construction | 4807 |
| Validity | 4189 |
| Measures (Individuals) | 3872 |
| Factor Analysis | 3820 |
| Psychometrics | 3513 |
| Interrater Reliability | 3117 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1319 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 249 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Farmer, Ryan L.; Kim, Samuel Y. – Psychology in the Schools, 2020
Many prominent intelligence tests (e.g., Wechsler Intelligence Scale for Children, Fifth Edition [WISC-V] and Reynolds Intellectual Abilities Scale, Second Edition [RIAS-2]) offer methods for computing subtest- and composite-level difference scores. This study uses data provided in the technical manual of the WISC-V and RIAS-2 to calculate…
Descriptors: Children, Intelligence Tests, Scores, Test Reliability
Yildiz, Mehmet; Fidan, Ugur – Measurement in Physical Education and Exercise Science, 2020
The aim of this investigation was to assess the reliability and validity of the Fitjump system. Fifty-seven participants (age = 22.62 [plus or minus] 5.24 years, height = 180.69 [plus or minus]12.53 cm, body mass = 75.61 [plus or minus] 9.56 kg) performed three countermovement jump (CMJ) and squat jump (SJ) with a 1-week interval for test and…
Descriptors: Physical Activities, Reliability, Validity, Measurement Equipment
Shard; Devesh Kumar; Sapna Koul – International Journal of Information and Learning Technology, 2024
Purpose: This study aims to gain insights into how students perceive online examination practices and evaluation, as well as identify the key factors that impact their intentions toward online exams. Design/methodology/approach: This empirical study conducted in India utilized an online survey method between May 24 and June 14, 2022. The data were…
Descriptors: Foreign Countries, Undergraduate Students, Graduate Students, Student Attitudes
Pinar Mihci Türker; Ömer Kirmaci; Emrah Kayabasi; Erinç Karatas; Ebru Kiliç Çakmak; Serçin Karatas – Journal of Educational Technology and Online Learning, 2024
The COVID-19 epidemic has precipitated a rapid and widespread adoption of online education, leading to its normalization in contemporary society. Online education is evident across several educational levels. However, assessing the efficacy and effectiveness of these training programs can only be achieved by implementing a suitable evaluation…
Descriptors: Online Courses, Distance Education, Evaluation Methods, Test Construction
Peeraya Sukkeewan; Noawanit Songkram; Jaitip Nasongkhla – International Journal of Educational Methodology, 2024
The objective of this study was to develop a measure that possesses both reliability and validity in order to evaluate innovative thinking within the realm of education. To achieve this, the instrument's validity and reliability were evaluated through quantitative methods in two distinct phases. A team of educational experts conducted the process…
Descriptors: Likert Scales, Test Construction, Test Validity, Test Reliability
Nurdan Akgun; Seyda Gul – Science Insights Education Frontiers, 2024
This study aims to develop a test to measure sixth grade students' success in 'Circulatory System'. For this purpose, a draft test containing 22 questions was prepared by the researchers. And then, the draft test was submitted to expert opinion and examined in terms of language and content. Following expert opinions, the 22 multiple-choice…
Descriptors: Achievement Tests, Human Body, Test Construction, Science Education
Nathalie Liechti García; Albert Sesé – International Journal of Educational Management, 2024
Purpose: A crucial issue in educational management refers to helping teachers reach their full potential and manage their talents. Although managing talent is advised as an essential resource for organizational transformation to maximize performance and to promote a school's knowledge capital increase, Teachers' talent management (TTM) is not an…
Descriptors: Teacher Effectiveness, Talent Development, Definitions, Measures (Individuals)
Clara Margaça; José Carlos Sánchez-García; Brizeida Hernández Sánchez; Susana Lucas Mangas – International Journal of Sustainability in Higher Education, 2024
Purpose: To protect the environment and society, research on responsible behavior and personal values has increased. Values have been identified as important for understanding and predicting environmental preservation behaviors. The purpose of this study is to analyze the validity and reliability of the Environmental Portrait Value Questionnaire…
Descriptors: Universities, Conservation (Environment), Altruism, Self Concept
Ramazan Demir; Mehmet Murat; Gökmen Arslan – Educational and Developmental Psychologist, 2024
Objective: This study aimed to develop and validate a new measure- the Adolescent Strengths Use Scale- to assess school-based strength use among Turkish adolescents. Method The study group consisted 1209 Turkish students in grades 7-12. A 12-item pool was initially created in light of the relevant literature and content analysis of open-ended…
Descriptors: Foreign Countries, Test Construction, Adolescents, Individual Characteristics
Julie Sriken; Bradley T. Erford; Martin F. Sherman; Kristen Watson; Heather L. Smith – Measurement and Evaluation in Counseling and Development, 2024
Psychometric characteristics of CESD-R scores were explored on a sample of 966 undergraduate students. Internal consistency ([alpha] = 0.92), external convergent and discriminant validity, and response bias were adequate to excellent. Strong measurement invariance was evident for gender and race comparisons, and the unidimensional model fit the…
Descriptors: Symptoms (Individual Disorders), Depression (Psychology), Measures (Individuals), Undergraduate Students
José M. García-Fernández; María Isabel Gómez-Núñez; Ornela Mateu-Martínez; Dori J. A. Urbán; Cándido J. Inglés – Psychology in the Schools, 2024
Anxiety and school fears are relatively frequent in childhood. Psychology and education professionals need to have assessment instruments for screening for school anxiety in schools. This study aimed to develop, adapt, and examine the reliability and validity evidence of the School Anxiety Inventory for Primary Education (SAI-PE) scores. Using…
Descriptors: Anxiety, School Phobia, Screening Tests, Reliability
Elizabeth B. Vaughan; A. Montoya-Cowan; Jack Barbera – Chemistry Education Research and Practice, 2024
The Meaningful Learning in the Laboratory Instrument (MLLI) was designed to measure students' expectations before and after their laboratory courses and experiences. Although the MLLI has been used in various studies and laboratory environments to investigate students' cognitive and affective laboratory expectations, the authors of the instrument…
Descriptors: Test Validity, Test Reliability, Expectation, Measures (Individuals)
Marine Simon; Alexandra Budke – Journal of Geography in Higher Education, 2024
Comparison is an important geographic method and a common task in geography education. Mastering comparison is a complex competency and written comparisons are challenging tasks both for students and assessors. As yet, however, there is no set test for evaluating comparison competency nor tool for enhancing it. Moreover, little is known about…
Descriptors: Geography Instruction, Student Evaluation, Comparative Analysis, Reliability
Huiying Cai; Xun Yan – Language Testing, 2024
Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…
Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation
A. Stephen Lenz; Carla Smith; Amber Meegan – Measurement and Evaluation in Counseling and Development, 2024
The Professional Quality of Life Scale (ProQOL) has an extensive history of use that often relies on inductions of reliability from precedent literature. We completed a systematic review of the literature and extracted sample-specific reliability estimates for ProQOL subscale scores. Random effects meta-analytic modeling was implemented to…
Descriptors: Literature Reviews, Quality of Working Life, Professional Identity, Meta Analysis

Peer reviewed
Direct link
