Jordan, Altricia – ProQuest LLC, 2023
Data science, as a discipline, can be used in any area. However, in order to utilize data science techniques, data scientists must be taught domain knowledge, referred to as a partner discipline, in the area in which the techniques are to be utilized. Using a quantitative analysis of publicly available information and survey methodology, this…
Descriptors: Data Science, Training, Scientists, Reliability
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
Xiong, Yao; Schunn, Christian D.; Wu, Yong – Journal of Computer Assisted Learning, 2023
Background: For peer assessment, reliability (i.e., consistency in ratings across peers) and validity (i.e., consistency of peer ratings with instructors or experts) are frequently examined in the research literature to address a central concern of instructors and students. Although the average levels are generally promising, both reliability and…
Descriptors: Peer Evaluation, Computer Assisted Testing, Test Reliability, Test Validity
Abbas, Mohsin; van Rosmalen, Peter; Kalz, Marco – IEEE Transactions on Learning Technologies, 2023
For predicting and improving the quality of essays, text analytic metrics (surface, syntactic, morphological, and semantic features) can be used to provide formative feedback to the students in higher education. In this study, the goal was to identify a sufficient number of features that exhibit a fair proxy of the scores given by the human raters…
Descriptors: Feedback (Response), Automation, Essays, Scoring
Aktas, Fatma Nur – Acta Didactica Napocensia, 2023
This phenomenology research aims to examine prospective elementary mathematics teachers' proving and proof evaluation and their thoughts on convincing according to proof type and argument type. The participants were eight prospective teachers. The data collection tools were semi-structured group interviews, interview video recordings and the…
Descriptors: Persuasive Discourse, Mathematical Logic, Logical Thinking, Visual Aids
Pin, Tamis W.; So, Vincent K. K.; Siu, Cynthia S. H.; Yip, Sheila S. N.; Cheung, Stella See-wing; Kan, Jenny Yim-mui – Journal of Autism and Developmental Disorders, 2021
To examine reliability and validity of the new Social Motor Function Classification System for Children with Autism Spectrum Disorders (SMFCS-ASD). The SMFCS-ASD reliability was examined on 25 children (62.4 months, SD 7.8) with ASD among six physical therapists. The validity study involved 1001 children (57.0 months, SD 9.9) with ASD using the…
Descriptors: Autism, Pervasive Developmental Disorders, Children, Classification
Özaydin, Zeynep; Arslan, Çigdem – Journal of Theoretical Educational Science, 2022
The aim of this study is to develop a rubric to assess mathematical reasoning competence. Since the aim is to assess a competency, the frameworks of the PISA exams in the literature, which give an important place to competencies, have been examined. Due to its focus and in-depth analysis of mathematical reasoning, each of the actions expected from…
Descriptors: Foreign Countries, Scoring Rubrics, Mathematical Logic, Competence
Joshi, Ashwini; Baheti, Isha; Angadi, Vrushali – Journal of Speech, Language, and Hearing Research, 2020
Aim: The purpose of this study was to develop and assess the reliability of a Hindi version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Reliability was assessed by comparing Hindi CAPE-V ratings with English CAPE-V ratings and by the Grade, Roughness, Breathiness, Asthenia and Strain (GRBAS) scale. Method: Hindi sentences…
Descriptors: Test Construction, Indo European Languages, Test Reliability, Voice Disorders
Jones, Nathan; Bell, Courtney; Qi, Yi; Lewis, Jennifer; Kirui, David; Stickler, Leslie; Redash, Amanda – ETS Research Report Series, 2021
The observation systems being used in all 50 states require administrators to learn to accurately and reliably score their teachers' instruction using standardized observation systems. Although the literature on observation systems is growing, relatively few studies have examined the outcomes of trainings focused on developing administrators'…
Descriptors: Observation, Standardized Tests, Teacher Evaluation, Test Reliability
Osman Birgin; Elif Seval Peker – Psychology in the Schools, 2025
The aim of this study was to develop an instrument for assessing sixth-grade students' number sense skills in fractions and decimals. This study was conducted on 452 sixth graders (10-11 years old) from the western region of Turkey. The construct validity of the number sense test (NST) was examined via exploratory factor analysis (EFA) and…
Descriptors: Foreign Countries, Grade 6, Test Construction, Mathematics Education
Yongtian Cheng; K. V. Petrides – Educational and Psychological Measurement, 2025
Psychologists are emphasizing the importance of predictive conclusions. Machine learning methods, such as supervised neural networks, have been used in psychological studies as they naturally fit prediction tasks. However, we are concerned about whether neural networks fitted with random datasets (i.e., datasets where there is no relationship…
Descriptors: Psychological Studies, Artificial Intelligence, Cognitive Processes, Predictive Validity
Melissa Raspa; Angela Gwaltney; Carla Bann; Jana von Hehn; Timothy A. Benke; Eric D. Marsh; Sarika U. Peters; Amitha Ananth; Alan K. Percy; Jeffrey L. Neul – Journal of Autism and Developmental Disorders, 2025
Rett syndrome is a severe neurodevelopmental disorder that affects about 1 in 10,000 females. Clinical trials of disease modifying therapies are on the rise, but there are few psychometrically sound caregiver-reported outcome measures available to assess treatment benefit. We report on a new caregiver-reported outcome measure, the Rett Caregiver…
Descriptors: Neurodevelopmental Disorders, Genetic Disorders, Females, Test Validity
Huei-Wen Tsai; Ching-Ling Cheng – Journal of Psychoeducational Assessment, 2025
This study aimed to evaluate the psychometric properties and gather evidence supporting the validity of scores from a traditional Chinese version of the Claremont Purpose Scale (TC-CPS) among Taiwanese adolescents. The TC-CPS, measuring meaningfulness, goal directedness, and beyond-the-self orientation, was administered to 233 high school and 445…
Descriptors: Foreign Countries, Adolescents, Measures (Individuals), High School Students
Mehmet Emin Ören; Servet Atik – International Journal of Assessment Tools in Education, 2025
This study aimed to adapt the DigiFuehr 2.0 Scale developed by Claassen et al. (2023) to Turkish and to conduct validity and reliability studies on three groups of participants consisting of teachers. In the study, exploratory and confirmatory factor analyses were performed in line with translation study, linguistic application, and…
Descriptors: Test Reliability, Test Validity, Test Construction, Translation
Hongwei Yang; Müslim Alanoglu; Songül Karabatak; Kelly D. Bradley – International Journal of Assessment Tools in Education, 2025
The study took a Rasch measurement theory approach to validating the 10-item Digital Literacy Scale (DLS) using the unidimensional rating scale model (RSM). To that end, the study used the data from a sample of online Turkish university students. The study began the Rasch analysis with all 10 items in the scale and, to improve in the local…
Descriptors: Digital Literacy, Measures (Individuals), Test Validity, Foreign Countries