Publication Date
| In 2026 | 0 |
| Since 2025 | 2142 |
| Since 2022 (last 5 years) | 12652 |
| Since 2017 (last 10 years) | 33777 |
| Since 2007 (last 20 years) | 68268 |
Descriptor
| Foreign Countries | 30502 |
| Test Validity | 21718 |
| Scores | 18245 |
| Academic Achievement | 16904 |
| Test Construction | 16724 |
| Test Reliability | 15006 |
| Achievement Tests | 14836 |
| Standardized Tests | 14707 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13033 |
| Language Tests | 12545 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5033 |
| Teachers | 3390 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2813 |
| Australia | 2425 |
| Canada | 2269 |
| California | 1851 |
| United States | 1725 |
| Texas | 1613 |
| China | 1577 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1120 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
Omar Saleh Bani Yassin; Aiman Mohammad Freihat; Sabri Hassan Al-Tarawneh – Educational Process: International Journal, 2025
Background/purpose: This study aimed to investigate the differences among the equations used in estimating the reliability coefficient using the half-split method. These equations demonstrate Spearman-Brown's, Rulon's, Guttman's, Mosier's, Flanagan's, and Horst's. Materials/methods: The study instrument was a 43-item scale for evaluating the…
Descriptors: Foreign Countries, Equations (Mathematics), Mathematics Instruction, Grade 10
Melissa H. Black; Karl Lundin Remnélius; Lovisa Alehagen; Thomas Bourgeron; Sven Bölte – Journal of Autism and Developmental Disorders, 2025
Purpose: A considerable number of screening and diagnostic tools for autism exist, but variability in these measures presents challenges to data harmonization and the comparability and generalizability of findings. At the same time, there is a movement away from autism symptomatology to stances that capture heterogeneity and appreciate diversity.…
Descriptors: Symptoms (Individual Disorders), Classification, Measures (Individuals), Autism Spectrum Disorders
Mohd Norlizam Mohd Razali; Aida Hanim A. Hamid; Bity Salwana Alias; Azlin Norhaini Mansor – Journal of Education and Learning (EduLearn), 2025
A teacher competency instrument was developed to determine the level of teacher competency in small schools in Peninsular Malaysia. This study was conducted in Perak and Negeri Sembilan to determine the instrument's reliability and validity. Exploratory factor analysis (EFA) and item reliability analysis were used to determine the questionnaire's…
Descriptors: Foreign Countries, Elementary Secondary Education, Small Schools, Rural Schools
Victoria Crisp; Sylvia Vitello; Abdullah Ali Khan; Heather Mahy; Sarah Hughes – Research Matters, 2025
This research set out to enhance our understanding of the exam techniques and types of written annotations or markings that learners may wish to use to support their thinking when taking digital multiple-choice exams. Additionally, we aimed to further explore issues around the factors that contribute to learners writing less rough work and…
Descriptors: Computer Assisted Testing, Test Format, Multiple Choice Tests, Notetaking
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Matthew T. Mahar; Hoyong Sung – Measurement in Physical Education and Exercise Science, 2025
Field-based tests of aerobic fitness that can be administered quickly and do not require maximal effort are desirable. The purpose was to develop and validate quarter-mile walk tests for 10--13-year-olds. Participants (N = 59) walked one mile on two different days. Walk times, heart rates, body mass, physical activity, and aerobic fitness were…
Descriptors: Physical Fitness, Test Construction, Exercise, Early Adolescents
Ali M. Alodat; Qais Al-Meqdad; Maha Al-Hendawi; Nawaf Al-Zyoud; Osamah Bataineh – Journal of Advanced Academics, 2025
This study uses a rigorous research process to explore the psychometric properties of the Gifted Rating Scale-School Form (GRS-S) within the Qatari educational context. We employed stratified cluster sampling of 326 students (aged 6-13 years, M = 10.9) from 25 public schools in Doha. Data was collected in the second semester of the 2023-2024…
Descriptors: Academically Gifted, Rating Scales, Psychometrics, Foreign Countries
Angela Chamberlain; Emily D'Arcy; Andrew J. O. Whitehouse; Kerry Wallace; Maya Hayden-Evans; Sonya Girdler; Benjamin Milbourn; Sven Bölte; Kiah Evans – Journal of Autism and Developmental Disorders, 2025
Purpose: The PEDI-CAT (ASD) is used to assess functioning of children and youth on the autism spectrum; however, current psychometric evidence is limited. This study aimed to explore the reliability, validity and acceptability of the PEDI-CAT (ASD) using a large Australian sample. Methods: Caregivers of 134 children and youth on the spectrum…
Descriptors: Autism Spectrum Disorders, Children, Youth, Test Reliability
Samiul Biswas; Anshu Narad – Journal of Education and Learning (EduLearn), 2025
Adolescence, a crucial period preceded by childhood and followed by adulthood, involves significant growth and developmental changes leading to various psychological challenges and aggressive tendencies. Several scales have been developed to measure aggression, but 12-item short form aggression questionnaire has been widely used for assessing…
Descriptors: Adolescents, Aggression, Psychometrics, Questionnaires
J. Weidlich; I. Jivet; S. Woitt; D. Orhan Göksün; J. Kraus; H. Drachsler – Assessment & Evaluation in Higher Education, 2025
Feedback literacy is gaining recognition as a key concept for understanding how engage with and learn from feedback in higher education. This study presents validity evidence for a refined version of the Student Feedback Literacy Instrument (SFLI), designed to measure the construct across two dimensions--feedback attitudes and feedback…
Descriptors: Feedback (Response), Knowledge Level, Measures (Individuals), Test Construction
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Zeynep Gül Dertli; Behiye Akçay; Ibrahim Delen; Nisa Nur Karabacak; Hakan Akçay; Bahadir Yildiz; Bora Senceylan; Gökhan Ince – International Journal on Social and Education Sciences, 2025
There is growing interest in climate change education, however there are very limited tools in elementary and secondary level to measure students' understanding. Departing from this need, this study aimed to adapt the Climate Change Stages of Change Questionnaire into Turkish and examine its validity and reliability. The adaptation process stated…
Descriptors: Foreign Countries, Turkish, Media Adaptation, Test Validity
Rodrigo Moreta-Herrera; Alberto Rodríguez-Lorenzana; Nazuri Santillán; Micaela Jiménez-Borja; Carlos José Jiménez-Mosquera; Xavier Oriol-Granado; Sergio Domínguez-Lara – Psychology in the Schools, 2025
The aim of this study is to find evidence of factorial validity, measure equivalence by gender, and internal consistency of the Mental Health Continuum-Short Form (MHC-SF) in a sample of Ecuadorian teenagers. The study uses a psychometric design to explore the validity and reliability of the measure. Participants of the study were 1154 teenagers…
Descriptors: Psychometrics, Mental Health, Questionnaires, Foreign Countries
Christian Myles; Laura Gorman; James F. X. Jones – Anatomical Sciences Education, 2025
Textbook anatomy depiction of the hepatobiliary tree is present in 55%-62% of the population. Misidentification of hepatobiliary variants can lead to bile duct injuries in cholecystectomies. A better understanding of variants has been cited as a key area for improvement in anatomy education. The aim of this study was to compare the effectiveness…
Descriptors: Computer Peripherals, Printing, Science Instruction, Teaching Methods

Peer reviewed
Direct link
