Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Meryem Konu Kadirhanogullari; Esra Özay Köse – Science Insights Education Frontiers, 2025
This study aims to develop a valid and reliable achievement test in accordance with the content framework of the 9th-grade Biology Course Curriculum published within the scope of the Turkish Century Maarif Model on the subject of "Organic Matter". The screening method was used for this purpose. The sample of the study consists of 258…
Descriptors: Science Tests, Test Construction, Grade 9, Biology
Eli Rohaeti; Nur Huda – European Journal of Science and Mathematics Education, 2025
Computational thinking (CT) is a thinking skill developed and integrated into curricula worldwide in recent years. However, limited assessment is one of the challenges in integrating CT skills into the educational curriculum of developing countries such as Indonesia. This study aimed to develop and validate a CT assessment instrument tailored for…
Descriptors: Computation, Thinking Skills, Science Education, Mathematics Education
Barbara Jane Cunningham; Peter Rosenbaum; Anastasia Nepotiuk; Nancy Thomas-Stonell – Communication Disorders Quarterly, 2024
This brief report presents interrater reliability data for the Focus on the Outcomes of Communication Under Six (FOCUS-34) between parents, and between parents and speech-language pathologists (SLPs). Reliability for all three raters combined was good to excellent across three assessments. Reliability for pairs of raters was variable but generally…
Descriptors: Interrater Reliability, Outcome Measures, Preschool Children, Parents
Laura Scholes; Sarah McDonald; Garth Stahl; Barbara Comber – British Educational Research Journal, 2024
Sourcing information related to socio-scientific issues requires sophisticated literacies to read and evaluate conflicting accounts often signified by disagreement among experts, multiple solutions or misinformation. Much of the previous work exploring how young people approach conflicting information has tended to focus on students in the…
Descriptors: Middle School Students, Information Sources, Internet, Search Strategies
Joana Soares; Maria do Céu Taveira; Paulo Cardoso; Ana Daniela Silva – International Journal for Educational and Vocational Guidance, 2024
The Student Career Construction Inventory measures students' adapting behaviors. The present study validates this inventory in a sample of 314 Portuguese college students. Measurement confirmatory factorial analysis indicates better fit for the 18-items measurement model, comparing to the 25-items model. Reliability and criterion-related analyses…
Descriptors: College Students, Test Reliability, Test Validity, Vocational Interests
Fuat Ozcan; Ali Meydan – Journal of Education in Science, Environment and Health, 2024
The goal of this study is to create the Zero Waste Attitude Scale, which will be used to determine the zero-waste attitude of social studies teacher candidates and to conduct validity and reliability studies. The data for the study were collected with a 5-point Likert-type form from pre-service teachers studying in the social studies teaching…
Descriptors: Test Construction, Preservice Teachers, Social Studies, Test Validity
Merve Sapmaz Atalar; Gençer Genç; Ahsen Erim; Beyza Pehlivan; Bertug Sakin; Serpil Bulut; Neila J. Donovan – International Journal of Language & Communication Disorders, 2024
Background: Communication of people with Parkinson's disease (PwPD) is negatively affected. For PwPD with communication difficulties, it is important to use self-assessment tools as a primary assessment approach to evaluate their perspectives on communication. It is also important to evaluate PwPDs with self-assessment scales in order to determine…
Descriptors: Communication Skills, Neurological Impairments, Self Evaluation (Individuals), Test Validity
Cameron Downing; Markéta Caravolas – Reading and Writing: An Interdisciplinary Journal, 2024
Spelling and handwriting are related skills which are critical for writing but are typically assessed separately. Doing so makes it more difficult to understand their respective development. We describe the creation and evaluation of a tool for their concurrent assessment: the Spelling and Handwriting Legibility Test (SaHLT). We examined whether…
Descriptors: Spelling, Handwriting, Writing Skills, Test Construction
Denise Swanson; Gerald Tindal – Behavioral Research and Teaching, 2024
This technical report provides an authoritative bibliographic resource of all the studies conducted on "easyCBM"® and published on the main website for Behavioral Research and Teaching under Publications (https://brtprojects.org). The "easyCBM"© software is a direct descendent of "Curriculum-based Measurement" (CBM)…
Descriptors: Bibliographies, Computer Software, Test Construction, Test Reliability
Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024
A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…
Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making
Terra Blevins – ProQuest LLC, 2024
While large language models (LLMs) continue to grow in scale and gain new zero-shot capabilities, their performance for languages beyond English increasingly lags behind. This gap is due to the "curse of multilinguality," where multilingual language models perform worse on individual languages than a monolingual model trained on that…
Descriptors: Multilingualism, Computational Linguistics, Second Languages, Reliability
Ozan Evrim Tunca; Evrim Genc Kumtepe; Sukru Torun; Yusuf Zafer Can Ugurhan – International Journal of Music Education, 2024
In Turkey, children are accepted to conservatory music departments after fourth grade and fine arts high school music departments after eighth grade by taking a musical talent test. For students with high musical aural skills to know about their potential and be directed to the related education institutions there needs to be a valid test. This…
Descriptors: Foreign Countries, Test Construction, Music Theory, Test Validity
Jesus M. Pichardo; Megan Foley-Nicpon; Danae Fields; Jung Eui Hong; Court – Journal of Autism and Developmental Disorders, 2024
Currently, there are no existing measures to screen for or diagnose Social (Pragmatic) Communication Disorder (SPCD). We conducted an exploratory factor analysis (EFA) of the Social Communication Disorder Screener (SCDS), a 14-item, parent-report measure based on the DSM-5 diagnostic criteria for SPCD. This EFA examined the internal consistency…
Descriptors: Communication Disorders, Screening Tests, Factor Analysis, Parents
Sanja Lestarevic; Marko Kalanj; Luka Milutinovic; Roberto Grujicic; Jelena Vasic; Jovana Maslak; Marija Mitkovic-Voncina; Natasa Ljubomirovic; Milica Pejovic-Milovancevic – Journal of Autism and Developmental Disorders, 2024
We aimed to evaluate the internal consistency of Stanford Social Dimensions Scale (SSDS) translated to Serbian and to test it against the Strengths and Difficulties Questionnaire (SDQ). The sample consisted of 200 patients (32% ASD) of the Institute of Mental Health in Belgrade, Serbia (68 females, 132 males, M[subscript age]=9.61, SD[subscript…
Descriptors: Foreign Countries, Questionnaires, Translation, Test Reliability
Elisabeth Rukmini; Raychana Assegaf – Journal of Education and Learning (EduLearn), 2024
The volunteer function inventory (VFI) is an assessment tool to measure individual volunteer motivation. VFI measures individual motivation to volunteer by examining the functional motives of each volunteer. This research aimed to adapt the VFI to the Indonesian language. VFI consists of 30 items divided into five dimensions. This study utilized a…
Descriptors: Foreign Countries, Volunteers, Measures (Individuals), Test Validity

Peer reviewed
Direct link
