Publication Date
| In 2026 | 0 |
| Since 2025 | 621 |
| Since 2022 (last 5 years) | 3121 |
| Since 2017 (last 10 years) | 7362 |
| Since 2007 (last 20 years) | 15000 |
Descriptor
| Test Reliability | 15006 |
| Test Validity | 10245 |
| Reliability | 9748 |
| Foreign Countries | 7119 |
| Test Construction | 4807 |
| Validity | 4189 |
| Measures (Individuals) | 3872 |
| Factor Analysis | 3820 |
| Psychometrics | 3513 |
| Interrater Reliability | 3117 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1319 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 249 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Jennifer Bronson; Jennifer Krajewski – State Education Standard, 2025
High-dosage tutoring or high-impact tutoring refers to a type of tutoring proven to be effective at closing learning gaps and improving student outcomes. This systemic approach uses an empirically supported model and is delivered by a consistent, trained tutor on a near-daily basis and takes place during the school day for 10-36 weeks. In the wake…
Descriptors: Tutoring, Incidence, Achievement Gap, Student Improvement
María Pilar Aparicio-Flores; Rosa Pilar Esteve-Faubel; Aitana Fernández-Sogorb; Carolina Gonzálvez – Education and Information Technologies, 2025
The use of Information and Communication Technologies (ICT) has been increasing in education. Despite its benefits, not everyone perceives its use with the same ease. This raises the need to observe the perceived ease of use (PEOU) of ICT among future teachers, which requires a valid and reliable instrument to measure this variable for the Spanish…
Descriptors: Spanish, Test Validity, Measures (Individuals), Usability
I Made Ratih Rosanawati; Warto Warto; Djono Djono; Hieronymus Purwanta – Educational Process: International Journal, 2025
Background/purpose: The challenges in implementing Indonesia's Merdeka Curriculum highlight the urgency for innovative educational approaches to address gaps in historical knowledge and develop 21st-century skills. This study explores the integration of the Among System into the Merdeka Curriculum framework to foster critical thinking,…
Descriptors: History Instruction, Teaching Methods, Student Projects, Active Learning
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025
As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…
Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025
Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…
Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction
Sibling Attachment Inventory for Senior Secondary School Students: Standardization in Indian Context
Sampurna Guha; Nimisha Beri – Journal of Education and Learning (EduLearn), 2025
The study aimed at exploring, confirming, and validating the factor structure of sibling attachment inventory (SAI) within the Indian cultural context. The hypothesis posits that SAI will show reliable sibling attachment measures, evaluating its psychometric properties with 250 students enrolled in Class XI within govt and private schools,…
Descriptors: Foreign Countries, Sibling Relationship, Test Validity, Factor Structure
Kyle Reardon; Dawn A. Rowe; Deanne K. Unruh – Assessment for Effective Intervention, 2025
Achieving successful employment outcomes is critical for individuals with disabilities (IWD). Employers' perspectives toward employability skills for entry-level employees with disabilities is an important factor in employment rates. This study investigated the psychometric properties of the Entry-Level Employability Skills and Behaviors (EL-ESB)…
Descriptors: Employer Attitudes, Job Skills, Employees, Disabilities
Damla Eyuboglu; Murat Eyuboglu; Ferhat Yaylaci; Baris Guller; Begum Sahbudak; Aslihan Avunduk; Onur Oktay Dagli; Seval Caliskan Pala; Didem Arslantas – Journal of Autism and Developmental Disorders, 2025
The aim of this study was to examine the reliability and validity of the Turkish version of the AFEQ for Turkish parents of children with ASD. The Turkish-translated version of the AFEQ was administered to 241 parents of children aged 2-12 years with ASD to examine the construct validity and internal consistencies. Parents completed the Autism…
Descriptors: Foreign Countries, Autism Spectrum Disorders, Family Relationship, Questionnaires
Andrea Lucky; Vanda Janštová; Petr Novotný; Jan Mourek – International Journal of STEM Education, 2025
Background: In an era of precipitous insect declines, effective entomology education is especially needed to support firsthand knowledge of nature. Understanding what students know and feel about insects is instrumental to teaching and curriculum development. This study describes the development and validation of a new survey instrument, EntoEdu,…
Descriptors: Entomology, Test Construction, Test Validity, Global Approach
Kelemu Zelalem Berhanu – International Journal of Educational Management, 2025
Purpose: Pedagogical leadership (PL) has been regarded as the best leadership style in the education sector. Thus, the aim of this study was to develop and validate a pedagogical leadership scale (PLS). Design/methodology/approach: Two distinct approaches (inductive and deductive) were utilized. First, a review of the literature was conducted, and…
Descriptors: Test Construction, Test Validity, Instructional Leadership, Measures (Individuals)
Umi Farisiyah; Edi Istiyono; Aminuddin Hassan; Nur Hidayanto P. S. Putro; Yulia Ayriza; Farida Agus Setiawati; Erwin Syahril Mubarok – Journal of Education and Learning (EduLearn), 2025
Concerning the Indonesian government's endeavors to safeguard Indonesia's standardized language and national language, numerous initiatives have been undertaken to uphold the disposition and consciousness of the Indonesian youth towards the language since they are the future custodians of the nation. This paper aims to present the psychometric…
Descriptors: Psychometrics, Measures (Individuals), Foreign Countries, Native Language
Daniel A. DeCino; Steven R. Chesnut; Phillip L. Waalkes; Reed N. Keen – Measurement and Evaluation in Counseling and Development, 2025
Objective: The purpose of this study was to develop and validate the Counselor Self-Reflection Inventory (CSRI) from a Transformative Learning Theory framework for counselors, and counselors-in-training to use in clinical and training settings. Method: A sample of 351, mostly female (86.89%), white (85.19%), counselors with MS or MA (88.08%)…
Descriptors: Test Construction, Test Validity, Test Reliability, Attitude Measures
Leonidas Gavrilas; Konstantinos T. Kotsis – International Journal of Research & Method in Education, 2025
Educational robotics activities have proven instrumental in creating a dynamic learning environment that provides students with hands-on experiences, aligning with the interdisciplinary principles advocated by STEM education. The integration of educational robotics into the educational process significantly impacts students of various age groups,…
Descriptors: Test Construction, Test Validity, Surveys, Measures (Individuals)

Peer reviewed
Direct link
