Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Lorena Atarés Huerta; Juan Antonio Llorens Molina – Journal of Chemical Education, 2023
Despite the demonstrated learning benefits of peer evaluation, fears of teachers about its low reliability may restrict its use. In this study, the validity of peer assessment, in terms of agreement with the ratings of the teacher, has been tested in an organic chemistry course. The students were organized into small groups and commissioned to…
Descriptors: Introductory Courses, Organic Chemistry, College Students, Tutorial Programs
D. Betsy McCoach; Anthony J. Gambino; Scott J. Peters; Daniel Long; Del Siegle – Annenberg Institute for School Reform at Brown University, 2023
Teacher rating scales (TRS) are often used to make service eligibility decisions for exceptional learners. Although TRS are regularly used to identify student exceptionalism either as part of an informal nomination process or through behavioral rating scales, there is little research documenting the between-teacher variance in teacher ratings or…
Descriptors: Rating Scales, Student Evaluation, Academically Gifted, Ability Identification
Kelly Little; Yongyue Qi; Vanessa D. Jewell – Journal of Occupational Therapy Education, 2023
The Occupation-Centered Intervention Assessment (OCIA) was developed as a reflective tool for students to improve their comprehension of occupation-centered practice. Finding new and innovative ways to incorporate occupation-centered assignments can serve as a strategy to develop student integration of occupation-centered practice and allow…
Descriptors: Occupational Therapy, Allied Health Occupations Education, Interrater Reliability, Intervention
Cafer Kiliç; Ibrahim Keklik – Educational and Developmental Psychologist, 2025
Objective: There are multiple risk factors, such as individual features, parenting behaviours, and peer influences associated with the development of conduct problems. The Alabama Parenting Questionnaire-Child Form (APQ-CF) is a commonly used instrument to measure parenting behaviours. The study aimed to examine the factor structure of the…
Descriptors: Psychometrics, Questionnaires, Child Rearing, Culture Fair Tests
Jennifer Bronson; Jennifer Krajewski – State Education Standard, 2025
High-dosage tutoring or high-impact tutoring refers to a type of tutoring proven to be effective at closing learning gaps and improving student outcomes. This systemic approach uses an empirically supported model and is delivered by a consistent, trained tutor on a near-daily basis and takes place during the school day for 10-36 weeks. In the wake…
Descriptors: Tutoring, Incidence, Achievement Gap, Student Improvement
María Pilar Aparicio-Flores; Rosa Pilar Esteve-Faubel; Aitana Fernández-Sogorb; Carolina Gonzálvez – Education and Information Technologies, 2025
The use of Information and Communication Technologies (ICT) has been increasing in education. Despite its benefits, not everyone perceives its use with the same ease. This raises the need to observe the perceived ease of use (PEOU) of ICT among future teachers, which requires a valid and reliable instrument to measure this variable for the Spanish…
Descriptors: Spanish, Test Validity, Measures (Individuals), Usability
I Made Ratih Rosanawati; Warto Warto; Djono Djono; Hieronymus Purwanta – Educational Process: International Journal, 2025
Background/purpose: The challenges in implementing Indonesia's Merdeka Curriculum highlight the urgency for innovative educational approaches to address gaps in historical knowledge and develop 21st-century skills. This study explores the integration of the Among System into the Merdeka Curriculum framework to foster critical thinking,…
Descriptors: History Instruction, Teaching Methods, Student Projects, Active Learning
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025
As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…
Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025
Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…
Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction
Sibling Attachment Inventory for Senior Secondary School Students: Standardization in Indian Context
Sampurna Guha; Nimisha Beri – Journal of Education and Learning (EduLearn), 2025
The study aimed at exploring, confirming, and validating the factor structure of sibling attachment inventory (SAI) within the Indian cultural context. The hypothesis posits that SAI will show reliable sibling attachment measures, evaluating its psychometric properties with 250 students enrolled in Class XI within govt and private schools,…
Descriptors: Foreign Countries, Sibling Relationship, Test Validity, Factor Structure
Kyle Reardon; Dawn A. Rowe; Deanne K. Unruh – Assessment for Effective Intervention, 2025
Achieving successful employment outcomes is critical for individuals with disabilities (IWD). Employers' perspectives toward employability skills for entry-level employees with disabilities is an important factor in employment rates. This study investigated the psychometric properties of the Entry-Level Employability Skills and Behaviors (EL-ESB)…
Descriptors: Employer Attitudes, Job Skills, Employees, Disabilities
Damla Eyuboglu; Murat Eyuboglu; Ferhat Yaylaci; Baris Guller; Begum Sahbudak; Aslihan Avunduk; Onur Oktay Dagli; Seval Caliskan Pala; Didem Arslantas – Journal of Autism and Developmental Disorders, 2025
The aim of this study was to examine the reliability and validity of the Turkish version of the AFEQ for Turkish parents of children with ASD. The Turkish-translated version of the AFEQ was administered to 241 parents of children aged 2-12 years with ASD to examine the construct validity and internal consistencies. Parents completed the Autism…
Descriptors: Foreign Countries, Autism Spectrum Disorders, Family Relationship, Questionnaires
Andrea Lucky; Vanda Janštová; Petr Novotný; Jan Mourek – International Journal of STEM Education, 2025
Background: In an era of precipitous insect declines, effective entomology education is especially needed to support firsthand knowledge of nature. Understanding what students know and feel about insects is instrumental to teaching and curriculum development. This study describes the development and validation of a new survey instrument, EntoEdu,…
Descriptors: Entomology, Test Construction, Test Validity, Global Approach

Peer reviewed
Direct link
