Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Joyce M. W. Moonen-van Loon; Jeroen Donkers – Practical Assessment, Research & Evaluation, 2025
The reliability of assessment tools is critical for accurately monitoring student performance in various educational contexts. When multiple assessments are combined to form an overall evaluation, each assessment serves as a data point contributing to the student's performance within a broader educational framework. Determining composite…
Descriptors: Programming Languages, Reliability, Evaluation Methods, Student Evaluation
Rona L. Pogrund; Beth Jones; Maria Carlson – Journal of the American Academy of Special Education Professionals, 2025
Teacher workload leading to burnout is a significant problem facing many teachers today. The "Visual Impairment Scale of Staffing Pattern Analysis (VISSPA)" was designed to address the workload of vision professionals, teachers of students with visual impairments, and orientation and mobility specialists who work itinerantly with…
Descriptors: Itinerant Teachers, Visual Impairments, Students with Disabilities, Faculty Workload
Jiwon Baek; Jiwoo Kim; Hyeonseong Lee; Youn-Jeng Choi – SAGE Open, 2025
The rapid shift to online learning during the COVID-19 pandemic highlighted the need to equip future educators with essential skills for an online learning environment. This study aims to address this gap by developing and validating a scale to measure pre-service teacher competency in an online learning context using the scenario method. After a…
Descriptors: Test Construction, Test Validity, Preservice Teachers, Teacher Competencies
Olifa J. Asmara; Alina Morawska; April Hoang; Yulina Eva Riany – Infant and Child Development, 2025
Child self-regulation has been considered a valuable skill that shapes a child's future life trajectory. Parents have crucial roles in its development, making parenting interventions a strategic means to promote child self-regulation. Nonetheless, there are no available measures of child self-regulation suitable for assessing outcomes in…
Descriptors: Test Construction, Test Validity, Children, Preadolescents
Justin Wang; Ali Aijaz; Harika Dabbara; Deniz Goodman; Brett Cassidy; Lindsey Claus; Jessica Landau-Taylor; Minali Prasad; Vincent Baribeau; Han Xu; Glenn McFadden; Jonathan J. Wisco – Anatomical Sciences Education, 2025
There has been growing interest in incorporating point-of-care ultrasound (POCUS) into the curriculum of medical schools. In this study, we describe a condensed version of the 4-view cardiac assessment and assess its reliability and validity in pre-clerkship medical students at Boston University Aram V. Chobanian & Edward Avedisian School of…
Descriptors: Medical Students, Medical Education, Medical Schools, Diagnostic Tests
Büsra Arik Güngör; Mustafa Metin; Sibel Saraçoglu – International Journal of Technology in Education and Science, 2025
The aim of this study is to develop a valid and reliable scale that can be used to assess teachers' digital competencies. The research employed a survey design, one of the quantitative research methods. The sample of the study consisted of 463 teachers from various disciplines working in Kayseri during the 2023-2024 academic year. Initially, a…
Descriptors: Digital Literacy, Teacher Competencies, Test Construction, Test Validity
Beyza Nur Görken; Mehmet Fatih Kaya – International Journal of Assessment Tools in Education, 2025
This study aims to develop a financial literacy scale that can be used to assess the level of financial literacy among university students and the relevant age group. This study employs quantitative research methods and is a scale development study. The study group consists of 580 students enrolled at a state university during the 2023-2024…
Descriptors: Financial Literacy, Test Construction, Measures (Individuals), College Students
Erik Voss – Language Testing, 2025
An increasing number of language testing companies are developing and deploying deep learning-based automated essay scoring systems (AES) to replace traditional approaches that rely on handcrafted feature extraction. However, there is hesitation to accept neural network approaches to automated essay scoring because the features are automatically…
Descriptors: Artificial Intelligence, Automation, Scoring, English (Second Language)
Tian Kar Quar; Muhammad Hafiz Zawawie; Mohd Fadzil Nor Rashid; Wan Syafira Ishak; Mohd Hasrul Hosshan; Rafidah Mazlan; Wan Nur Hanim Mohd Yusoff; Teresa Y. C. Ching – Language, Speech, and Hearing Services in Schools, 2025
The validity and reliability of parental questionnaires for evaluating children's hearing in real-world listening environments have been reported in previous studies, but very little research on teacher-reported questionnaires on young children has been reported. Purpose: The present study aimed to examine the validity and reliability of the Malay…
Descriptors: Foreign Countries, Preschool Children, Auditory Evaluation, Preschool Teachers
Miguel Bernabé; Richard Merhi; Ana Lisbona; Francisco Palací – SAGE Open, 2025
Proactivity about a professional career predicts the employability of students in the job market. Having a scale will facilitate research and intervention using proactive behaviors. The objective of the present study is to adapt the Proactive Career Behavior questionnaire in Spanish distance university students. To that end, the questionnaire is…
Descriptors: Undergraduate Students, Distance Education, Student Attitudes, Career Choice
Renáta Kiss; Beno Csapó – International Journal of Early Childhood, 2025
Previous research has shown that phonological awareness is one of the most important prerequisites for early reading. Monitoring its development requires reliable, easy-to-use instruments especially in the last years of kindergarten. The present study aims to explore the potential for assessing phonological awareness and some of its subskills…
Descriptors: Phonological Awareness, Kindergarten, Reading Skills, Student Evaluation
Jing Zhou; Zhongbing Ding; Meng Zhang; Xinxin Wei; Wenjun An; Ziqiao Zhu; Peiling Guo; Li Qiu; Qiang Guo; Yinting Bai – International Journal of Language & Communication Disorders, 2025
Purpose: The purpose of this study is to develop an accurate test scale for vocabulary comprehension ability applicable to Mandarin-speaking preschool children aged 3-5. Methods: First, an initial scale was developed and evaluated using the expert consultation method. Subsequently, 490 typically developing 3-5-year-old Mandarin-speaking children…
Descriptors: Preschool Children, Test Validity, Test Reliability, Language Tests
Yoonseo Kim – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2025
This study explores the potential of OpenAI's ChatGPT-4 (gpt-4-0613) as an automated essay scoring (AES) tool in a trial involving 300 essays from an American university's academic English program placement test. Three prompting strategies (minimal/detailed rubric, require/not require rationale, and with/without scoring examples) were tested for…
Descriptors: Automation, Scoring, Artificial Intelligence, Placement Tests
Kjetil Egelandsdal; Eva Hartell; Jan-Ove Faerstad – Assessment & Evaluation in Higher Education, 2025
This study explores the practical feasibility of Adaptive Comparative Judgment (ACJ) as a summative assessment method in legal education, focusing on a Property and Intellectual Property Law course at a Norwegian university. Eight examiners assessed 300 student responses using ACJ within traditional time constraints, performing pairwise…
Descriptors: Summative Evaluation, Legal Education (Professions), Foreign Countries, Examiners
Huscroft-D'Angelo, Jacqueline; Wery, Jessica; Martin-Gutel, Jodie D.; Pierce, Corey; Loftin, Kara – Assessment for Effective Intervention, 2022
The Scales for Assessing Emotional Disturbance Screener--Third Edition (SAED-3) is a standardized, norm-referenced measure designed to identify school-age students at risk for emotional and behavioral problems. Four studies are reported to address the psychometric status of the SAED-3 Screener. Study 1 examined the internal consistency of the…
Descriptors: Emotional Disturbances, Test Reliability, Test Validity, Screening Tests

Peer reviewed
Direct link
