Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022
In many performance assessments, one or two raters from the complete rater pool scores each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…
Descriptors: Evaluators, Bias, Identification, Performance Based Assessment
Talan, Teri N.; Bella, Jill M.; Bloom, Paula Jorde – Teachers College Press, 2022
The "Program Administration Scale" (PAS) is designed to reliably measure and improve the leadership and management practices of center-based programs--the only instrument of its kind to focus exclusively on organization-wide administrative issues. In the third edition, the authors share updated information supporting the reliability and…
Descriptors: Program Administration, Evaluation Methods, Leadership, Early Childhood Education
Bottoms, Bryndle Laine – ProQuest LLC, 2022
Teacher evaluations are routinely conducted across the United States for licensure and professional development supports. However, there is limited research on the interrater reliability of these evaluation assessment systems, despite federal recommendations (Graham et al., 2012). This research explores the systematic approach to interrater…
Descriptors: Interrater Reliability, Early Childhood Teachers, Teacher Evaluation, Performance Based Assessment
Jayden J. Lee – ProQuest LLC, 2022
The functional neuroanatomy of language localization in dyslexia has primarily been studied in the context of reading. However, dyslexia is sometimes referred to as a "language-based learning disability," yet the functional signature of the core language comprehension network in dyslexia is far less understood. This thesis presents a…
Descriptors: Dyslexia, Brain Hemisphere Functions, Comparative Analysis, Speech Communication
Natalie R. Charamut; Sarah J. Racz; Mo Wang; Andres De Los Reyes – Grantee Submission, 2022
Accurately assessing youth mental health involves obtaining reports from multiple informants who typically display low levels of correspondence. This low correspondence may reflect "situational specificity." That is, youth vary as to where they display mental health concerns and informants vary as to where and from what perspective they…
Descriptors: Youth, Parents, Mental Health, Researchers
Gazal Bharara; Scott Duncan – Psychology in the Schools, 2024
The transition to secondary school can be a challenging period for adolescents. Although several questionnaires exist to measure transition-related concerns, there is a need to develop a comprehensive survey for assessing the knowledge and skills that adolescents require to adapt effectively to a new school. Thus, the purpose of this study was to…
Descriptors: Surveys, Psychometrics, Construct Validity, Test Reliability
Minjin Kim; Xiaofei Lu – Modern Language Journal, 2024
The effects of learner- and task-related variables on second language (L2) writing syntactic complexity (SC) have been extensively investigated. However, previous research has rarely assessed the reliability of computational tools for analyzing the SC of L2 spoken production, and we know less about the effects of such variables on L2 speaking SC.…
Descriptors: Speech Communication, Syntax, English (Second Language), Second Language Learning
Strom, Paris S.; Strom, Robert D.; Wang, Chih-hsuan – International Journal of Educational Reform, 2024
Employers agree that high school graduates lack teamwork skills needed for workplace productivity. The lag in student readiness to meet employer expectations urges school reforms that implicate group interaction, empirical assessment, and reporting of teamwork competencies. The "Teamwork Skills Inventory" (TSI) evaluated how well…
Descriptors: Peer Evaluation, Self Evaluation (Individuals), Teamwork, High School Students
Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024
In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…
Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)
David Meechan; Zeta Williams-Brown; Tracy Whatmore; Simon Halfhead – Education 3-13, 2024
The paper focuses on findings from research that investigated teachers' and key stakeholders' perspectives on the use of Reception Baseline Assessment. Data collection was carried out in 2021-2022, which was the year this assessment was introduced into Reception classes in England. In total, 70 teachers and key stakeholders from 47 Local…
Descriptors: Foreign Countries, Preschool Education, Preschool Teachers, Achievement Tests
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Kathleen D. Dyer; Dermot Donnelly-Hermosillo – Research in Higher Education, 2024
This study aimed to demonstrate how one university worked to overcome some of the measurement problems associated with legacy student rating instruments through the creation and investigation of a new student rating instrument based on the most current scholarship on teaching and learning. Measurement problems with legacy instruments include…
Descriptors: Case Studies, Universities, Teacher Student Relationship, Student Evaluation of Teacher Performance
João M. Santos – Research Evaluation, 2024
The allocation of scientific funding through grant programs is crucial for research advancement. While independent peer panels typically handle evaluations, their decisions can lean on personal preferences that go beyond the stated criteria, leading to inconsistencies and potential biases. Given these concerns, our study employs a novel method,…
Descriptors: Grants, Program Proposals, Funding Formulas, Scientific Research
Zachary J. Roman; Patrick Schmidt; Jason M. Miller; Holger Brandt – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Careless and insufficient effort responding (C/IER) is a situation where participants respond to survey instruments without considering the item content. This phenomena adds noise to data leading to erroneous inference. There are multiple approaches to identifying and accounting for C/IER in survey settings, of these approaches the best performing…
Descriptors: Structural Equation Models, Bayesian Statistics, Response Style (Tests), Robustness (Statistics)
Fatma Özgün Öztürk; Ganime Can Gür – International Journal of Assessment Tools in Education, 2024
This research aims to develop an instrument for the evaluation of impulsivity traits in children and to examine the psychometric features of the developed scale. The process of developing the scale involved three main phases: namely, item generation, evaluation of content validity, and analysis of psychometric properties. The study sample…
Descriptors: Construct Validity, Content Validity, Test Reliability, Psychometrics

Peer reviewed
Direct link
