Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
National Institute for Excellence in Teaching, 2023
Aspiring teachers must develop an in-depth understanding of high-quality instructional practices. In order to prepare, instruct, and coach aspiring teachers, the National Institute for Excellence in Teaching (NIET) has developed a the NIET Aspiring Teacher Rubric (ATR) based on principles of excellence in instruction. This research brief…
Descriptors: Scoring Rubrics, Preservice Teachers, Test Construction, Test Validity
Tine S. Prøitz – Teaching in Higher Education, 2023
Drawing on the concepts of consistency, this study contributes to the discussion of study programme plans and the links between curriculum elements. The main argument is that a universal requirement of consistency is taken for granted in study programme planning, even though critics have noted a need for closer scrutiny and debate. The literature…
Descriptors: Curriculum Development, Reliability, College Curriculum, Alignment (Education)
Roberto Brazileio Paixão; Michael C. Rodriguez – Educational Research and Evaluation, 2023
The usefulness of evaluation is critical. Evaluation use occurs when, from its results or process, decisions are made about the program, it changes people's mindsets, or persuasive or legitimation actions happen (instrumental, conceptual, and symbolic uses respectively). Few quantitative evaluation use studies have been conducted in recent years.…
Descriptors: Measures (Individuals), College Faculty, Test Validity, Test Reliability
El Alaoui, Mohamed – IEEE Transactions on Learning Technologies, 2023
Classical evaluation methods, assessments, exams, and so forth accentuate the perception of one against all, professor versus learners. Including students in the assessment process, allows transforming the professor from an opponent to a critical friend, with the role of helping students to recognize both their strengths and weaknesses. However,…
Descriptors: Peer Evaluation, Educational Improvement, Test Validity, Test Reliability
Matthew J. Madison; Seungwon Chung; Junok Kim; Laine P. Bradshaw – Grantee Submission, 2023
Recent developments have enabled the modeling of longitudinal assessment data in a diagnostic classification model (DCM) framework. These longitudinal DCMs were developed to provide measures of student growth on a discrete scale in the form of attribute mastery transitions, thereby supporting categorical and criterion-referenced interpretations of…
Descriptors: Models, Cognitive Measurement, Diagnostic Tests, Classification
Dankiw, Kylie A.; Baldock, Katherine L.; Kumar, Saravana; Tsiros, Margarita D. – Australasian Journal of Early Childhood, 2021
Identifying and describing children's play behaviours is an important component of evaluating child development. The Behaviour Mapping Schedule is a direct observational tool which aims to describe and quantify children's play behaviours but is yet to undergo reliability testing. This study aimed to determine the intra- and inter-rater reliability…
Descriptors: Interrater Reliability, Classification, Child Behavior, Play
Starmer, Heather M.; Arrese, Loni; Langmore, Susan; Ma, Yifei; Murray, Joseph; Patterson, Joanne; Pisegna, Jessica; Roe, Justin; Tabor-Gray, Lauren; Hutcheson, Katherine – Journal of Speech, Language, and Hearing Research, 2021
Purpose: While flexible endoscopic evaluation of swallowing (FEES) is a common clinical procedure used in the head and neck cancer (HNC) population, extant outcome measures for FEES such as bolus-level penetration-aspiration and residue scores are not well suited as global patient-level endpoint measures of dysphagia severity in cooperative group…
Descriptors: Medical Evaluation, Physical Disabilities, Safety, Efficiency
Evans, Tanya; Mejía-Ramos, Juan Pablo; Inglis, Matthew – Educational Studies in Mathematics, 2022
Offering explanations is a central part of teaching mathematics, and understanding those explanations is a vital activity for learners. Given this, it is natural to ask what makes a good mathematical explanation. This question has received surprisingly little attention in the mathematics education literature, perhaps because the field has no…
Descriptors: Mathematics, Professional Personnel, Undergraduate Students, Mathematics Activities
Simner, Julia; Smees, Rebecca; Rinaldi, Louisa J.; Carmichael, Duncan A.; McDonald, Toby J. – Journal of Creative Behavior, 2022
Creative orientation is the extent to which different individuals are drawn toward creative activities (e.g., art, music). We know relatively little about child-level creative orientation given certain testing limitations. Adult tools often measure time spent engaged in creative pursuits, but this method is unsuitable for children because their…
Descriptors: Influences, Creativity, Creative Activities, Measures (Individuals)
Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022
Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…
Descriptors: Reliability, Scores, Scaling, Statistical Analysis
Hull, Michael M.; Jansky, Alexandra; Hopf, Martin – Physical Review Physics Education Research, 2022
Our study investigates whether confidence correlates with consistency in reasoning, specifically about radioactive decay. In prior work, we developed and tested a survey designed to measure consistency of student reasoning about radioactive decay by comparing responses to three prompts that are isomorphic, meaning that, despite having different…
Descriptors: Students, Self Esteem, Responses, Accuracy
Eric Jones – ProQuest LLC, 2022
The assessment of human performance is not a new phenomenon. We have evidence that people have been required to prove their worth dating back at least to the Epic of Gilgamesh. What has changed, at least on a large scale, is the importance given to quantitative evidence in the evaluation process. For example, many employers have begun subjecting…
Descriptors: Performance Based Assessment, Evaluation Methods, Semiotics, Theories
Vonna L. Hemmler; Allison W. Kenney; Susan Dulong Langley; Carolyn M. Callahan; E. Jean Gubbins; Shannon Holder – Grantee Submission, 2022
Though qualitative research has become more prevalent in practice over the last 30 years, there is still considerable uncertainty among researchers regarding how to ensure inter-rater consistency when teams are tasked with coding qualitative data. In this article, we offer an explanation of a methodology our qualitative team used to achieve…
Descriptors: Interrater Reliability, Coding, Guides, Data Collection
Niziolek, Caroline A.; Parrell, Benjamin – Journal of Speech, Language, and Hearing Research, 2021
Purpose: Speakers use auditory feedback to guide their speech output, although individuals differ in the magnitude of their compensatory response to perceived errors in feedback. Little is known about the factors that contribute to the compensatory response or how fixed or flexible they are within an individual. Here, we test whether manipulating…
Descriptors: Acoustics, Speech, Auditory Perception, Reliability
Uyumaz, Gizem; Sirganci, Gözde – International Journal of Contemporary Educational Research, 2021
In this study, the assumption of the equality of psychological distance between categories of rating scale was tested based on the number of categories and ability distributions. Category parameters were estimated by using generalized partial credit model. The data sets based on the conditions of categories counts and ability distributions were…
Descriptors: Rating Scales, Classification, Reliability, Likert Scales

Peer reviewed
Direct link
