Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Greer, Fred W.; DiStefano, Christine A.; Liu, Jin; Cain, Leia K. – Assessment for Effective Intervention, 2015
The aim of this study was to provide psychometric evidence related to the "Behavioral and Emotional Screening System Teacher Rating Scale-Preschool" form's (BESS TRS-P) ability to identify emerging problems in preschool children. Reliability and validity associated with screener scores were compared by analyzing teacher ratings of…
Descriptors: Rating Scales, Psychometrics, Preschool Children, Behavior Problems
Blondin, Carolyn A.; Voils, Kyle; Galyon, Charles E.; Williams, Robert L. – Journal on Excellence in College Teaching, 2015
Concepts from the Response-to-Intervention (RTI) Model were used to promote a successful course outcome for students at risk for making low grades in an entry-level college course. The first exam served as a universal screener to identify students who could potentially benefit from RTI assistance. The researchers developed a tiered coaching…
Descriptors: Response to Intervention, Models, At Risk Students, Coaching (Performance)
Gresham, Frank M.; Dart, Evan H.; Collins, Tai A. – School Psychology Review, 2017
The concept of treatment integrity is an essential component to data-based decision making within a response-to-intervention model. Although treatment integrity is a topic receiving increased attention in the school-based intervention literature, relatively few studies have been conducted regarding the technical adequacy of treatment integrity…
Descriptors: Fidelity, Generalizability Theory, Observation, Measurement Techniques
McNicholas, Patrick J.; Floyd, Randy G. – Canadian Journal of School Psychology, 2017
The Reynolds Intellectual Assessment Scales, Second Edition (RIAS-2; Reynolds & Kamphaus, 2015) is an intelligence test for those aged 3 to 94 years. It contains eight subtests designed to assess general intelligence, verbal and nonverbal intelligence, memory, and processing speed. The two subtests targeting processing speed are new to the…
Descriptors: Intelligence Tests, Verbal Ability, Nonverbal Ability, Memory
Floman, James L.; Hagelskamp, Carolin; Brackett, Marc A.; Rivers, Susan E. – Journal of Psychoeducational Assessment, 2017
Classroom observations increasingly inform high-stakes decisions and research in education, including the allocation of school funding and the evaluation of school-based interventions. However, trends in rater scoring tendencies over time may undermine the reliability of classroom observations. Accordingly, the present investigations, grounded in…
Descriptors: Observation, Bias, Psychological Patterns, Grade 5
Russ, Laura B.; Webster, Collin A.; Beets, Michael W.; Egan, Catherine; Weaver, Robert Glenn; Harvey, Rachel; Phillips, David S. – Health Education & Behavior, 2017
National attention on whole-of-school approaches to decrease children's sedentary behavior and increase physical activity includes movement integration (MI) in classrooms. The purpose of this study was to describe instrument development, reliability, and validity of the System for Observing Student Movement in Academic Routines and Transitions…
Descriptors: Classroom Observation Techniques, Physical Activity Level, Reliability, Validity
Caspersen, Janna R.; Van Holt, Tracy; Johnson, Jeffrey C. – Field Methods, 2017
This article offers a way to measure agreement in participatory mapping. We asked subject matter experts (SMEs) to draw where Sudanese ethnic groups were located on a map. We then used an eigenanalysis approach to determine whether SMEs agreed on the location of ethnic groups. We used minimum residual factor analysis to assess the extent of…
Descriptors: Measurement Techniques, Expertise, Maps, Ethnic Groups
Driller, Matthew; Brophy-Williams, Ned; Walker, Anthony – Measurement in Physical Education and Exercise Science, 2017
The purpose of the present study was to determine the reliability of a 5km run test on a motorized treadmill. Over three consecutive weeks, 12 well-trained runners completed three 5km time trials on a treadmill following a standardized warm-up. Runners were partially-blinded to their running speed and distance covered. Total time to complete the…
Descriptors: Athletics, Physical Activities, Athletes, Test Reliability
Schoenfeld, Alan H. – Assessment in Education: Principles, Policy & Practice, 2017
The challenge of "educational" assessments--assessments that advance the purposes of learning and instruction--is to provide useful information regarding students' progress towards the goals of instruction in ways that are reliable and not idiosyncratic. In this commentary, the author indicates that the challenges are actually more…
Descriptors: Educational Assessment, Learning, Student Evaluation, Psychometrics
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability
Deaton, Cynthia C. M.; Malloy, Jacquelynn A. – International Journal of Adult Vocational Education and Technology, 2017
Design-based case studies address research questions that involve instructional innovations within a bounded system. This blend of case study and design-based research provides a systematic approach to examining instructional innovations that are bounded by perspective, context, and time. Design-based case studies provide a framework for engaging…
Descriptors: Blended Learning, Teaching Methods, Case Studies, Instructional Innovation
Cillessen, Antonius H. N.; Marks, Peter E. L. – New Directions for Child and Adolescent Development, 2017
Although peer nomination measures have been used by researchers for nearly a century, common methodological practices and rules of thumb (e.g., which variables to measure; use of limited vs. unlimited nomination methods) have continued to develop in recent decades. At the same time, other key aspects of the basic nomination procedure (e.g.,…
Descriptors: Peer Relationship, Research Methodology, Decision Making, Data Collection
Cipriano, Robert E.; Buller, Jeffrey L. – Change: The Magazine of Higher Learning, 2017
There are two primary means to prevent the abuse of collegiality and transform it into a shield to protect the most vulnerable. First, colleges and universities should follow the examples of their peers by developing clear definitions of what types of behavior constitute collegiality and what types of activity are protected as academic freedom or…
Descriptors: Collegiality, College Faculty, Definitions, Academic Freedom
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017
There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…
Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests