Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Berk, Ronald A. – Journal of Faculty Development, 2016
Recently, student outcomes have bubbled to the top of debates about how to evaluate teaching in community and liberal arts colleges, universities, and professional schools, but even more international attention has been riveted on how outcomes are being used to evaluate teachers and administrators K-12 (Harris, 2012; Rowen & Raudenbush, 2016;…
Descriptors: Value Added Models, Academic Achievement, Outcomes of Education, Teacher Evaluation
Lao, Huei-Chen – ProQuest LLC, 2016
In this quantitative study, a survey was developed and administered to middle and high school teachers to examine what factors motivated them to implement problem-based learning (PBL). Using Expectancy-Value Theory by Eccles et al. (1983) and Self-Determination Theory by Ryan and Deci (2000b) as the theoretical framework, this instrument measured…
Descriptors: Test Construction, Middle Schools, High Schools, Secondary School Teachers
Schoen, Robert C.; LaVenia, Mark; Bauduin, Charity; Farina, Kristy – Grantee Submission, 2016
The subject of this report is a pair of written, group-administered tests designed to measure the performance of grade 1 and grade 2 students at the beginning of the school year in the domain of number and operations. These tests build on previous versions field-tested in fall 2013 (Schoen, LaVenia, Bauduin, & Farina, 2016). Because the tests…
Descriptors: Elementary School Mathematics, Grade 1, Grade 2, Mathematics Tests
Ruffini, Stephen J.; Miskell, Ryan; Lindsay, Jim; McInerney, Maurice; Waite, Winsome – Regional Educational Laboratory Midwest, 2016
Many schools identified by states as needing improvement through their Elementary and Secondary Education Act waivers have selected Response to Intervention (RTI), a three-tiered instruction program sometimes referred to as tiered levels of instruction, as one of their main strategies for improving school performance and closing achievement gaps.…
Descriptors: Program Implementation, Fidelity, Response to Intervention, Public Schools
Ruffini, Steffen J.; Lindsay, Jim; Miskell, Ryan; Proger, Amy – Regional Educational Laboratory Midwest, 2016
Regional Educational Laboratory Midwest assisted Milwaukee Public Schools in developing a fidelity monitoring system for measuring schools' progress in implementing Response to Intervention (RTI). The study examined the ratings produced by that system to determine the system's reliability, schools' progress in implementing RTI, and whether ratings…
Descriptors: Program Implementation, Fidelity, Response to Intervention, Public Schools
Sabatini, John P.; Halderman, Laura K.; O'Reilly, Tenaha; Weeks, Jonathan P. – Grantee Submission, 2016
Traditional measures of reading ability designed for younger students typically focus on componential skills (e.g., decoding, vocabulary) and the items are often presented in a discrete and decontextualized format. The current study was designed to explore whether it was feasible to develop a more integrated, scenario-based assessment of…
Descriptors: Early Childhood Education, Reading Ability, Outcome Measures, Reading Comprehension
Candee, Allyson Joelle – ProQuest LLC, 2016
In this paper, the Classroom Learning Activities Checklist (CLAC) is proposed as a classroom observation measure that effectively captures the classroom environments and strategies that support self-regulation via task-oriented learning in young students. The CLAC's dimensionality, reliability, and concurrent and predictive validity evidence are…
Descriptors: Check Lists, Learning Activities, Metacognition, Evidence Based Practice
Olson, David – Journal of Marital and Family Therapy, 2011
Family Adaptability and Cohesion Evaluation Scale (FACES) IV was developed to tap the full continuum of the cohesion and flexibility dimensions from the Circumplex Model of Marital and Family Systems. Six scales were developed, with two balanced scales and four unbalanced scales designed to tap low and high cohesion (disengaged and enmeshed) and…
Descriptors: Measures (Individuals), Family Relationship, Reliability, Validity
Zimmerman, Donald W. – Journal of Educational and Behavioral Statistics, 2011
Many well-known equations in classical test theory are mathematical identities in populations of individuals but not in random samples from those populations. First, test scores are subject to the same sampling error that is familiar in statistical estimation and hypothesis testing. Second, the assumptions made in derivation of formulas in test…
Descriptors: Test Theory, Equations (Mathematics), Scores, Sampling
Intra- and Inter-Observer Reliability of the Trunk Impairment Scale for Children with Cerebral Palsy
Saether, Rannei; Jorgensen, Lone – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Standardized scales to evaluate qualities of trunk movements in children with dysfunction are sparse. An examination of the reliability of scales that may be useful in the clinic is important. The aim of this study was to examine the reliability of the Trunk Impairment Scale (TIS) for children with cerebral palsy (CP). Standardized scales are…
Descriptors: Cerebral Palsy, Measures (Individuals), Interrater Reliability, Psychomotor Skills
Tonmyr, Lil; Draca, Jasminka; Crain, Jennifer; MacMillan, Harriet L. – Child Abuse & Neglect: The International Journal, 2011
Background: Emotional/psychological child maltreatment (ECM) is a major public health problem with serious consequences including emotional and behavioral problems. Nevertheless, ECM is an understudied area. Objectives: The aims of this review are to identify measures of ECM and to evaluate their psychometric properties and utilities. We provide a…
Descriptors: Measures (Individuals), Antisocial Behavior, Child Abuse, Validity
Clemens, Elysia V.; Shipp, Adria; Kimbel, Tyler – Professional School Counseling, 2011
This article reports on the development and the exploration of the underlying psychometric properties of the School Counselor Self-Advocacy Questionnaire, a measure of skills school counselors can use to advocate for their roles and programs. An exploratory factor analysis (N = 188) suggested a unidimensional model, and a confirmatory factor…
Descriptors: Questionnaires, Self Advocacy, School Counselors, Factor Analysis
Yocum, Allison; McCoy, Sarah Westcott; Bjornson, Kristie F.; Mullens, Pamela; Burton, Gay Naganuma – Physical & Occupational Therapy in Pediatrics, 2010
A standardized protocol for a pediatric heel-rise test was developed and reliability and validity are reported. Fifty-seven children developing typically (CDT) and 34 children with plantar flexion weakness performed three tests: unilateral heel rise, vertical jump, and force measurement using handheld dynamometry. Intraclass correlation…
Descriptors: Test Validity, Test Reliability, Interrater Reliability, Psychomotor Skills
Latimer, Marvin E., Jr.; Bergee, Martin J.; Cohen, Mary L. – Journal of Research in Music Education, 2010
The purpose of this study was to investigate the reliability and perceived pedagogical utility of a multidimensional weighted performance assessment rubric used in Kansas state high school large-group festivals. Data were adjudicator rubrics (N = 2,016) and adjudicator and director questionnaires (N = 515). Rubric internal consistency was…
Descriptors: Music Activities, State Programs, Performance Based Assessment, Weighted Scores
Ritter, Nicola L. – Online Submission, 2010
It is important to explore score reliability in virtually all studies, because tests are not reliable. The present paper explains the most frequently used reliability estimate, coefficient alpha, so that the coefficient's conceptual underpinnings will be understood. Researchers need to understand score reliability because of the possible impact…
Descriptors: Scores, Reliability, Statistics, Misconceptions

Peer reviewed
Direct link
