Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
McCullough, Moira; Lipscomb, Stephen; Chiang, Hanley; Gill, Brian; Cheban, Irina – Regional Educational Laboratory Mid-Atlantic, 2016
This "Stated Briefly" report is a companion piece that summarizes the results of another report of the same name. This study examines the accuracy of performance ratings from the Framework for Leadership (FFL), Pennsylvania's tool for evaluating the leadership practices of principals and assistant principals. The study analyzed four key…
Descriptors: Leadership Effectiveness, Principals, Assistant Principals, Administrator Evaluation
Halpin, Peter F. – Society for Research on Educational Effectiveness, 2016
Recent research on multiple measures of teaching effectiveness has redefined the role of in-classroom observations in teacher evaluation systems. In particular, most states now mandate that teachers are observed on multiple occasions during the school year, and it is increasingly common that multiple raters are utilized across the different rating…
Descriptors: Models, Multivariate Analysis, Scoring Rubrics, Teacher Evaluation
Northwest Evaluation Association, 2016
Northwest Evaluation Association™ (NWEA™) is committed to providing partners with useful tools to help make inferences from Measures of Academic Progress® (MAP®) interim assessment scores. One important tool is the concordance table between MAP and state summative assessments. Concordance tables have been used for decades to relate scores on…
Descriptors: Tables (Data), Benchmarking, Scoring Formulas, Scores
Fernandes, Amanda Careena – ProQuest LLC, 2016
Assessment is an integral part of learning as it is used to gather information about a test-taker. Those in the field of academia, such as educational policy makers, instructors, and administrators, are able to use information gathered from tests to further instruction and learning decisions (Baker, 2006; Drianna, 2007; Kasper & Ross, 2013;…
Descriptors: Foreign Countries, Test Bias, Sex Fairness, English (Second Language)
Johnson, Evelyn S.; Moylan, Laura A.; Crawford, Angela; Ford, Jeremy W. – Grantee Submission, 2016
Observation systems can provide teachers with information about how to improve their instructional practice and can lead to improved student outcomes. However, most observation systems have not been designed to address issues specific to special education. An effective special education teacher evaluation system must measure and provide targeted,…
Descriptors: Special Education Teachers, Teacher Evaluation, Observation, Evidence Based Practice
Slotnik, William J.; Bugler, Daniel; Liang, Guodong – Mid-Atlantic Comprehensive Center at WestEd, 2016
The Maryland State Department of Education (MSDE) is leading and supporting the implementation of a Teacher and Principal Evaluation (TPE) system in all school districts in the state. This study examines the progress of four years of TPE implementation, 2013 to 2016. The study draws on nearly 60,000 survey responses from principals and teachers…
Descriptors: Teacher Evaluation, Administrator Evaluation, Principals, Program Implementation
Begay, Kristin – ProQuest LLC, 2016
Rating scales are often used as part of the evaluation process to diagnose autism spectrum disorder (ASD). Rating scales that are modeled after the experiences and understanding of the Caucasian American race may not reflect the unique experiences of individuals from other races or ethnicities. If parent ratings do not uniformly identify the ASD…
Descriptors: Autism, Pervasive Developmental Disorders, Rating Scales, Racial Differences
Eunice Eyitayo Olakanmi – Journal of Baltic Science Education, 2016
The purpose of this research was to develop a questionnaire that measures students' self and co-regulated learning processes during science learning. An instrument named Co-regulated Strategies for Learning Questionnaire (CRSLQ) was developed, and its validity and reliability were analysed. Factor analytic evidence from a sample (n=214) of science…
Descriptors: Test Construction, Questionnaires, Cooperative Learning, Science Instruction
Witwer, Andrea N.; Lecavalier, Luc; Norris, Megan – Journal of Autism and Developmental Disorders, 2012
The "Children's Interview for Psychiatric Syndromes-Parent Version" (P-ChIPS) is a structured psychiatric interview designed to assess the presence of psychiatric disorders in children and adolescents. This study examined the reliability and validity of the P-ChIPS in 61 youngsters (6- to 17-years-old) with Autism Spectrum Disorders. Reliability…
Descriptors: Autism, Interrater Reliability, Test Validity, Test Reliability
McKenzie, Karen; Paxton, Donna; Murray, George; Milanesi, Paula; Murray, Aja Louise – Research in Developmental Disabilities: A Multidisciplinary Journal, 2012
The study outlines the evaluation of an intellectual disability screening tool, the "Child and Adolescent Intellectual Disability Screening Questionnaire" ("CAIDS-Q"), with two age groups. A number of aspects of the reliability and validity of the "CAIDS-Q" were assessed for these two groups, including inter-rater reliability, convergent and…
Descriptors: Mental Retardation, Validity, Interrater Reliability, Measures (Individuals)
Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018
This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…
Descriptors: Language Tests, English, English (Second Language), Second Language Learning
McLean, Leigh; Connor, Carol McDonald – School Psychology Quarterly, 2018
Recent studies have observed connections among teachers' depressive symptoms and student outcomes; however, the specific mechanisms through which teachers' mental health characteristics operate in the classroom remain largely unknown. The present study used student-level observation methods to examine the relations between third-grade teachers' (N…
Descriptors: Elementary School Teachers, Grade 3, Depression (Psychology), Symptoms (Individual Disorders)
Long, Avizia Y.; Shin, Sun-Young; Geeslin, Kimberly; Willis, Erik W. – Language Learning & Technology, 2018
In response to the need for examples of test validation from which everyday language programs can benefit, this paper reports on a study that used Bachman's (2005) assessment use argument (AUA) framework to examine evidence to support claims made about the intended interpretations and uses of scores based on a new web-based Spanish language…
Descriptors: Second Language Instruction, Second Language Learning, Spanish, Computer Assisted Testing
Hendriana, Heris; Johanto, Tri; Sumarmo, Utari – Journal on Mathematics Education, 2018
This study uses a pretest-posttest experimental control group design to analyze the role of problem-based learning in students' mathematical problem-solving ability (MPSA) and self-confidence (MSC). The study involves 66 tenth-grade students, a mathematical problem-solving test, a mathematical self-confidence scale, and perception of…
Descriptors: Role, Problem Based Learning, Student Improvement, Mathematics Instruction
Kural, Faruk – Journal of Language and Linguistic Studies, 2018
The present paper, a study based on the midterm exam results of 53 university English prep-school students, examines the correlation between a direct writing test, measured holistically by multiple-trait scoring, and two indirect writing tests used in a competence exam, one of which is a multiple-choice cloze test and the other a rewrite test…
Descriptors: Writing Evaluation, Cloze Procedure, Comparative Analysis, Essays

