Hodgson, John – English in Education, 2017
This article reflects on the values and practices of a revolutionary A level (senior secondary) course that achieved a high degree of validity and reliability in assessing the study of English literature. John Hodgson and Bill Greenwell were involved in its teaching and assessment from an early stage, and Greenwell's comments on an early draft of…
Descriptors: English Instruction, English Literature, Validity, Reliability
Liu, Ran; Koedinger, Kenneth R. K – International Educational Data Mining Society, 2017
Research in Educational Data Mining could benefit from greater efforts to ensure that models yield reliable, valid, and interpretable parameter estimates. These efforts have especially been lacking for individualized student-parameter models. We collected two datasets from a sizable student population with excellent "depth" -- that is,…
Descriptors: Data Analysis, Intelligent Tutoring Systems, Bayesian Statistics, Pretests Posttests
New York State Education Department, 2017
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Wallace, Colin S.; Prather, Edward E.; Duncan, Douglas K. – Astronomy Education Review, 2011
This is the second of five papers detailing our national study of general education astronomy students' conceptual and reasoning difficulties with cosmology. This article begins our quantitative investigation of the data. We describe how we scored students' responses to four conceptual cosmology surveys, and we present evidence for the inter-rater…
Descriptors: Astronomy, Scientific Concepts, College Students, Introductory Courses
Harrison, Gina L.; Ogle, Keira C.; Keilty, Megan – Canadian Journal of School Psychology, 2011
A reliability analysis was conducted on the Written Expression Scale from the Oral and Written Language Scales, (OWLS, Carrow-Woolfolk, 1996), with 68 ESL and 56 non-ESL kindergarten students. Interrater and internal consistency estimates for the Written Expression Scale were examined separately for each language group. Despite lower oral English…
Descriptors: Written Language, Vocabulary, Reliability, Measures (Individuals)
Rufino, Katrina A.; Boccaccini, Marcus T.; Guy, Laura S. – Assessment, 2011
Although reliability is essential to validity, most research on violence risk assessment tools has paid little attention to strategies for improving rater agreement. The authors evaluated the degree to which perceived subjectivity in scoring guidelines for items from two measures--the Psychopathy Checklist-Revised (PCL-R) and the Historical,…
Descriptors: Risk Management, Predictive Validity, Interrater Reliability, Scoring
Tait-McCutcheon, Sandi; Knewstubb, Bernadette – Assessment & Evaluation in Higher Education, 2018
The ability to reflect and self-assess are essential attributes for graduates to develop during their education. At many tertiary institutions, peer and lecturer-assessment contribute to summative assessment, but self-assessment, whilst recommended for development, does not. In order to make a case for the inclusion of self-assessment as a…
Descriptors: Self Evaluation (Individuals), Summative Evaluation, Preservice Teacher Education, College Faculty
Unal, Zafer; Bodur, Yasar; Unal, Aslihan – Contemporary Issues in Technology and Teacher Education (CITE Journal), 2012
The researchers in this study undertook development of a webquest evaluation rubric and investigated its reliability. The rubric was created using the strengths of the currently available webquest rubrics with improvements based on the comments provided in the literature and feedback received from educators. After the rubric was created, 23…
Descriptors: Test Construction, Test Reliability, Instructional Material Evaluation, Scoring Rubrics
Storch, Eric A.; Wood, Jeffrey J.; Ehrenreich-May, Jill; Jones, Anna M.; Park, Jennifer M.; Lewin, Adam B.; Murphy, Tanya K. – Journal of Autism and Developmental Disorders, 2012
The psychometric properties of the Pediatric Anxiety Rating Scale (PARS), a clinician-administered measure for assessing severity of anxiety symptoms, were examined in 72 children and adolescents diagnosed with an autism spectrum disorder (ASD). The internal consistency of the PARS was 0.59, suggesting that the items were related but not…
Descriptors: Test Reliability, Test Validity, Rating Scales, Anxiety
Omeroglu, Esra; Buyukozturk, Sener; Aydogan, Yasemin; Cakan, Mehtap; Cakmak, Ebru Kilic; Ozyurek, Arzu; Akduman, Gulumser Gultekin; Gunindi, Yunus; Kutlu, Omer; Coban, Aysel; Yurt, Ozlem; Kogar, Hakan; Karayol, Seda – Educational Sciences: Theory and Practice, 2015
This study aimed to determine and interpret norms of the Preschool Social Skills Rating Scale (PSSRS) teacher form. The sample included 224 independent preschools and 169 primary schools. The schools are distributed among 48 provinces and 3324 children were included. Data were obtained from the PSSRS teacher form. The validity and reliability…
Descriptors: Preschool Children, Preschool Education, Rating Scales, Norms
Prieß-Buchheit, Julia – Journal of Social Science Education, 2015
The Economic Actions in Education training module (EAE) teaches how to handle, use and judge external standardized tests in schools. The EAE programme was implemented in teacher training at the University of Kiel, because teachers are increasingly under external scrutiny and are being held accountable for student and school achievements. The EAE…
Descriptors: Standardized Tests, Best Practices, Preservice Teacher Education, Value Judgment
Rekalidou, Galini; Panitsides, Eugenia A. – Early Years: An International Journal of Research and Development, 2015
Within the knowledge era, Early Childhood Education has been attracting continued interest from scholars globally. In this regard, several studies have been conducted, with results identifying the multifarious attributes which members of the early years workforce should possess. But what does this imply for Early Childhood Teacher Education…
Descriptors: Foreign Countries, Early Childhood Education, Preservice Teachers, Graduate Students
A. C., John; Manabete, S. S. – Journal of Education and Practice, 2015
This study sought to determine the procedural influence on internal and external assessment scores of undergraduate research projects in vocational and technical education programmes in the university under study. A survey research design was used for the conduct of this study. The population consisted of 130 lecturers and 1,847 students in the…
Descriptors: Foreign Countries, Undergraduate Students, Student Research, Research Projects
Barakat, Asia; Othman, Afaf – Journal of Education and Practice, 2015
The present study aims to identify the relationship between the five-factor model of personality and its relationship to cognitive style (rush and prudence) and academic achievement among a sample of students. The study is based on descriptive approach for studying the relationship between the variables of the study, results and analysis. The…
Descriptors: Foreign Countries, Cognitive Style, Academic Achievement, Correlation
Ebadi, Mandana Rohollahzadeh; Abedalaziz, Nabeel; Saad, Mohd Rashid Mohd – Malaysian Online Journal of Educational Sciences, 2015
Lack of valid means of measuring explicit and implicit knowledge in acquisition of second language is a concern issue in investigations of explicit and implicit learning. This paper endeavors to validate the use of four tests (i.e., Untimed Judgment Grammatical Test, UJGT; Test of Metalinguistic Knowledge, TMK; Elicited Oral Imitation Test, EOIT;…
Descriptors: Knowledge Level, Psychometrics, Second Language Learning, Outcome Measures

