Publication Date
| In 2026 | 5 |
| Since 2025 | 627 |
| Since 2022 (last 5 years) | 2564 |
| Since 2017 (last 10 years) | 5599 |
| Since 2007 (last 20 years) | 9195 |
Descriptor
| Test Validity | 21771 |
| Test Reliability | 10011 |
| Test Construction | 5891 |
| Foreign Countries | 4955 |
| Psychometrics | 2963 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2377 |
| Higher Education | 2250 |
| Evaluation Methods | 2085 |
| College Students | 1813 |
| Correlation | 1723 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 807 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 172 |
| Spain | 169 |
| United Kingdom | 160 |
| Netherlands | 159 |
| California | 156 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Greenlees, Jane – Mathematics Education Research Group of Australasia, 2010
Standardised testing has received a lot of political and public attention recently in Australia. This paper describes the sense-making of Year 3 students as they interpret items from the 2008 NAPLAN. Results show that student performance changed dramatically when the terminology of an item was modified and subsequently were not a true indication…
Descriptors: Foreign Countries, Mathematics Instruction, Mathematics Achievement, Mathematics Curriculum
Pakarinen, Eija; Lerkkanen, Marja-Kristiina; Poikkeus, Anna-Maija; Kiuru, Noona; Siekkinen, Martti; Rasku-Puttonen, Helena; Nurmi, Jari-Erik – Early Education and Development, 2010
Research Findings: This study examined the validity and reliability of the Classroom Assessment Scoring System (CLASS; R. C. Pianta, K. M. La Paro, & B. K. Hamre, 2008) in Finnish kindergartens. A pair of trained observers used the CLASS to observe 49 kindergarten teachers (47 female, 2 male) on two different days. Questionnaires measuring…
Descriptors: Scoring, Factor Analysis, Kindergarten, Foreign Countries
Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010
Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…
Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)
Bagner, Daniel M.; Boggs, Stephen R.; Eyberg, Sheila M. – Education and Treatment of Children, 2010
This study examined the psychometric properties of the Revised Edition of the School Observation Coding System (REDSOCS). Participants were 68 children ages 3 to 6 who completed parent-child interaction therapy for Oppositional Defiant Disorder as part of a larger efficacy trial. Interobserver reliability on REDSOCS categories was moderate to…
Descriptors: Behavior Problems, Student Behavior, Teacher Evaluation, Test Reliability
Artdej, Romklao; Ratanaroutai, Thasaneeya; Coll, Richard Kevin; Thongpanchang, Tienthong – Research in Science & Technological Education, 2010
This study involved the development of a two-tier diagnostic instrument to assess Thai high school students' understanding of acid-base chemistry. The acid-base diagnostic test (ABDT) comprising 18 items was administered to 55 Grade 11 students in a science and mathematics programme during the second semester of the 2008 academic year. Analysis of…
Descriptors: Classification, Comprehension, Diagnostic Tests, Chemistry
Llewellyn, Gwynnyth; Bundy, Anita; Mayes, Rachel; McConnell, David; Emerson, Eric; Brentnall, Jennie – Journal of Applied Research in Intellectual Disabilities, 2010
Background: This study describes the development and trialling of the Family Life Interview (FLI), a clinical tool designed to examine sustainability of family routines. Materials and Methods: The FLI, a self-report instrument completed by a parent within a semi-structured practitioner--parent interview, was administered to 118 parents, with…
Descriptors: Family Life, Construct Validity, Test Validity, Family Environment
Hedley, Darren; Young, Robyn; Angelica, Maria; Gallegos, Juarez; Salazar, Carlos, Marcin – Autism: The International Journal of Research and Practice, 2010
A Spanish translation of the Autism Detection in Early Childhood (ADEC-SP) was administered to 115 children aged 15-73 months in Mexico. In Phase 1, children with Autistic Disorder (AD), a non-Pervasive Developmental Disorder (PDD) diagnosis or typical development were assessed with the ADEC-SP by a clinician blind to the child's diagnostic…
Descriptors: Incidence, Autism, Mental Disorders, Rating Scales
Lee, Miyoung; Peterson, Jana J.; Dixon, Alicia – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The purpose of this study was to investigate the construct validity of the Self-Efficacy/Social Support for Activity for persons with Intellectual Disability (SE/SS-AID) scales developed by Peterson, Peterson, Lowe, & Nothwehr (2009). A total of 146 participants with intellectual disabilities completed 6 self-efficacy (SE) items and 18 social…
Descriptors: Physical Activities, Self Efficacy, Mental Retardation, Construct Validity
Hohlfeld, Tina N.; Ritzhaupt, Albert D.; Barron, Ann E. – Journal of Research on Technology in Education, 2010
This article provides an overview of the development and validation of the Student Tool for Technology Literacy (ST[superscript 2]L). Developing valid and reliable objective performance measures for monitoring technology literacy is important to all organizations charged with equipping students with the technology skills needed to successfully…
Descriptors: Test Validity, Ability Grouping, Grade 8, Test Construction
Almond, Patricia; Winter, Phoebe; Cameto, Renee; Russell, Michael; Sato, Edynn; Clarke-Midura, Jody; Torres, Chloe; Haertel, Geneva; Dolan, Robert; Beddow, Peter; Lazarus, Sheryl – Journal of Technology, Learning, and Assessment, 2010
This paper represents one outcome from the "Invitational Research Symposium on Technology-Enabled and Universally Designed Assessments," which examined technology-enabled assessments (TEA) and universal design (UD) as they relate to students with disabilities (SWD). It was developed to stimulate research into TEAs designed to make tests…
Descriptors: Disabilities, Inferences, Computer Assisted Testing, Alternative Assessment
Wuang, Yee-Pay; Wang, Li-Chen; Su, Chwen-Yng – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The aim of this study was to examine the validation of the Hooper Visual Organization Test (HVOT) for use in children by testing for item fit, unidimensionality, item hierarchy, reliability, and screening capacity. A modified scoring system was devised for the HVOT so that children received some credit for being able to describe the function of…
Descriptors: Test Bias, Down Syndrome, Scoring, Item Response Theory
Norman, Antony D. – Theory Into Practice, 2010
This article surveys efforts at the national and international level to define and assess accomplished teaching with particular attention devoted to how assessments of accomplished teaching connect to student learning. The author finds that most assessments are based on aspects of teaching that, presumably, come together as accomplished teaching.…
Descriptors: National Standards, Teaching Skills, Best Practices, Barriers
Rojahn, Johannes; Rowe, Ellen W.; Macken, Jennifer; Gray, Amy; Delitta, Denise; Booth, Alison; Kimbrell, Kelly – Journal of Mental Health Research in Intellectual Disabilities, 2010
This study was conducted to assess the psychometric properties of 2 assessment instruments, the "Behavior Problems Inventory-01" ("BPI-01"; Rojahn, Matson, Lott, Esbensen, & Smalls, 2001) and the "Nisonger Child Behavior Rating Form" ("NCBRF"; Aman, Tass, Rojahn, & Hammer, 1996). The sample consisted…
Descriptors: Behavior Problems, Mental Retardation, Test Validity, Factor Structure
Roberts, William L.; McKinley, Danette W.; Boulet, John R. – Advances in Health Sciences Education, 2010
Due to the high-stakes nature of medical exams it is prudent for test agencies to critically evaluate test data and control for potential threats to validity. For the typical multiple station performance assessments used in medicine, it may take time for examinees to become comfortable with the test format and administrative protocol. Since each…
Descriptors: Student Evaluation, Pretests Posttests, Licensing Examinations (Professions), Scores
Olson, Lynn – Education Week, 2008
Starting next month, 300 schools nationwide will take part in a field test of a new way to gauge principals' effectiveness. Known as VAL-ED, for the Vanderbilt Assessment of Leadership in Education, the tool has been developed by a team of leadership and testing experts at Vanderbilt University and the University of Pennsylvania to measure…
Descriptors: Principals, Administrator Effectiveness, Field Tests, Administrator Evaluation

Peer reviewed
Direct link
