Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Johnson, Eboneé T.; Yaghmaian, Rana A.; Best, Andrew; Chan, Fong; Burrell, Reginald, Jr. – Rehabilitation Research, Policy, and Education, 2016
Purpose: The purpose of this study was to validate the 10-item version of the HIV Stigma Scale (HSS-10) in a sample of African Americans with HIV/AIDS. Method: One hundred and ten African Americans living with HIV/AIDS were recruited from 3 case management agencies in Baton Rouge, Louisiana. Measurement structure of the HSS-10 was evaluated using…
Descriptors: Acquired Immunodeficiency Syndrome (AIDS), Social Bias, African Americans, Factor Analysis
Romine, William L.; Todd, Amber N.; Clark, Travis B. – Science Education, 2016
We developed and validated a new instrument, called "Measuring Concept progressions in Acid-Base chemistry" (MCAB) and used it to better understand the progression of undergraduate students' understandings about acid-base chemistry. Items were developed based on an existing learning progression for acid-base chemistry. We used the Rasch…
Descriptors: Test Construction, Chemistry, Undergraduate Students, Scientific Concepts
Chang, Ming-Mei; Li, Anna; Feissner, Robert; Ahmad, Talal – Biochemistry and Molecular Biology Education, 2016
Reverse transcription quantitative polymerase chain reaction (RT-qPCR) is widely used in diagnosis and research to determine specific mRNA expressions in cells. As RT-qPCR applications increase, it is necessary to provide undergraduates hands-on experience of this modern technique. Here, we report a 3-week laboratory exercise using RT-qPCR to…
Descriptors: Genetics, Cytology, Science Laboratories, Laboratory Experiments
O'Brien, Edward J.; Cook, Anne E. – Discourse Processes: A multidisciplinary journal, 2016
Common to all models of reading comprehension is the assumption that a reader's level of comprehension is heavily influenced by their standards of coherence (van den Broek, Risden, & Husbye-Hartman, 1995). Our discussion focuses on a subcomponent of the readers' standards of coherence: the coherence threshold. We situate this discussion within…
Descriptors: Reading Comprehension, Models, Rhetoric, Reading Ability
Zumbo, Bruno D.; Hubley, Anita M. – Assessment in Education: Principles, Policy & Practice, 2016
Ultimately, measures in research, testing, assessment and evaluation are used, or have implications, for ranking, intervention, feedback, decision-making or policy purposes. Explicit recognition of this fact brings the often-ignored and sometimes maligned concept of consequences to the fore. Given that measures have personal and social…
Descriptors: Testing Programs, Testing Problems, Measurement Techniques, Student Evaluation
Osborne, Jonathan F.; Henderson, J. Bryan; MacPherson, Anna; Szu, Evan; Wild, Andrew; Yao, Shi-Ying – Journal of Research in Science Teaching, 2016
Given the centrality of argumentation in the Next Generation Science Standards, there is an urgent need for an empirically validated learning progression of this core practice and the development of high-quality assessment items. Here, we introduce a hypothesized three-tiered learning progression for scientific argumentation. The learning…
Descriptors: Item Response Theory, Science Instruction, Science Education, Thinking Skills
Brown, Jennifer L.; Sifuentes, Lucía Macías – Journal of Curriculum and Teaching, 2016
With growing numbers of Hispanic students enrolling in post-secondary school, there is a need to increase retention and graduation rates. The purpose of this study was to validate the Spanish adaptation of the Abbreviated Math Anxiety Scale (AMAS). The AMAS was translated and administered to 804 freshman students at a post-secondary institution in…
Descriptors: Mathematics Anxiety, Test Validity, Affective Measures, Spanish
Hunt, Tim; Jordan, Sally – Practitioner Research in Higher Education, 2016
Many practitioner researchers strive to understand which assessment practices have the best impact on learning, but in authentic educational settings, it can be difficult to determine whether one intervention, for example the introduction of an online quiz to a course studied by diverse students, is responsible for the observed effect. This paper…
Descriptors: Evaluation Research, Reliability, Research Problems, Correlation
Hoon, Teoh Sian; Satiman, Faziana – Asian Journal of University Education, 2016
The study was conducted to investigate the level of service quality for different dimensions based on the perception of private school teachers towards service quality. The investigation indicated the level of perception for the different dimensions, namely tangibles, responsiveness, empathy, reliability and assurance. A questionnaire on Service Quality…
Descriptors: Educational Quality, Private Schools, Teacher Attitudes, Questionnaires
Gorard, Stephen; Gorard, Jonathan – International Journal of Social Research Methodology, 2016
This brief paper introduces a new approach to assessing the trustworthiness of research comparisons when expressed numerically. The 'number needed to disturb' a research finding would be the number of counterfactual values that can be added to the smallest arm of any comparison before the difference or 'effect' size disappears, minus the number of…
Descriptors: Statistical Significance, Testing, Sampling, Attrition (Research Studies)
Chiang, Hanley; McCullough, Moira; Lipscomb, Stephen; Gill, Brian – National Center for Education Evaluation and Regional Assistance, 2016
States and districts need ways of measuring principal performance that correctly identify effective principals. Unfortunately, existing research offers little guidance to policymakers on which types of performance measures provide valid information about principals' contributions to student achievement. States have therefore had to develop…
Descriptors: Scores, Academic Achievement, Principals, Administrator Effectiveness
Larsen, Linda; Kohnen, Saskia; Nickels, Lyndsey; McArthur, Genevieve – Australian Journal of Learning Difficulties, 2015
Children who have difficulty learning to read are at increased risk for academic failure, poor self-esteem, anxiety and depression, and unemployment. To help reduce these risks, it is important to identify and treat weaknesses in a child's reading as early as possible. The aim of this study was to develop a valid and reliable comprehensive…
Descriptors: Phoneme Grapheme Correspondence, Reading Tests, Standardized Tests, Test Reliability
Aktay, Sayim – International Technology and Education Journal, 2018
The aim of this study is to develop a valid and reliable scale named "Smartphone Self-Efficacy Scale". The purpose of the scale is to determine the smartphone self-efficacy levels of people. The study was carried out with 520 pre-service teachers in the spring academic term, and 103 pre-service teachers in the academic year of 2018-2019…
Descriptors: Measures (Individuals), Reliability, Telecommunications, Handheld Devices
Martínez-González, A. E.; Piqueras, J. A. – Journal of Autism and Developmental Disorders, 2018
Restricted and repetitive behavior (RRB) is one of the two key diagnostic features of autism spectrum disorder (ASD). DSM-5 highlights the importance of severity-based diagnostic modifiers assigned on the basis of intensity of needed supports. Therefore, there is a need for available measures that assess the severity of RRB. The repetitive…
Descriptors: Pervasive Developmental Disorders, Autism, Clinical Diagnosis, Psychometrics
Grundin, Hans U. – Literacy, 2018
This paper aims to present a critical analysis of the Year 1 Phonics Screening Check (PSC), with special focus on the relationship between the UK Department for Education's policy-making and the evidence considered in the process of developing and evaluating the PSC. The reports from the in-house Standards and Testing Agency and from commissioned…
Descriptors: Foreign Countries, Criticism, Screening Tests, Phonics

