Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Brennan, Robert L. – Measurement: Interdisciplinary Research and Perspectives, 2015
Koretz, in his article published in this issue, provides compelling arguments that the high stakes currently associated with accountability testing lead to behavioral changes in students, teachers, and other stakeholders that often have negative consequences, such as inflated scores. Koretz goes on to argue that these negative consequences require…
Descriptors: Accountability, High Stakes Tests, Behavior Change, Student Behavior
Kuçukosmanoglu, Hayrettin Onur – Educational Research and Reviews, 2015
The main purpose of this study is to develop a scale to determine students' attitude levels on individual instruments and individual instrument courses in instrument training, which is an important dimension of music education, and to conduct a validity-reliability research of the scale that has been developed. The scale consists of 16 items. The…
Descriptors: Foreign Countries, Music, Music Education, Musical Instruments
Mukherjee, Mousumi – Policy Futures in Education, 2015
The aim of the first "Global Conclave of Young Scholars of Indian Education" held in New Delhi in 2011 was to help build bridges for research collaboration primarily for young scholars of Indian education to collaborate with their partners across borders. This paper draws on an "intrinsic case study" of this "global…
Descriptors: Foreign Countries, Research, Cooperation, Global Approach
Yang, Hongfei; Hong, Chaoqin; Tao, Xiaodan; Zhu, Lingyi – Measurement and Evaluation in Counseling and Development, 2015
This study examined the structure, reliability, and validity of the revised Chinese version of the Child and Adolescent Perfectionism Scale (N = 933). The results confirmed the four-factor structure of the Chinese version of the Child and Adolescent Perfectionism Scale. Implications, limitations, and suggestions for future research are provided.
Descriptors: Foreign Countries, Personality Measures, Personality Traits, Test Reliability
Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2015
The present study examined measurement invariance across gender and gender differences on two measures of test anxiety developed for U.S. middle and high school, and college students. It was hypothesized that measurement invariance and gender differences would be found on the two measures of test anxiety, suggesting no separate scoring system is…
Descriptors: Test Anxiety, Affective Measures, Gender Differences, Test Bias
Lee, Cheng-Yuan – Distance Education, 2015
This study was designed to investigate whether course content self-efficacy, online technologies self-efficacy, and task value change over the course of a semester. Sixty-nine participating students from four classes provided data through two instruments: (1) the self-efficacy instrument and (2) the task value instrument. Students' self-efficacy…
Descriptors: Self Efficacy, Online Courses, Course Content, Measures (Individuals)
Rapp, John T.; Carroll, Regina A.; Stangeland, Lindsay; Swanson, Greg; Higgins, William J. – Behavior Modification, 2011
The authors evaluated the extent to which interobserver agreement (IOA) scores, using the block-by-block method for events scored with continuous duration recording (CDR), were higher when the data from the same sessions were converted to discontinuous methods. Sessions with IOA scores of 89% or less with CDR were rescored using 10-s partial…
Descriptors: Intervals, Sampling, Comparative Analysis, Measures (Individuals)
Grodberg, David; Weinger, Paige M.; Kolevzon, Alexander; Soorya, Latha; Buxbaum, Joseph D. – Journal of Autism and Developmental Disorders, 2012
The Autism Mental Status Examination (AMSE) described here is an eight-item observational assessment that prompts the observation and recording of signs and symptoms of autism spectrum disorders (ASD). The AMSE is intended to take place seamlessly in the context of a clinical exam and produces a total score. Subjects were independently…
Descriptors: Observation, Autism, Interrater Reliability, At Risk Persons
Kerr, Jacqueline; Sallis, James F.; Bromby, Erica; Glanz, Karen – Journal of Nutrition Education and Behavior, 2012
Objective: To evaluate reliability and validity of a new tool for assessing the placement and promotional environment in grocery stores. Methods: Trained observers used the "GroPromo" instrument in 40 stores to code the placement of 7 products in 9 locations within a store, along with other promotional characteristics. To test construct validity,…
Descriptors: Reliability, Validity, Food, Retailing
Nagle, Kathy F.; Eadie, Tanya L. – Journal of Communication Disorders, 2012
The purpose of this study was to determine whether: (a) inexperienced listeners can reliably judge listener effort and (b) whether listener effort provides unique information beyond speech intelligibility or acceptability in tracheoesophageal speech. Twenty inexperienced listeners made judgments of speech acceptability and amount of effort…
Descriptors: Listening, Reliability, Speech, Articulation (Speech)
Zhang, Mo; Williamson, David M.; Breyer, F. Jay; Trapani, Catherine – International Journal of Testing, 2012
This article describes two separate, related studies that provide insight into the effectiveness of "e-rater" score calibration methods based on different distributional targets. In the first study, we developed and evaluated a new type of "e-rater" scoring model that was cost-effective and applicable under conditions of absent human rating and…
Descriptors: Automation, Scoring, Models, Essay Tests
Mason, Richard W.; Schroeder, Mark P. – Educational Assessment, Evaluation and Accountability, 2012
Letters of reference are commonly used in acquiring a job in education. Despite serious issues of validity and reliability in writing and evaluating letters, there is a dearth of research that systematically examines the evaluation process and defines the constructs that define high quality letters. The current study used NVivo to examine 160…
Descriptors: Student Teachers, Letters (Correspondence), Validity, Reliability
Kim, Seonghoon – Psychometrika, 2012
Assuming item parameters on a test are known constants, the reliability coefficient for item response theory (IRT) ability estimates is defined for a population of examinees in two different ways: as (a) the product-moment correlation between ability estimates on two parallel forms of a test and (b) the squared correlation between the true…
Descriptors: Reliability, Item Response Theory, Tests, Correlation
Weerasinghe, I. M. S.; Fernando, R. L. S. – Quality Assurance in Education: An International Perspective, 2018
Purpose: The purpose of this study is to explain critical factors affecting student satisfaction levels in selected state universities in Sri Lanka. Design/methodology/approach: The study has applied a quantitative survey design guided by six hypotheses. A conceptual framework has been developed to address the research questions on the basis of a…
Descriptors: Foreign Countries, Student Satisfaction, Hypothesis Testing, Undergraduate Students
Leonnard – Journal on Efficiency and Responsibility in Education and Science, 2018
The increasing number of educational services has caused high competition in this industry. In Indonesia, the number of private universities is the highest compared to state universities and other forms of higher education institutions. Ability to predict factors that are important in providing educational services to achieve student…
Descriptors: Educational Quality, Private Colleges, Public Relations, Foreign Countries

