Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Tierney, Robin D.; Simon, Marielle; Charland, Julie – Educational Forum, 2011
Knowing that grades can have long-term consequences for students, teachers voice concern about being fair in the grading process. However, their interpretations of fairness are varied and sometimes contradictory. This study looked at how teachers in one standards-based educational system determined secondary students' grades, focusing specifically…
Descriptors: Grades (Scholastic), Academic Achievement, Grading, Educational Principles
Perpina, Conxa; Cebolla, Ausias; Botella, Cristina; Lurbe, Empar; Torro, Maria-Isabel – Journal of Clinical Child and Adolescent Psychology, 2011
The aims of this study were to validate the Emotional Eating Scale version for children (EES-C) in a Spanish population and study the differences in emotional eating among children with binge eating (BE), overeating (OE), and no episodes of disordered eating (NED). The questionnaire was completed by 199 children aged 9 to 16 years. Confirmatory…
Descriptors: Check Lists, Helplessness, Eating Disorders, Factor Structure
McDonell, James R.; Waters, Tracy J. – Social Indicators Research, 2011
This paper reports the development and validation of the Neighborhood Observation Scale, a 41 item measure of neighborhood physical appearance, social appearance, safety, and amenities. Three independent ratings were collected on each of 244 neighborhoods in 132 census block groups in five South Carolina counties, for a total of 732 observations.…
Descriptors: Neighborhoods, Child Abuse, Safety, Validity
Rhodes, Terrel L. – Change: The Magazine of Higher Learning, 2011
People are inundated with technology on campuses and in their lives. Students are increasingly technology savvy, expecting faculty and administrators to function comfortably within the digital world. They have responded by using technology more and more in teaching and learning. In this article, the author focuses on one such use--student…
Descriptors: Portfolios (Background Materials), Undergraduate Study, Campuses, Evaluation
Stewart, Jeffrey; White, David A. – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2011
Multiple-choice tests such as the Vocabulary Levels Test (VLT) are often viewed as a preferable estimator of vocabulary knowledge when compared to yes/no checklists, because self-reporting tests introduce the possibility of students overreporting or underreporting scores. However, multiple-choice tests have their own unique disadvantages. It has…
Descriptors: Guessing (Tests), Scoring Formulas, Multiple Choice Tests, Test Reliability
He, Qingping; Boyle, Andrew; Opposs, Dennis – Evaluation & Research in Education, 2011
Building on findings from existing qualitative research into public perceptions of reliability in examination results in England, a questionnaire was developed and administered to samples of teachers, students and employers to study their awareness of and opinions about various aspects of reliability quantitatively. Main findings from the study…
Descriptors: Qualitative Research, Student Evaluation, Tests, Program Effectiveness
Hornsveld, Ruud H. J.; Muris, Peter; Kraaimaat, Floris W. – Psychological Assessment, 2011
We examined the psychometric properties of the Novaco Anger Scale--Provocation Inventory (NAS-PI, 1994 version) in Dutch violent forensic psychiatric patients and secondary vocational students. A confirmatory factor analysis of the subscale structure of the NAS was carried out, reliability was investigated, and relations were calculated between…
Descriptors: Control Groups, Personality Traits, Persuasive Discourse, Personality
Bell, Lindsay; Long, Susanne; Garvan, Cynthia; Bussing, Regina – Psychology in the Schools, 2011
Attention-Deficit/Hyperactivity Disorder (ADHD) is one of the most frequently diagnosed psychiatric disorders in childhood and adolescence. It is associated with high levels of stigma, which may lead to treatment barriers, self-fulfilling prophecies, and social rejection. This study established the reliability of the ADHD Stigma Questionnaire…
Descriptors: Attention Deficit Hyperactivity Disorder, Factor Structure, Special Education Teachers, Teacher Certification
Gekara, Victor Oyaro; Bloor, Michael; Sampson, Helen – Journal of Vocational Education and Training, 2011
Vocational education and training (VET) concerns the cultivation and development of specific skills and competencies, in addition to broad underpinning knowledge relating to paid employment. VET assessment is, therefore, designed to determine the extent to which a trainee has effectively acquired the knowledge, skills, and competencies required by…
Descriptors: Marine Education, Occupational Safety and Health, Computer Assisted Testing, Vocational Education
Aktamis, Hilal – Educational Research and Reviews, 2011
The aim of this study was to determine energy saving behavior and energy awareness of secondary school students and the effects of socio-demographic characteristics (gender, residential area and grade level) on energy saving and energy awareness. The research is a survey model with an approach that aims to describe the current status. A total of…
Descriptors: Urban Schools, Student Attitudes, Reliability, Energy
Baartman, Liesbeth K. J.; Prins, Frans J.; Kirschner, Paul A.; van der Vleuten, Cees P. M. – Evaluation and Program Planning, 2011
The goal of this article is to contribute to the validation of a self-evaluation method, which can be used by schools to evaluate the quality of their Competence Assessment Program (CAP). The outcomes of the self-evaluations of two schools are systematically compared: a novice school with little experience in competence-based education and…
Descriptors: Educational Innovation, Competency Based Education, Self Evaluation (Groups), Program Validation
O'Sullivan, Maureen – Psychological Bulletin, 2008
In 2006, C. F. Bond Jr. and B. M. DePaulo provided a meta-analysis of means and concluded that average lie detection accuracy was significantly greater than chance for most people. Now, they have presented an analysis of standard deviations (C. F. Bond Jr. & B. M. DePaulo, 2008), claiming that there are no reliable individual differences in lie…
Descriptors: Deception, Test Theory, Meta Analysis, Individual Differences
Ratcliff, Roger; Schmiedek, Florian; McKoon, Gail – Intelligence, 2008
The worst performance rule for cognitive tasks [Coyle, T.R. (2003). IQ, the worst performance rule, and Spearman's law: A reanalysis and extension. "Intelligence," 31, 567-587] in which reaction time is measured is the result that IQ scores correlate better with longer (i.e., 0.7 and 0.9 quantile) reaction times than shorter (i.e., 0.1 and 0.3…
Descriptors: Reaction Time, Intelligence Quotient, Correlation, Models
Wiliam, Dylan – Assessment in Education: Principles, Policy & Practice, 2008
While international comparisons such as those provided by PISA may be meaningful in terms of overall judgements about the performance of educational systems, caution is needed in terms of more fine-grained judgements. In particular it is argued that using the results of PISA to draw conclusions about the quality of instruction in different systems is…
Descriptors: Test Bias, Test Construction, Comparative Testing, Evaluation
Jaswal, Vikram K.; McKercher, David A.; VanderBorght, Mieke – Child Development, 2008
Two studies investigated 3- to 5-year-olds' trust in a reliable informant when judging novel labels and novel plural and past tense forms. In Study 1, children (N = 24) endorsed the names of new objects given by an informant who had earlier labeled familiar objects correctly over the names given by an informant who had labeled the same objects…
Descriptors: Nouns, Morphemes, Young Children, Trust (Psychology)

