Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedGibbs, William; Graves, Pat R.; Bernas, Ronan S. – Journal of Research on Technology in Education, 2001
Describes a study that used a Web-based survey and a modified Delphi technique to identify criteria important to multimedia instructional courseware evaluation, validate them with a panel of instructional technology experts, and examine the effect of conducting panel discussions online. Shows information accuracy and reliability as the most…
Descriptors: Computer Software Evaluation, Courseware, Delphi Technique, Discussion
Peer reviewedSarafino, Edward P.; Ewing, Maureen – Journal of American College Health, 1999
Describes the development of the Hassles Assessment Scale for Students in College, which measured student stress. Development involved item generation, psychometric evaluation, and revision. Separate student samples participated in each phase. Results found very high levels of internal consistency for the frequency, unpleasantness, and dwelling…
Descriptors: College Students, Coping, Higher Education, Stress Management
VanSciver, James H. – High School Magazine, 1999
Teachers deserve objective evaluations that will help them to improve. Delaware uses a four-point rubric ranging from "unsatisfactory" to "needs improvement,""effective," and "exemplary." Teachers' placement depends on frequency of demonstrated behaviors identified in each rubric. For lower ratings, the…
Descriptors: Evaluation Criteria, Feedback, Interrater Reliability, Program Descriptions
Peer reviewedRiddle, Kathryn P.; Aponte, Joseph F. – Child Abuse & Neglect: The International Journal, 1999
Data collected from 95 college students found the Comprehensive Childhood Maltreatment Inventory to have excellent test-retest reliability and 2 of the 4 subscales (Psychological Maltreatment and Neglect and Sexual Abuse) to possess adequate internal consistency. Reasons for low internal consistency for the Physical Maltreatment and Physical…
Descriptors: Child Abuse, Child Neglect, College Students, Evaluation Methods
Peer reviewedEpstein, Michael H.; Hertzog, Melody A.; Reid, Robert – Behavioral Disorders, 2001
Data are reported on the long-term (6-month) test-retest reliability of the Behavioral and Emotional Rating Scale (BERS), which is a strength-based assessment instrument. Participants included 95 typical elementary students and 26 children at risk of having or having emotional or behavioral disorders. Moderate to high test-retest correlations were…
Descriptors: Behavior Disorders, Behavior Rating Scales, Clinical Diagnosis, Disability Identification
Peer reviewedHunter, Darryl M.; Randhawa, Bikkar S. – Alberta Journal of Educational Research, 2001
Examines reliability issues in the large-scale assessment of speech communication through authentic techniques, used recently in Saskatchewan. Performance-based approaches enable educators to evaluate the integrated, interpersonal communication skills of large student populations, thereby modeling best professional practice. However, decentralized…
Descriptors: Audiolingual Skills, Communication Skills, Elementary Secondary Education, Foreign Countries
Peer reviewedNICHD Early Child Care Research Network. – Developmental Psychology, 2001
Analyzed relationships between child-care experience and preschoolers' attachment. Found that maternal sensitivity was the strongest predictor of preschoolers' attachment. When maternal sensitivity was low, more hours per week in child care at 15 months somewhat increased the risk of the insecure-ambivalent classification at 36 months. Found…
Descriptors: Attachment Behavior, Day Care, Family Characteristics, Longitudinal Studies
Peer reviewedHamilton, Jan; Reddel, Sue; Spratt, Mary – System, 2001
Describes a pilot on-line system of rater training developed at Hong Kong Polytechnic University to support the English Language Centre's English for academic purposes and English in the workplace subjects. Presents findings from a project that investigated raters' attitudes towards assessment and rater training, and evaluated the pilot on-line…
Descriptors: English (Second Language), English for Academic Purposes, Foreign Countries, Interrater Reliability
Naevdal, F. – Journal of Adolescence, 2005
The article presents a psychometric description of 11 statements related to use of physical violence. The items were tested in a normal sample (N=1700, age: 15-16) from urban and rural areas in Western Norway. The internal reliability was @a=0.86, and the factor analysis resulted in two factors. Boys had higher mean scores than girls.…
Descriptors: Test Reliability, Predictor Variables, Test Validity, Gender Differences
Burton, Richard F. – Assessment and Evaluation in Higher Education, 2005
Examiners seeking guidance on multiple-choice and true/false tests are likely to encounter various faulty or questionable ideas. Twelve of these are discussed in detail, having to do mainly with the effects on test reliability of test length, guessing and scoring method (i.e. number-right scoring or negative marking). Some misunderstandings could…
Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Test Reliability
Wilson, Coralie J.; Deane, Frank P.; Ciarrochi, Joseph; Rickwood, Debra – Canadian Journal of Counselling, 2005
Understanding help seeking intentions and behaviour is fundamental to the identification of factors that can be modified to increase engagement in counselling. Despite considerable research on these variables, integrating prior research has been impeded by a lack of consistent and psychometrically sound help-seeking measures. The General…
Descriptors: Intention, Measures (Individuals), Help Seeking, Student Attitudes
Hill, Laura G.; Coie, John D.; Lochman, John E.; Greenberg, Mark T. – Journal of Consulting and Clinical Psychology, 2004
Accurate, early screening is a prerequisite for indicated interventions intended to prevent development of externalizing disorders and delinquent behaviors. Using the Fast Track longitudinal sample of 396 children drawn from high-risk environments, the authors varied assumptions about base rates and examined effects of multiple-time-point and…
Descriptors: Cost Effectiveness, Delinquency, Screening Tests, Early Intervention
Malofeeva, Elena; Day, Jeanne; Saco, Ximena; Young, Laura; Ciancio, Dennis – Journal of Educational Psychology, 2004
The reliability and, to a lesser extent, the validity of the newly created Number Sense Test was evaluated with a group of 40 3- to 5-year-old children attending Head Start. Six number sense skills (e.g., counting, number identification, addition-subtraction) and children's feelings about school were assessed both before and after instruction…
Descriptors: Disadvantaged Youth, Student Attitudes, Preschool Children, Mathematics Skills
Hodges, Timothy D.; Harter, James K. – Educational Horizons, 2005
StrengthsQuest is a student program that focuses on strengths rather than weaknesses. It is intended to lead students to discover their natural talents and gain unique and valuable insights into how to develop such talents into strengths--strengths that equip them to succeed and to make important decisions that enable them to balance the demands…
Descriptors: Test Reliability, Test Validity, Talent Development, Student Empowerment
Ball, S. L.; Holland, A. J.; Huppert, F. A.; Treppner, P.; Watson, P.; Hon, J. – Journal of Intellectual Disability Research, 2004
Dementia because of Alzheimer's disease (AD) commonly affects older adults with Down's syndrome (DS). Methods are needed, with established concurrent and predictive validity, to facilitate the diagnostic assessment of dementia, when it is complicated by pre-existing intellectual disabilities (ID). We report on the reliability and validity of a…
Descriptors: Identification, Predictive Validity, Interrater Reliability, Alzheimers Disease

Direct link
