Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Karadag, Ruhan – Online Submission, 2014
The aim of this study is to explore primary school teachers' views on critical reading skills and their perceptions of their own competence. The participants are 25 teacher candidates who are fourth-year students in the Department of Primary School Education at the Education Faculty of Adiyaman University. Adopting a qualitative data approach,…
Descriptors: Foreign Countries, Preservice Teachers, Student Attitudes, Semi Structured Interviews
Bhat, Mehraj A. – Online Submission, 2014
This paper describes the construction of a reasoning ability test for secondary school students and the evaluation of its reliability and validity. An attempt was made to evaluate the test's validity and reliability and to determine appropriate standards for interpreting its results. The test includes 45 items to measure six types…
Descriptors: Foreign Countries, Secondary Schools, Secondary School Students, Grade 10
Kratz, Hilary E.; Locke, Jill; Piotrowski, Zinnia; Ouellette, Rachel R.; Xie, Ming; Stahmer, Aubyn C.; Mandell, David S. – Grantee Submission, 2014
This study sought to validate a new measure, the Classroom Cohesion Survey (CCS), designed to examine the relationship between teachers and classroom assistants in autism support classrooms. Teachers, classroom assistants, and external observers showed good inter-rater agreement on the CCS and good internal consistency for all scales. Simple…
Descriptors: Special Education Teachers, Teacher Aides, Autism, Pervasive Developmental Disorders
Lamb, Lindsay M. – Online Submission, 2014
Austin Independent School District (AISD) Social Emotional Learning (SEL) coaches rated schools on the degree to which they implemented 10 domains believed to best exemplify program goals. This study examines the validity and reliability of the implementation rubric they used. A separate research brief report also was published. [For the research…
Descriptors: Social Emotional Learning, Program Implementation, Scoring Rubrics, School Districts
Patel, Sona; Shrivastav, Rahul; Eddins, David A. – Journal of Speech, Language, and Hearing Research, 2012
Purpose: Perceptual estimates of voice quality obtained using rating scales are subject to contextual biases that influence how individuals assign numbers to estimate the magnitude of vocal quality. Because rating scales are commonly used in clinical settings, assessments of voice quality are also subject to the limitations of these scales.…
Descriptors: Rating Scales, Objective Tests, Voice Disorders, Test Reliability
Li, Deping; Jiang, Yanlin; von Davier, Alina A. – Journal of Educational Measurement, 2012
This study investigates a sequence of item response theory (IRT) true score equatings based on various scale transformation approaches and evaluates equating accuracy and consistency over time. The results show that the biases and sample variances for the IRT true score equating (both direct and indirect) are quite small (except for the mean/sigma…
Descriptors: True Scores, Equated Scores, Item Response Theory, Accuracy
Savickas, Mark L.; Porfeli, Erik J. – Journal of Vocational Behavior, 2012
Researchers from 13 countries collaborated in constructing a psychometric scale to measure career adaptability. Based on four pilot tests, a research version of the proposed scale consisting of 55 items was field tested in 13 countries. The resulting Career Adapt-Abilities Scale (CAAS) consists of four scales, each with six items. The four scales…
Descriptors: Foreign Countries, Vocational Adjustment, Measures (Individuals), Test Reliability
Kim, Sooyeon; Walker, Michael E.; Larkin, Kevin – International Journal of Testing, 2012
We demonstrate how to assess the potential changes to a test's score scale necessitated by changes to the test specifications when a field study is not feasible. We used a licensure test, which is currently under revision, as an example. We created two research forms from an actual form of the test. One research form was developed with the current…
Descriptors: Equated Scores, Licensing Examinations (Professions), Test Reliability, Construct Validity
Wilson, Erin M.; Green, Jordan R.; Weismer, Gary – Journal of Speech, Language, and Hearing Research, 2012
Purpose: The purpose of this investigation was to describe age- and consistency-related changes in the temporal characteristics of chewing in typically developing children between the ages of 4 and 35 months and adults using high-resolution optically based motion capture technology. Method: Data were collected from 60 participants (48 children, 12…
Descriptors: Human Body, Motion, Biomechanics, Time
Arterberry, Brooke J.; Martens, Matthew P.; Cadigan, Jennifer M.; Smith, Ashley E. – Measurement and Evaluation in Counseling and Development, 2012
This study assessed the score reliability of the Drinking Motives Questionnaire-Revised (DMQ-R) via generalizability theory. Participants (n = 367 college students) completed the DMQ-R at three time points. Across subscale scores, persons, persons x occasions, and persons x items interactions accounted for meaningful variance. Findings illustrate…
Descriptors: Generalizability Theory, Drinking, Questionnaires, Motivation
Sheehan, Dwayne P.; Lafave, Mark R.; Katz, Larry – Measurement in Physical Education and Exercise Science, 2011
This study was designed to test the intra- and inter-rater reliability of the University of North Carolina's Balance Error Scoring System in 9- and 10-year-old children. Additionally, a modified version of the Balance Error Scoring System was tested to determine if it was more sensitive in this population ("raw scores"). Forty-six…
Descriptors: Elementary School Students, Interrater Reliability, Scoring, Raw Scores
Kucuker, Sevgi; Kapci, Emine Gul; Uslu, Runa Idil – Infants and Young Children, 2011
The applicability of the Ages and Stages Questionnaires: Social Emotional (ASQ-SE; J. Squires, D. Bricker & E. Twombly, 2003) for Turkish children was examined. A total of 608 mothers completed the ASQ-SE. Overall sensitivity and overall specificity were 83.7% and 89.9%, respectively. Test-retest reliability, assessed by classifying children…
Descriptors: Questionnaires, Children, Mothers, Emotional Problems
Withana, Eran Chinthaka – ProQuest LLC, 2011
From time-critical, real-time computational experimentation to applications that process petabytes of data, there is a continuing search for faster, more responsive computing platforms capable of supporting computational experimentation. Weather forecast models, for instance, process gigabytes of data to produce regional (mesoscale) predictions on…
Descriptors: Computers, Computation, Science Experiments, Reliability
Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011
There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…
Descriptors: Scores, Methods, Validity, Reliability
Yang, Yanyun; Green, Samuel B. – Journal of Psychoeducational Assessment, 2011
Coefficient alpha is almost universally applied to assess reliability of scales in psychology. We argue that researchers should consider alternatives to coefficient alpha. Our preference is for structural equation modeling (SEM) estimates of reliability because they are informative and allow for an empirical evaluation of the assumptions…
Descriptors: Structural Equation Models, Reliability, Measures (Individuals)

