Gersten, Russell; Baker, Scott K.; Haager, Diane; Graves, Anne W. – Remedial & Special Education, 2005
The first portion of this article describes the development and validation of a classroom observation measure. The goal of the measure was to assess the quality of reading instruction provided to first-grade English learners. We report the internal consistency reliability, interrater reliability, the development of empirically derived subscales,…
Descriptors: Second Language Learning, English (Second Language), Reading Instruction, Teacher Effectiveness
Sykes, Robert C.; Hou, Liling – Applied Measurement in Education, 2003
Weighting responses to Constructed-Response (CR) items has been proposed as a way to increase the contribution these items make to the test score when there is insufficient testing time to administer additional CR items. The effect of various types of weighting items of an IRT-based mixed-format writing examination was investigated.…
Descriptors: Item Response Theory, Weighted Scores, Responses, Scores
Jung, Lee Ann; McWilliam, R. A. – Journal of Early Intervention, 2005
Evidence is presented regarding the construct validity and internal consistency reliability of scores for an investigator-developed individualized family service plan (IFSP) rating scale. One hundred and twenty IFSPs were rated using a 12-item instrument, the IFSP Rating Scale (McWilliam & Jung, 2001). Using principal components factor…
Descriptors: Test Validity, Rating Scales, Factor Analysis, Construct Validity
Lynch, Elizabeth; Medin, Douglas – Cognitive Psychology, 2006
The current studies explore causal models of heart attack and depression generated from American healers who use distinct explanatory frameworks. Causal chains leading to two illnesses, heart attack and depression, were elicited from participant groups: registered nurses (RNs), energy healers, RN energy healers, and undergraduates. The…
Descriptors: Cultural Differences, Causal Models, Heart Disorders, Depression (Psychology)
Kucuker, Sevgi; Acarlar, Funda; Kapci, Emine G. – Early Child Development and Care, 2006
This study aimed to develop a new scale, the "Supports Scale For Preschool Inclusion" (SSPI), to assess preschool teachers' perceptions of necessary factors and availability of supports for a successful inclusion in pre-school educational settings. Pre-school teachers ("n" = 183, mean age = 32.81, standard deviation = 8.29) from…
Descriptors: Foreign Countries, Measures (Individuals), Psychometrics, Preschool Teachers
Graham, James M.; Liu, Yenling J.; Jeziorski, Jennifer L. – Journal of Marriage and Family, 2006
We conducted a reliability generalization meta-analysis to examine the internal consistency of Dyadic Adjustment Scale (DAS; Spanier, 1976) scores across 91 published studies with 128 samples and 25,035 participants. The DAS was found to produce total and Dyadic cohesion, Consensus, and Satisfaction scores of acceptable internal consistency,…
Descriptors: Sexual Orientation, Marital Status, Generalization, Reliability
Adaptation of the 36-Month Ages and Stages Questionnaire in Taiwan: Results from a Preliminary Study
Tsai, Huei-Ling Agnes; McClelland, Megan M.; Pratt, Clara; Squires, Jane – Journal of Early Intervention, 2006
Identification of children with developmental disabilities is the first critical step in providing early intervention services. Currently, only 20% of Taiwanese children who could potentially benefit from early intervention have been identified. One possible reason for this low identification rate is the lack of a culturally appropriate,…
Descriptors: Foreign Countries, Questionnaires, Disability Identification, Young Children
Henkens, Kene – Canadian Journal on Aging, 2005
This article presents the results of a study into stereotyping by managers of their older workers and the influence of these stereotypes on the inclination of managers to keep their older workers in employment. The data for the study were gathered among 796 managers. Through principal components analysis, 15 opinions about older workers were…
Descriptors: Retirement, Older Workers, Administrators, Administrator Attitudes
Myers, Nicholas D.; Feltz, Deborah L.; Maier, Kimberly S.; Wolfe, Edward W.; Reckase, Mark D. – Research Quarterly for Exercise and Sport, 2006
This study provided initial validity evidence for multidimensional measures of coaching competency derived from the Coaching Competency Scale (CCS). Data were collected from intercollegiate men's (n = 8) and women's (n = 13) soccer and women's ice hockey teams (n = 11). The total number of athletes was 585. Within teams, a multidimensional…
Descriptors: Athletes, Athletic Coaches, Competence, Team Sports
ChanLin, Lih-Juan – Journal of Instructional Psychology, 2005
This paper uses the data from a survey among school teachers to conduct a series of factor analyses to test the reliability of a set of items to determine the factors deemed important in technology integration among teachers. The results suggest that there are specific dimensions of items that can be used to determine the factors perceived by…
Descriptors: Questionnaires, Technology Integration, Teacher Surveys, Factor Analysis
Diaz, Juan Jose; Handa, Sudhanshu – Journal of Human Resources, 2006
Not all policy questions can be addressed by social experiments. Nonexperimental evaluation methods provide an alternative to experimental designs but their results depend on untestable assumptions. This paper presents evidence on the reliability of propensity score matching (PSM), which estimates treatment effects under the assumption of…
Descriptors: Evaluation Methods, Research Design, Reliability, Program Evaluation
Lau, Anna L.D.; Cummins, Robert A.; McPherson, Wenda – Social Indicators Research, 2005
The Personal Wellbeing Index (PWI) is being developed for the cross-cultural measurement of subjective wellbeing (SWB). This paper reports the findings of its utility with the Hong Kong Chinese and Australian populations. An item on affect, "satisfaction with own happiness," was also investigated to determine whether it should be added to…
Descriptors: Foreign Countries, Cultural Differences, Quality of Life, Cross Cultural Studies
Kozaki, Y. – Language Testing, 2004
This article presents a standard-setting procedure for performance assessment in a foreign language, through which some of the major problems facing performance assessment in criterion-referenced testing can be addressed. The procedure, which was geared to revealing and accommodating inter-judge variability, employed the synergy of multiple…
Descriptors: Data Analysis, Testing, Performance Tests, Generalizability Theory
Oh, Deborah M.; Kim, Joshua M.; Garcia, Raymond E.; Krilowicz, Beverly L. – Advances in Physiology Education, 2005
There is increasing pressure, both from institutions central to the national scientific mission and from regional and national accrediting agencies, on natural sciences faculty to move beyond course examinations as measures of student performance and to instead develop and use reliable and valid authentic assessment measures for both individual…
Descriptors: Evaluation Methods, Biochemistry, Natural Sciences, Generalizability Theory
DiTommaso, Enrico; Brannen, Cyndi; Best, Lisa A. – Educational and Psychological Measurement, 2004
This article presents a psychometric study of the short form of the Social and Emotional Loneliness Scale for Adults (SELSA-S). Data were collected via self-report measures and mail surveys from several samples including university students, spouses of military personnel, and psychiatric patients. A total of 1,526 individuals took part in this…
Descriptors: Psychological Patterns, Measures (Individuals), Psychometrics, Emotional Response