Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Fishkin, Anne S.; Johnson, Aileen S. – Roeper Review, 1998 (peer reviewed)
This article examines assessment instruments, measurement considerations, and factors that affect understanding of a child's creativity. It compares strengths and weaknesses of methods of assessing creativity and lists more than 60 standardized assessment measures. Procedures for using formal and informal measures in the decision-making process…
Descriptors: Ability Identification, Children, Creativity, Divergent Thinking
Wheeler, Patricia H. – Evaluation Practice, 1995 (peer reviewed)
This volume is the fourth in a series for college faculty and advanced graduate students, "Survival Skills for Scholars." It offers practical advice for developing, using, and grading classroom examinations, focusing on traditional multiple-choice and constructed-response tests rather than alternative assessments. (SLD)
Descriptors: College Faculty, Constructed Response, Grading, Higher Education
Koren, Shira – System, 1995 (peer reviewed)
This article proposes a new type of pronunciation test, based on the variability principle and language continuum paradigm developed by Tarone (1983, 1985). Trials with 80 elementary and 73 university students indicate that the test distinguishes between subjects with and without phonetic training and is generally reliable and valid. Contains 31…
Descriptors: College Students, Elementary Education, Elementary School Students, Higher Education
Sherman, Tracy; Shulman, Brian B. – Infant-Toddler Intervention: The Transdisciplinary Journal, 1999 (peer reviewed)
This study examined test characteristics of the Pediatric Language Acquisition Screening Tool for Early Referral-Revised (PLASTER-R), a set of developmental questionnaires for children 3 to 60 months of age. The PLASTER-R was moderately to highly successful in identifying children within normal limits for language development. Test-retest…
Descriptors: Disability Identification, Early Intervention, Infants, Language Acquisition
Educational Researcher, 2000 (peer reviewed)
Presents the American Educational Research Association's position on high-stakes educational testing, which stresses conditions essential to sound implementation of such testing, including: protection against high-stakes decisions based on single tests; adequate resources and opportunity to learn; validation for each intended use; alignment…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, High Stakes Tests
Rothbart, Mary K.; Ahadi, Stephan A.; Hershey, Karen L.; Fisher, Phillip – Child Development, 2001 (peer reviewed)
Reviews evidence on reliability and validity of the Children's Behavior Questionnaire (CBQ); presents CBQ data on structure of temperament in childhood. Factor analyses indicate three broad dimensions of temperament: extroversion/surgency, negative affectivity, and effortful control. This factor structure also appears in ratings of children in…
Descriptors: Behavior Development, Cross Cultural Studies, Individual Differences, Measures (Individuals)
Taylor, Catherine S.; Nolen, Susan Bobbitt – Education Policy Analysis Archives, 1996 (peer reviewed)
The usefulness of traditional concepts of validity and reliability, developed for large-scale assessments, for the classroom context is explored. Alternate frameworks that situate these constructs in teachers' work in classrooms are presented, and their use in an assessment course for preservice teachers is described. (SLD)
Descriptors: Educational Assessment, Learning, Models, Preservice Teachers
Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 1998 (peer reviewed)
The Disruptive Behavior Rating Scale (DBRS) was designed to differentiate among the disruptive behavior disorders through administration to teachers and parents. Four studies are presented here to provide a technical analysis of fathers' responses to the DBRS-Parent Version. Reliability and construct and criterion-related validity of father…
Descriptors: Behavior Problems, Behavior Rating Scales, Concurrent Validity, Construct Validity
Sarouphim, Ketty M. – Gifted Child Quarterly, 1999 (peer reviewed)
A relatively new performance-based assessment, the DISCOVER process, is presented in this review. The theoretical framework of the assessment is explained, followed by a delineation of the assessment process, checklist characteristics and development, and the tasks in the five different activities. Findings on assessment validity and reliability…
Descriptors: Ability Identification, Elementary Secondary Education, Evaluation Methods, Gifted
Foreman, Phil; Bourke, Sid; Mishra, Gita; Frost, Rick – International Journal of Disability, Development and Education, 2001 (peer reviewed)
An instrument was developed to assess needs of students with disabilities in regular classes and was used as the basis for providing funding support for 12,375 students. The three domains of the instrument, physical needs, learning needs, and social needs, had good construct and face validities and high score reliabilities. (Contains 10…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Foreign Countries
Morgeson, Frederick P.; Humphrey, Stephen E. – Journal of Applied Psychology, 2006
Although there are thousands of studies investigating work and job design, existing measures are incomplete. In an effort to address this gap, the authors reviewed the work design literature, identified and integrated previously described work characteristics, and developed a measure to tap those work characteristics. The resultant Work Design…
Descriptors: Questionnaires, Job Development, Work Environment, Test Construction
Siu, Andrew M. H.; Shek, Daniel T. L. – Adolescence (San Diego): an international quarterly devoted to the physiological, psychological, psychiatric, sociological, and educational aspects of the second decade of human life, 2005 (peer reviewed)
This paper reports evidence on the factor structure, reliability, and validity of the Chinese Family Assessment Instrument (C-FAI), an instrument developed to assess family functioning in Chinese populations. A convenience sample of 1,462 adolescents from junior secondary schools completed the C-FAI and measures of parent-adolescent conflict.…
Descriptors: Foreign Countries, Psychometrics, Factor Structure, Conflict
Gamliel, Eyal; Davidovitz, Liema – Assessment and Evaluation in Higher Education, 2005
Using an experimental mixed design, this study compared the traditional paper-and-pencil method for evaluating teaching with the online method. Replicating previous findings, the comparison revealed similar evaluation means of the two methods. However, the stability of teaching evaluations using paper-and-pencil twice was substantially higher than…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation of Teacher Performance, Teacher Evaluation
Lee, Richard M.; Yoo, Hyung Chol – Journal of Counseling Psychology, 2004
The authors investigated the structure and measurement of ethnic identity using the Multigroup Ethnic Identity Measure (MEIM; J. S. Phinney, 1992) on a diverse sample of Asian American college students. The authors drew upon 3 previously published datasets to examine the factor structure of the MEIM, initial reliability and construct validity,…
Descriptors: Test Reliability, Test Validity, Ethnicity, Asian American Students
Mossbarger, Brad – Educational Gerontology, 2005
Terminology in the Global Assessment of Functioning (GAF) Scale of DSM-IV often is irrelevant to the realities of nursing homes, assisted living centers, and similar facilities in which residents encounter stressors that are unique to their living environment and circumstances. As the mental health needs of long-term care residents are…
Descriptors: Measures (Individuals), Nursing Homes, Health Needs, Mental Health

