Bracey, Gerald W. – School Administrator, 1993
A recent RAND study questions the reliability of judgments used in portfolio evaluation in Vermont schools. Researchers at the Center for Research in Evaluation, Standards, and Student Testing (CRESST) are developing guidelines to determine whether the new performance-based assessments are valid and reliable. Emerging CRESST criteria concerning testing…
Descriptors: Costs, Elementary Secondary Education, Evaluation Criteria, Performance Based Assessment
Range, Lillian M.; And Others – Death Studies, 1993 (peer reviewed)
Examined factor structure of Reasons for Living Inventory (RFL) and its internal consistency for 128 high school and 145 college students. Means were higher than those of adults who reported never considering suicide. Goodness of Fit Index was poor, suggesting that factors in teenagers' reasons for living are different from those of adults.…
Descriptors: Adolescents, College Students, Factor Structure, High School Students
Marzano, Robert J. – Educational Leadership, 1994 (peer reviewed)
Students generally do better on outcome-based performance tasks than on domain-specific tasks. Results on performance tasks must be interpreted in the context of instruction or guidance provided before or during their administration. Reliability is sometimes questionable, since teachers are highly influenced by students' overall academic…
Descriptors: Context Effect, Elementary Secondary Education, Holistic Approach, Performance Based Assessment
Yarroch, William L. – Journal of Research in Science Teaching, 1991 (peer reviewed)
The use of content validity as the primary assurance of the measurement accuracy for science assessment examinations is questioned. An alternative accuracy measure, item validity, is proposed. Item validity is based on research using qualitative comparisons between (1) student answers to objective items, (2) clinical interviews, and (3) student…
Descriptors: Content Validity, Educational Research, Elementary Secondary Education, Evaluation
Dunkel, Patricia – Language Learning & Technology, 1999 (peer reviewed)
Describes what a computer-adaptive test (CAT) is, examines its roots, and points out some of the challenges this approach to assessment presents. (Author/VWL)
Descriptors: Computer Assisted Testing, Computer Software, Evaluation Methods, Higher Education
Mabry, Linda – Phi Delta Kappan, 1999
Education remains heavily shackled by punitive, test-driven reform. Despite reasonable alternatives, testing increasingly drives educational accountability and reform. Standardization of direct writing assessments promotes scoring reliability and facilitates educational comparisons and rankings. However, standardized writing is not good writing,…
Descriptors: Elementary Secondary Education, Interrater Reliability, Performance Based Assessment, Scoring Rubrics
Skowron, Elizabeth A.; Friedlander, Myrna L. – Journal of Counseling Psychology, 1998 (peer reviewed)
The Differentiation of Self Inventory (DSI) focuses on adults, their significant relationships, and current relations with family of origin. Three studies are reported to (1) create the DSI; (2) improve theoretical focus, item content, and psychometric properties; and (3) test validity. Factor analyses are presented and results discussed; test…
Descriptors: Adult Development, Adults, Counseling, Counseling Psychology
Greenwald, Anthony G.; Gillmore, Gerald M. – American Psychologist, 1997 (peer reviewed)
Identifies four additional data patterns that discriminate among five theories of students' expected grades/teacher ratings correlation. The presence of all four of these markers in student ratings data is consistent with the theory that the correlation is due to an unwanted influence of instructors' grading leniency. (MMU)
Descriptors: Correlation, Grade Inflation, Grades (Scholastic), Higher Education
Erford, Bradley T. – Diagnostique, 1997
Internal consistency, test-retest reliability, item analysis, and construct and concurrent criterion-related validity of the Writing Essential Skill Screener-Preschool Version were studied using four independent samples of children (ages 4 to 5). The test displayed a high degree of reliability and validity for a brief pre-writing skills screener.…
Descriptors: Learning Disabilities, Preschool Children, Preschool Education, Screening Tests
Napoli, Anthony R.; Raymond, Lanette A.; Coffey, Cheryl A.; Bosco, Diane M. – Journal of Developmental Education, 1998 (peer reviewed)
Describes a study done at Suffolk County Community College (New York) that assessed the validity of the College Board's Computerized Placement Test in Reading Comprehension (CPT-R) by comparing test results of 1,154 freshmen with the results of the Degree of Power Reading Test. Results confirmed the CPT-R's reliability in identifying basic…
Descriptors: Community Colleges, Cutting Scores, Predictive Validity, Reading Comprehension
Brown, James D.; Hudson, Thom – TESOL Quarterly, 1998 (peer reviewed)
Discusses the types of language tests that language teachers can use in their classrooms for their specific purposes. Language assessments are classified into three categories: selected-response assessments; constructed-response assessments; and personal-response assessments. A definition is provided for each assessment type, and advantages and…
Descriptors: Alternative Assessment, Evaluation Methods, Feedback, Language Teachers
Harlan, Elena; Clark, Lee Anna – Assessment, 1999 (peer reviewed)
Reports the development of a paragraph-descriptor short form of the Schedule for Nonadaptive and Adaptive Personality (SNAP; L. Clark, 1993) with self- and other versions. Data from 294 college students, with parental ratings for 94 students, support the reliability and validity of the measure. (SLD)
Descriptors: Adjustment (to Environment), College Students, Higher Education, Parents
McCabe, Marita P.; Deeks, Amanda A.; Cummins, Robert A. – Research in Developmental Disabilities, 1999 (peer reviewed)
This study reports on the development and assessment of the psychometric properties of three measures to assess sexual knowledge, experience, feelings, and needs. Results demonstrate the good psychometric properties of the scales and their suitability for assessing these areas in people with disabilities. (Author/CR)
Descriptors: Adults, Emotional Adjustment, Mental Retardation, Needs Assessment
White, Joseph M.; Wampler, Richard S.; Winn, Krista I. – Journal of Adolescent Research, 1998 (peer reviewed)
Revised the Identity Style Inventory (ISI) to sixth-grade reading level for use with adolescents/adults with reading limitations. Administered original and revised scales to 361 college students. Found that revised ISI (ISI-6G) scales were reliable and valid. Factor analysis and paired-sample t-test results demonstrated construct validity. ISI-6G…
Descriptors: Adolescent Development, College Students, Higher Education, Identification (Psychology)
Little, Graham – English in Australia, 1998 (peer reviewed)
Suggests that ELLA (English Language and Literacy Assessment), implemented in New South Wales, Australia, fails four tests for sound diagnostic assessment set out in a standard reference (K.W. Howell et al. "Curriculum Based Evaluation")--tests for logicality, reliability, validity, and accuracy and practicality. (RS)
Descriptors: English Instruction, Foreign Countries, Language Arts, Secondary Education


