Publication Date
| In 2026 | 1 |
| Since 2025 | 599 |
| Since 2022 (last 5 years) | 2536 |
| Since 2017 (last 10 years) | 5571 |
| Since 2007 (last 20 years) | 9167 |
Descriptor
| Test Validity | 21743 |
| Test Reliability | 9997 |
| Test Construction | 5880 |
| Foreign Countries | 4941 |
| Psychometrics | 2956 |
| Factor Analysis | 2938 |
| Measures (Individuals) | 2370 |
| Higher Education | 2248 |
| Evaluation Methods | 2084 |
| College Students | 1810 |
| Correlation | 1722 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 805 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 170 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedMilfort, Roline; Greenfield, Daryl B. – Early Childhood Research Quarterly, 2002
Compared teacher and observer ratings of young children's social behaviors during peer play and examined the construct validity of the Penn Interactive Peer Play Scale. Findings indicated that both teacher and observer ratings revealed factor structures reflecting play interaction, disruption, and disconnection. Observer ratings distinguished…
Descriptors: Aggression, Behavior Rating Scales, Child Behavior, Comparative Analysis
Peer reviewedHelwig, Robert; Tindal, Gerald – Assessment for Effective Intervention, 2002
Four alternate versions of a 15-item general outcome measure (GOM) of mathematics conceptual understanding and applications were developed and administered to 117 eighth-graders. Results were correlated with scores on state multiple-choice mathematics achievement tests. Correlations ranged from .81 to .87 with no significant differences, offering…
Descriptors: Educational Assessment, Evaluation Methods, Grade 8, Learning Disabilities
Peer reviewedGoldstein, Sam – Journal of Autism and Developmental Disorders, 2002
The reliability, validity, and clinical utility of the Asperger Syndrome Diagnostic Scale in the diagnosis of pervasive developmental disorders are reviewed. While the measure holds promise as a research tool, there appears little evidence that it can distinguish among the variety of types of pervasive developmental disorders, or diagnose Asperger…
Descriptors: Asperger Syndrome, Autism, Behavior Rating Scales, Classification
Peer reviewedKolstad, Rosemarie K.; Kolstad, Robert A. – Journal of Dental Education, 1991
A study evaluated the use of a "none-of-these" option on multiple-choice achievement tests in undergraduate dental education. Results indicated this option neither enhanced nor diminished examinee performance stability but did reduce the examinee's opportunity to select correct choices by means unrelated to course objectives, thereby enhancing…
Descriptors: Achievement Tests, Dental Schools, Difficulty Level, Higher Education
Peer reviewedLock, Roger – Research in Science and Technological Education, 1990
Investigated are the context dependency and construct validity of practical skill assessment in science which may include the skills of observing, manipulating, planning, interpreting, reporting, and self-reliance. Pupil performance on a variety of practical tasks were compared with external examination grades in biology and chemistry. (CW)
Descriptors: Academic Achievement, Biology, Chemistry, Content Validity
Peer reviewedSymington, David; Spurling, Heather – Research in Science and Technological Education, 1990
Presented is a criticism of the instructions given for and the scoring of the Draw-a-Scientist test (Chambers, 1983). A response to this criticism by authors who employ this method is included. (CW)
Descriptors: Content Validity, Educational Assessment, Elementary School Science, Elementary Secondary Education
Peer reviewedNorris, Stephen P. – Journal of Educational Measurement, 1990
The relevance of verbal reports of thinking for validating multiple-choice critical thinking tests was examined. Results from 342 senior high school students in Newfoundland (Canada) indicate that verbal reports can meet a necessary condition of validation data and collecting data does not alter thinking and performance. (SLD)
Descriptors: Cognitive Tests, Critical Thinking, Foreign Countries, High School Students
Peer reviewedPecheone, Raymond L.; Carey, Neil B. – Journal of Personnel Evaluation in Education, 1990
The Connecticut Teacher Assessment Center Project has, since 1986, been developing a semistructured interview in the area of mathematics to evaluate beginning teacher competence. The strategy for validation of the project's performance tests, Connecticut's reform initiatives, and implications of systematic validity for traditional psychometric…
Descriptors: Beginning Teachers, Higher Education, Interviews, Licensing Examinations (Professions)
Peer reviewedWatkins, C. Edward, Jr.; Campbell, Vicki L. – Counseling Psychologist, 1990
Introduces this issue of "The Counseling Psychologist," which considers several contemporary developments and issues in the areas of testing and assessment and their relevance for counseling psychologists. Topics examined by the papers are identified, along with five additional current issues in testing and assessment. (Author/TE)
Descriptors: Career Counseling, Computer Assisted Testing, Counseling, Evaluation Methods
Peer reviewedHarper, Dennis C.; Wadsworth, John S. – Research in Developmental Disabilities, 1990
This article investigates cognitive decline and depressive symptomatology among older adults with mental retardation. A pilot study of assessment instruments is reported. Findings reveal that decreasing cognitive ability is associated with higher rates of observed depression and reported behavioral problems. Cognitive decline was associated with…
Descriptors: Aging (Individuals), Behavior Problems, Clinical Diagnosis, Cognitive Ability
Peer reviewedRead, John – English for Specific Purposes, 1990
Considers the question of how best to elicit samples of writing for assessment in an English-for-academic-purposes proficiency test and assure that every test taker has something to write about. Three types of writing tasks are defined and analyzed, and examples are given. (25 references) (GLR)
Descriptors: English for Academic Purposes, Higher Education, Language Proficiency, Prior Learning
Peer reviewedNist, Sherrie L.; And Others – Reading Research and Instruction, 1990
Investigates the utility and predictive validity of the Learning and Study Strategies Inventory (LASSI) as a means of measuring college students' cognitive and affective growth following a study strategies course. Finds cognitive and affective growth in both regularly admitted and developmental studies students. Finds that LASSI cannot yet be used…
Descriptors: Affective Measures, Cognitive Measurement, College Students, Developmental Studies Programs
Peer reviewedChletsos, Peter N.; And Others – Journal of Research and Development in Education, 1989
This article presents evidence of the reliability and validity of a new paper-and-pencil test of proportional reasoning, Paper-and-Pencil Balance Beam Test. A Total of 627 individuals, aged 8-47, participated in the 3 studies discussed. Results support previous research which correlates performance on proportional reasoning problems with…
Descriptors: Age Differences, Cognitive Development, Elementary Secondary Education, Formal Operations
Peer reviewedMatthews, Margaret – ELT Journal, 1990
Discusses problems with the current trend in using behavior trait-based criteria to assess English-as-a-Second-Language productivity skills, and describes alternatives to such testing that involve the matching of linguistic tasks against nonlinguistic criteria. (Author/CB)
Descriptors: Communicative Competence (Languages), English (Second Language), Evaluation Criteria, Language Proficiency
Peer reviewedCaudery, Tim – ELT Journal, 1990
An examination found no significant differences between timed- and untimed-essay test scores of adolescent students of English as a Second Language. Results suggest that there is a need for more expansive research regarding sample size and age, different time limits, students' educational backgrounds, writing skill training, communicative aims,…
Descriptors: English (Second Language), Essay Tests, Language Proficiency, Language Research


