Publication Date
| In 2026 | 12 |
| Since 2025 | 958 |
| Since 2022 (last 5 years) | 4567 |
| Since 2017 (last 10 years) | 10500 |
| Since 2007 (last 20 years) | 21963 |
Descriptor
| Test Validity | 21786 |
| Validity | 13791 |
| Test Reliability | 10864 |
| Foreign Countries | 9887 |
| Test Construction | 6897 |
| Factor Analysis | 5761 |
| Measures (Individuals) | 5633 |
| Predictive Validity | 5022 |
| Psychometrics | 4820 |
| Reliability | 4635 |
| Correlation | 4376 |
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
Location
| Turkey | 1397 |
| Australia | 705 |
| Canada | 626 |
| China | 528 |
| United States | 439 |
| Indonesia | 389 |
| United Kingdom | 363 |
| Germany | 340 |
| California | 338 |
| Netherlands | 336 |
| Spain | 311 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
[Peer reviewed] Brennan, Robert L. – Educational Measurement: Issues and Practice, 2001
Discusses some problems, pitfalls, and paradoxes that challenge measurement theory and practice, especially for K-12 achievement testing. Considers a number of technical issues, especially some related to reliability. Also discusses a number of practical or political issues related to validation and accountability. (SLD)
Descriptors: Accountability, Achievement Tests, Educational Testing, Educational Theories
[Peer reviewed] Denzine, Gypsy M.; Kowalski, Gerard J. – Measurement and Evaluation in Counseling and Development, 2002
The Assessment for Living and Learning (ALL; G.M. Denzine, 1994, 1996) measures college students' perceptions of the academic climate in their residence hall. Results of confirmatory factor analyses reveal that the data did not provide an adequate fit to the measurement model underlying the ALL. A revised model was tested and is recommended for use.…
Descriptors: Attitude Measures, College Environment, College Students, Dormitories
[Peer reviewed] Gibbs, William; Graves, Pat R.; Bernas, Ronan S. – Journal of Research on Technology in Education, 2001
Describes a study that used a Web-based survey and a modified Delphi technique to identify criteria important to multimedia instructional courseware evaluation, validate them with a panel of instructional technology experts, and examine the effect of conducting panel discussions online. Shows information accuracy and reliability as the most…
Descriptors: Computer Software Evaluation, Courseware, Delphi Technique, Discussion
[Peer reviewed] Bowen, Betsy A. – English Education, 2002
Discusses the Praxis II exam, "English Language, Literature, and Composition: Content Knowledge," a two-hour multiple-choice exam with questions on American, British, and world literature, literary terms, grammar and usage, and teaching. Suggests that knowledge of subject matter seems to be related to successful teaching, but whether…
Descriptors: Creative Writing, English Instruction, Higher Education, Literature
[Peer reviewed] Good, Roland H., III; Simmons, Deborah C.; Kame'enui, Edward J. – Scientific Studies of Reading, 2001
Explores the utility of a continuum of fluency-based indicators of foundational early literacy skills to predict reading outcomes, inform educational decisions, and change reading outcomes for students at risk of reading difficulty. Outlines a continuum of fluency-based indicators of foundational reading skills. Examines utility and predictive…
Descriptors: Evaluation Methods, High Risk Students, High Stakes Tests, Primary Education
[Peer reviewed] Wolming, Simon – Studies in Educational Evaluation, 1999
Investigated the validity of the Swedish model for selection to higher education, using S. Messick's (1989) four-faceted model of validity and grade-point-average and admissions test data for 314 college students. Results challenge the unidimensional use of the selection instruments. (SLD)
Descriptors: Admission (School), College Entrance Examinations, College Students, Foreign Countries
[Peer reviewed] Sarafino, Edward P.; Ewing, Maureen – Journal of American College Health, 1999
Describes the development of the Hassles Assessment Scale for Students in College, which measured student stress. Development involved item generation, psychometric evaluation, and revision. Separate student samples participated in each phase. Results found very high levels of internal consistency for the frequency, unpleasantness, and dwelling…
Descriptors: College Students, Coping, Higher Education, Stress Management
[Peer reviewed] Bachman, Lyle F. – Language Testing, 2000
Reviews developments in language testing research and practice over the last 20 years, and suggests future directions in the areas of professionalizing the field and validation research. Argues that concerns for ethical conduct must be grounded in valid test use, so that professionalization and validation research are inseparable. (Author/VWL)
Descriptors: Ethics, Language Research, Language Tests, Second Language Instruction
[Peer reviewed] Koretz, Daniel; Stecher, Brian; Klein, Stephen; McCaffrey, Daniel – Educational Measurement: Issues and Practice, 1994
Reports on an ongoing evaluation of the Vermont portfolio assessment program. Indicates that the positive news about the instructional effects of the assessment program is in contrast with the empirical findings about the quality of the data the program has yielded. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment
[Peer reviewed] Bennett, Randy Elliot; Rock, Donald A. – Journal of Educational Measurement, 1995
Examined the generalizability and validity and examinee perceptions of a computer-delivered version of 8 formulating-hypotheses tasks administered to 192 graduate students. Results support previous research that has suggested that formulating-hypotheses items can broaden the abilities measured by graduate admissions measures. (SLD)
Descriptors: Admission (School), College Entrance Examinations, Computer Assisted Testing, Generalizability Theory
[Peer reviewed] Burchinal, Margaret R.; Nelson, Lauren – Early Childhood Research Quarterly, 2000
Discusses family selection issues that should be considered in child care research, and evidence demonstrating why each should be considered. Issues include whether causal inferences can be made from observational studies and the impact on conclusions from regression analyses that include highly correlated measures of child care experiences,…
Descriptors: Data Interpretation, Day Care, Early Experience, Influences
[Peer reviewed] Smith, Tina T.; Lee, Evan; McDade, Hiram L. – Communication Disorders Quarterly, 2001
This study investigated the dialectal sensitivity of the T-unit as a nonbiased alternative for assessing the oral grammatical skills of school-age, nonstandard English speakers. Analysis of language samples from 28 9-year-old children (half African-American) revealed no significant differences between groups, suggesting that the T-unit may be a…
Descriptors: Black Dialects, Black Students, Culture Fair Tests, Elementary Education
[Peer reviewed] Stichter, Janine Peck – Focus on Autism and Other Developmental Disabilities, 2001
This article suggests possible applications of experimental analyses using analogues to empirically verify results of functional assessments in classrooms for students with autism and related disabilities. Analogue assessments involve creating conditions in which antecedents and consequences are held constant and specific variables suspected to…
Descriptors: Action Research, Autism, Behavior Modification, Behavioral Science Research
[Peer reviewed] McCracken, Nancy Mellin; McCracken, Hugh Thomas – English Journal, 2001
Asks several teachers what they have lost from their teaching or their classroom since the growth in mandated, standardized testing. Considers the ill effects of mandated testing, and names some educational essentials at risk of being lost while testing rules. Discusses what is lost in high-stakes multiple-choice testing of new teachers. (SG)
Descriptors: High Stakes Tests, Higher Education, Preservice Teachers, Secondary Education
Naevdal, F. – Journal of Adolescence, 2005
The article presents a psychometric description of 11 statements related to use of physical violence. The items were tested in a normal sample (N=1700, age: 15-16) from urban and rural areas in Western Norway. The internal reliability was α=0.86, and the factor analysis resulted in two factors. Boys had higher mean scores than girls.…
Descriptors: Test Reliability, Predictor Variables, Test Validity, Gender Differences

