Showing all 6 results
van Rijsbergen, C. J. – Information Storage and Retrieval, 1974
Reports the results of experiments in document clustering using three well-known test collections. (Author)
Descriptors: Classification, Cluster Grouping, Evaluation Methods, Information Retrieval
Cornell Univ., Ithaca, NY. Dept. of Computer Science. – 1970
Two papers are included as Part Four of this report on Salton's Magical Automatic Retriever of Texts (SMART) project. The first paper, "A Controlled Single Pass Classification Algorithm with Application to Multilevel Clustering" by D. B. Johnson and J. M. Lafuente, presents a single-pass clustering method which compares favorably…
Descriptors: Algorithms, Automation, Classification, Cluster Grouping
Peer reviewed
Shaw, W. M., Jr. – Journal of the American Society for Information Science, 1991
Two articles discuss the clustering of composite representations in the Cystic Fibrosis Document Collection from the National Library of Medicine's MEDLINE file. Clustering is evaluated as a function of the exhaustivity of composite representations based on Medical Subject Headings (MeSH) and citation indexes, and evaluation of retrieval…
Descriptors: Citation Indexes, Cluster Grouping, Cystic Fibrosis, Evaluation Methods
Peer reviewed
Nomoto, Tadashi; Matsumoto, Yuji – Information Processing & Management, 2003
Introduces a novel approach to unsupervised text summarization. Proposes an "information-centric" approach to evaluation, where the quality of summaries is judged not in terms of how well they match human-created summaries but in terms of how well they represent their source documents in information retrieval tasks such as document…
Descriptors: Cluster Analysis, Cluster Grouping, Electronic Text, Evaluation Methods
Peer reviewed
Shaw, W. M., Jr. – Information Processing and Management, 1990
These two articles discuss clustering structure in the Cystic Fibrosis Document Collection, which is derived from the National Library of Medicine's MEDLINE file. The exhaustivity of four subject representations and two citation representations is examined, and descriptor-weight thresholds and similarity thresholds are used to compute…
Descriptors: Citation Indexes, Citations (References), Cluster Grouping, Comparative Analysis
Salton, Gerald – 1967
The twelfth in a series covering research in automatic storage and retrieval, this report is divided into three parts titled Evaluation, Cluster Searching, and User Feedback Methods, respectively. The first part, Evaluation, contains a complete summary of the retrieval results derived from some sixty different text analysis experiments. In each…
Descriptors: Cluster Grouping, Computational Linguistics, Correlation, Evaluation Methods