Descriptor
Cluster Analysis | 42 |
Information Retrieval | 42 |
Cluster Grouping | 15 |
Classification | 13 |
Relevance (Information… | 12 |
Search Strategies | 12 |
Databases | 10 |
Information Systems | 10 |
Subject Index Terms | 9 |
Comparative Analysis | 8 |
Online Systems | 7 |
More ▼ |
Source
Author
Willett, Peter | 4 |
Minker, Jack | 3 |
Griffiths, Alan | 2 |
Bookstein, A. | 1 |
Borner, Katy | 1 |
Boyack, Kevin W. | 1 |
Brittain, J. Michael | 1 |
Carlyle, Allyson | 1 |
Chen, Chaomei | 1 |
Chen, Hsinchun | 1 |
Damerau, Fred J. | 1 |
More ▼ |
Publication Type
Journal Articles | 30 |
Reports - Research | 23 |
Reports - Descriptive | 7 |
Information Analyses | 6 |
Opinion Papers | 4 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 3 |
Reports - General | 1 |
Education Level
Audience
Researchers | 9 |
Location
United Kingdom (England) | 1 |
West Germany | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Voorhees, Ellen M. – Information Processing and Management, 1986
Describes a computerized information retrieval system that uses three agglomerative hierarchic clustering algorithms--single link, complete link, and group average link--and explains their implementations. It is noted that these implementations have been used to cluster a collection of 12,000 documents. (LRW)
Descriptors: Algorithms, Cluster Analysis, Databases, Information Retrieval

Willett, Peter – Journal of the American Society for Information Science, 1984
Describes a cluster-based information retrieval procedure that can significantly reduce the computational requirements of the single linkage method, while still maintaining the retrieval effectiveness of the resulting classifications. Use of nearest neighbors, experimental details, and results and conclusions are highlighted. Fourteen references…
Descriptors: Cluster Analysis, Cluster Grouping, Information Retrieval, Relevance (Information Retrieval)

Minker, Jack; And Others – Journal of the American Society for Information Science, 1973
The objectives of this paper are to describe the effect of using weighted index terms in a document retrieval system, and to evaluate retrieval performance when queries are expanded by terms occurring in clusters with the query terms. (16 references) (Authors)
Descriptors: Cluster Analysis, Evaluation, Information Retrieval, Information Systems

Sparck Jones, K.; Van Rijsbergen, C. J. – Journal of Documentation, 1973
Substantial alterations to a system often have little or no effect on particular collections. This may be due to poor separation of relevant and non-relevant documents. The paper presents a procedure for characterizing this separation, which can show whether proposed modifications of the base system are likely to be useful. (8 references)…
Descriptors: Automatic Indexing, Classification, Cluster Analysis, Databases
Sparck Jones, Karen – Information Storage and Retrieval, 1973
Retrieval performance with automatic term classifications for three test collections has been variable. This paper attempts to discover why. The real difference between the collections is in the separation of relevant from non-relevant documents. The separation is so poor that classification cannot be expected to succeed. (14 references)…
Descriptors: Automatic Indexing, Classification, Cluster Analysis, Databases
Minker, Jack; And Others – 1972
The objectives of this paper are to describe the effect of using weighted index terms in a document retrieval system, and to evaluate retrieval performance when queries are expanded by terms occurring in clusters with the query terms. Three data collections, each indexed by several methods, two of which were studied and reported on in previous…
Descriptors: Classification, Cluster Analysis, Data Processing, Information Retrieval

Shaw, Rachel J.; Willett, Peter – Information Processing and Management, 1993
Examines research on the observed values of retrieval effectiveness obtained in searches of files of nearest neighbor document clusters. Results show interdocument similarities used to generate nearest-neighbor clusters are significantly different from randomly generated clusters. (Contains 14 references.) (EAM)
Descriptors: Bibliographic Records, Cluster Analysis, Comparative Analysis, Information Retrieval
Minker, Jack; And Others – Information Storage and Retrieval, 1972
An evaluation of graph theoretical clusters of index terms which can be extracted from an automatically indexed document collection, and the effects of employing such clusters in automatic document retrieval are described. (19 references) (Author)
Descriptors: Automatic Indexing, Cluster Analysis, Data Processing, Information Retrieval

Nomoto, Tadashi; Matsumoto, Yuji – Information Processing & Management, 2003
Introduces a novel approach to unsupervised text summarization. Proposes an "information-centric" approach to evaluation, where the quality of summaries is judged not in terms of how well they match human-created summaries but in terms of how well they represent their source documents in information retrieval tasks such as document…
Descriptors: Cluster Analysis, Cluster Grouping, Electronic Text, Evaluation Methods

Griffiths, Alan; And Others – Journal of the American Society for Information Science, 1986
Reports on comparative study of document classifications produced by use of single linkage, complete linkage, group average, and Ward clustering methods. Findings of work that compares use of clusters consisting of pairs of documents with conventional best match searches are also reported. Thirty-four references are provided. (EJS)
Descriptors: Cluster Analysis, Cluster Grouping, Comparative Analysis, Information Retrieval

Griffiths, Alan; And Others – Journal of Documentation, 1984
Considers classifications produced by application of single linkage, complete linkage, group average, and word clustering methods to Keen and Cranfield document test collections, and studies structure of hierarchies produced, extent to which methods distort input similarity matrices during classification generation, and retrieval effectiveness…
Descriptors: Algorithms, Classification, Cluster Analysis, Cluster Grouping

Sumner, Robert G., Jr. – Information Processing & Management, 1995
The effectiveness of using the age of references to control the exhaustivity of the reference representation in information retrieval was investigated through analysis of optimal cluster-based retrieval results. The results show that the foreground representation at its optimal level of exhaustivity is restricted to references with ages less than…
Descriptors: Bibliographic Coupling, Bibliographic Databases, Citation Analysis, Citations (References)

Murtagh, F. – Information Processing and Management, 1984
Using examples of data from the areas of information retrieval and of multivariate data analysis, six hierarchic clustering algorithms (single link, median, centroid, group average, complete link, Wards's) are examined and evaluated by using three proposed coefficients of hierarchic structure. Nine references are cited. (EJS)
Descriptors: Algorithms, Cluster Analysis, Cluster Grouping, Data Analysis

Yu, C. T. – Journal of the American Society for Information Science, 1976
A measure for the quantification of the changes in classification under small changes in data is proposed. (Author)
Descriptors: Classification, Cluster Analysis, Cluster Grouping, Information Retrieval
Salton, G. – Information Storage and Retrieval, 1972
The author emphasized that one cannot conclude from the experiments reported upon that term clusters (or equivalently, keyword classifications or thesauruses) are not useful in retrieval. (2 references) (Author)
Descriptors: Cluster Analysis, Information Retrieval, Information Systems, Subject Index Terms