Showing all 3 results
Peer reviewed
Shaw, W. M., Jr. – Information Processing and Management, 1990
Investigates the presence of clustering structure in a document collection and the influence of the presence of clustering structure on the success of cluster-based retrieval. Term-weight and similarity thresholds are discussed, empirical and statistical significance are considered, and indexing exhaustivity for document representation is…
Descriptors: Cluster Grouping, Documentation, Indexing, Information Retrieval
PDF pending restoration
White, Lee J.; And Others – 1975
The major advantage of sequential classification, a technique for automatically classifying documents into previously selected categories, is that the entire document need not be processed before it is classified. This method assumes the availability of a priori categories, a selection of keywords representative of these categories, and the a…
Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification
Kar, B. Gautam; White, Lee J. – 1975
The feasibility of using a distance measure, called the Bayesian distance, for automatic sequential document classification was studied. Results indicate that, by observing the variation of this distance measure as keywords are extracted sequentially from a document, the occurrence of noisy keywords may be detected. This property of the distance…
Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification