Descriptor
Automatic Indexing | 3 |
Cluster Grouping | 3 |
Databases | 3 |
Algorithms | 2 |
Bayesian Statistics | 2 |
Classification | 2 |
Documentation | 2 |
Mathematical Models | 2 |
Probability | 2 |
Sequential Approach | 2 |
Statistical Analysis | 2 |
More ▼ |
Source
Information Processing and… | 1 |
Publication Type
Reports - Research | 2 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Crouch, Donald B. – Information Processing and Management, 1975
Describes a clustering algorithm designed for dynamic data bases and presents an update procedure which maintains an effective document classification without reclustering. The effectiveness of the algorithms is demonstrated for a subset of the Cranfield collection. (Author)
Descriptors: Automatic Indexing, Cluster Grouping, Databases, Information Retrieval

White, Lee J.; And Others – 1975
The major advantage of sequential classification, a technique for automatically classifying documents into previously selected categories, is that the entire document need not be processed before it is classified. This method assumes the availability of a priori categories, a selection of keywords representative of these categories, and the a…
Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification
Kar, B. Gautam; White, Lee J. – 1975
The feasibility of using a distance measure, called the Bayesian distance, for automatic sequential document classification was studied. Results indicate that, by observing the variation of this distance measure as keywords are extracted sequentially from a document, the occurrence of noisy keywords may be detected. This property of the distance…
Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification