ERIC - Search Results

Descriptor

Algorithms	5
Automatic Indexing	5
Mathematical Models	5
Statistical Analysis	3
Bayesian Statistics	2
Classification	2
Cluster Grouping	2
Databases	2
Documentation	2
Probability	2
Sequential Approach	2
Subject Index Terms	2
Bibliographic Databases	1
Comparative Analysis	1
Computational Linguistics	1
Discriminant Analysis	1
Feasibility Studies	1
Flow Charts	1
Indexing	1
Information Retrieval	1
Measurement Techniques	1
Relevance (Information…	1
Statistical Distributions	1
Tables (Data)	1

Source

Information Processing and…	1
Journal of Documentation	1
Journal of the American…	1

Author

White, Lee J.	2
Biru, Tesfaye	1
Crouch, Carolyn J.	1
Harter, Stephen P.	1
Kar, B. Gautam	1

Publication Type

Reports - Research	4
Journal Articles	2

Audience

Researchers

Showing all 5 results

A Probabilistic Approach to Automatic Keyword Indexing: Part II, An Algorithm for Probabilistic Indexing

Peer reviewed

Harter, Stephen P. – Journal of the American Society for Information Science, 1975

A probabilistic model of keyword indexing is outlined, and some of the consequences of the model are examined. An algorithm defining a measure of indexability is developed--a measure intended to reflect the relative significance of words in documents. (Author)

Descriptors: Algorithms, Automatic Indexing, Indexing, Mathematical Models

An Analysis of Approximate versus Exact Discrimination Values.

Peer reviewed

Crouch, Carolyn J. – Information Processing and Management, 1988

Describes the two basic approaches to the calculation of term discrimination values for automatic indexing. The results of an experiment that investigated the differences between algorithms of these two approaches in terms of their impact on the discrimination value model are reported and discussed. (13 references) (Author/CLB)

Descriptors: Algorithms, Automatic Indexing, Comparative Analysis, Computational Linguistics

Inclusion of Relevance Information in the Term Discrimination Model.

Peer reviewed

Biru, Tesfaye; And Others – Journal of Documentation, 1989

Discusses the effect of including relevance data on the calculation of term discrimination values in bibliographic databases. Algorithms that calculate the ability of index terms to discriminate between relevant and non-relevant documents are described and tested. The results are discussed in terms of the relationship between term frequency and…

Descriptors: Algorithms, Automatic Indexing, Bibliographic Databases, Mathematical Models

A Sequential Method for Automatic Document Classification.

White, Lee J.; And Others – 1975

The major advantage of sequential classification, a technique for automatically classifying documents into previously selected categories, is that the entire document need not be processed before it is classified. This method assumes the availability of a priori categories, a selection of keywords representative of these categories, and the a…

Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification

A Distance Measure for Automatic Sequential Document Classification.

Kar, B. Gautam; White, Lee J. – 1975

The feasibility of using a distance measure, called the Bayesian distance, for automatic sequential document classification was studied. Results indicate that, by observing the variation of this distance measure as keywords are extracted sequentially from a document, the occurrence of noisy keywords may be detected. This property of the distance…

Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification