NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 6 results Save | Export
Peer reviewed Peer reviewed
And Others; Salton, G. – Journal of the American Society for Information Science, 1975
A new technique, known as discrimination value analysis, ranks the text words in accordance with how well they are able to discriminate the documents of a collection from each other. (Author/PF)
Descriptors: Automatic Indexing, Databases, Discriminant Analysis, Information Processing
PDF pending restoration PDF pending restoration
White, Lee J.; And Others – 1975
The major advantage of sequential classification, a technique for automatically classifying documents into previously selected categories, is that the entire document need not be processed before it is classified. This method assumes the availability of a priori categories, a selection of keywords representative of these categories, and the a…
Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification
Kar, B. Gautam; White, Lee J. – 1975
The feasibility of using a distance measure, called the Bayesian distance, for automatic sequential document classification was studied. Results indicate that, by observing the variation of this distance measure as keywords are extracted sequentially from a document, the occurrence of noisy keywords may be detected. This property of the distance…
Descriptors: Algorithms, Automatic Indexing, Bayesian Statistics, Classification
Peer reviewed Peer reviewed
Parker, Lorraine M. Purgailis – Journal of the American Society for Information Science, 1983
The mathematical model proposed describes a computerized bibliographic information system which includes a method--document learning--of improving the set of index terms assigned to a document representative. Inputs to the information system (index terms and query), relevance feedback, and assumptions concerning the model are discussed.…
Descriptors: Automation, Databases, Indexing, Information Needs
Peer reviewed Peer reviewed
Wolfram, Dietmar – Information Processing and Management, 1992
Examines how informetric characteristics of information retrieval (IR) system databases can be used to help system designers decide what type of file structures would provide the best performance. The development of appropriate models describing database contents is highlighted in this first part of a two-part study. (30 references) (LRW)
Descriptors: Computer System Design, Database Design, Databases, Foreign Countries
Peer reviewed Peer reviewed
Wolfram, Dietmar – Information Processing and Management, 1992
This second report on a two-part study on the application of informetrics to information retrieval (IR) system design used models of database contents in a factorial simulation study at the University of Western Ontario. The study explored whether different file structures were better suited for different informetric environments. (32 references)…
Descriptors: Analysis of Variance, Comparative Analysis, Computer Simulation, Computer System Design