NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 19 results Save | Export
Peer reviewed Peer reviewed
Lee, Kin Hong; Lu, Qin; Ng, Mau Kit Michael – Journal of the American Society for Information Science, 1999
Discussion of spell checking for Chinese words proposes a Block-of-Combinations (BOC) text-segmentation method based on frequency of word usage to reduce the word combinations from exponential growth to linear growth. Suggests user interaction to make the segmentation more suitable for spell checking. (Author/LRW)
Descriptors: Chinese, Ideography, Word Frequency
Peer reviewed Peer reviewed
Rosenberg, Victor – Journal of the American Society for Information Science, 1971
A statistical measure is developed for predicting the terms from a restricted vocabulary that will be used to index a document, given that one of the index terms is known. (Author)
Descriptors: Computational Linguistics, Indexing, Statistical Analysis, Subject Index Terms
Peer reviewed Peer reviewed
Pao, Miranda Lee – Journal of the American Society for Information Science, 1978
Describes a method of selecting index terms directly from a word frequency list, an idea originally suggested by Goffman. Results of the analysis of word frequencies of two articles seem to indicate that the automated selection of index terms from a frequency list holds some promise for automatic indexing. (Author/MBR)
Descriptors: Automatic Indexing, Comparative Analysis, Experiments, Indexing
Peer reviewed Peer reviewed
Chen, Ye-Sho; Leimkuhler, Ferdinand F. – Journal of the American Society for Information Science, 1986
A common functional relationship among Lotka's law, Bradford's law, and Zipf's law is derived. The proof takes explicit account of the sequences of observed values of the variables by means of an index. This approach results in a more realistic and precise formulation of each law. (Author/EM)
Descriptors: Comparative Analysis, Goodness of Fit, Information Theory, Mathematical Models
Peer reviewed Peer reviewed
Svenonius, Elaine – Journal of the American Society for Information Science, 1972
The question is asked: Of index terms assigned to documents, which function most effectively in retrieval, the most used or popular terms, or those which are used relatively infrequently? The experiment is a retrieval experiment and uses the Cranfield-Salton data. (14 references) (Author)
Descriptors: Indexing, Information Processing, Relevance (Information Retrieval), Subject Index Terms
Peer reviewed Peer reviewed
Thom, James A.; Zobel, Justin – Journal of the American Society for Information Science, 1992
Discusses models for the distribution of words in text and proposes a new model based on clustering that can be used to estimate the probability that a document contains a particular word as well as the number of distinct words in a document. Zipf's law and the Poisson approximation are also discussed. (18 references) (LRW)
Descriptors: Cluster Grouping, Mathematical Formulas, Models, Probability
Peer reviewed Peer reviewed
Lindsay, Robert K.; Gordon, Michael D. – Journal of the American Society for Information Science, 1999
Reports the results of experiments with MEDLINE that used lexical statistics such as word-frequency counts to discover hidden connections in medical literature. Discusses problems with relying on bibliographic citations or standard indexing methods to establish a relationship between topics that might profitably be explored by scientific research.…
Descriptors: Citations (References), Computational Linguistics, Indexing, Medical Research
Peer reviewed Peer reviewed
Salton, G.; And Others – Journal of the American Society for Information Science, 1981
Reviews major term-weighting theories, presents methods for estimating the relevance properties of terms based on their frequency characteristics in a document collection, and compares weighting systems using term relevance properties with more conventional frequency-based methodologies. Eighteen references are cited. (Author/FM)
Descriptors: Automatic Indexing, Bibliographies, Information Retrieval, Methods
Peer reviewed Peer reviewed
Nelson, Michael J.; Tague, Jean M. – Journal of the American Society for Information Science, 1985
Proposes split model for index term distribution in document set that uses rank function for high frequency terms and size function for low frequency terms; the point of transition is determined either empirically or by rule. Distributions to describe index term exhaustivity and term co-occurrence are considered briefly. (36 references) (EJS)
Descriptors: Databases, Indexing, Information Retrieval, Models
Peer reviewed Peer reviewed
Sun, Qinglan; Shaw, Debora; Davis, Charles H. – Journal of the American Society for Information Science, 1999
Proposes a model, based on a "maximum ranking method," for more simply estimating the frequency of any same-frequency words and identifying the boundary point between high-frequency and low-frequency words in a text. This model was used successfully with English and Chinese texts, demonstrating that the frequency of words and number of…
Descriptors: Chinese, Electronic Text, English, Information Science
Peer reviewed Peer reviewed
Salton, G.; And Others – Journal of the American Society for Information Science, 1983
Introduces methods designed to reduce role of search intermediaries by generating Boolean search formulations automatically using term frequency considerations from natural language statements provided by system patrons. Experimental results are supplied and methods are described for applying automatic query formulation process in practice.…
Descriptors: Information Retrieval, Online Systems, Relevance (Information Retrieval), Search Strategies
Peer reviewed Peer reviewed
Dillon, Martin; Federhart, Peggy – Journal of the American Society for Information Science, 1982
Presents a method for identifying indexing terms from word stems, using discriminant analysis to distinguish terms which refer to topics from terms which do not refer to topics. A test of the method on the Harris Survey Question database is discussed. Included are 11 data tables and a reference list. (Author/JL)
Descriptors: Automatic Indexing, Classification, Discriminant Analysis, Information Retrieval
Peer reviewed Peer reviewed
Kim, Chai – Journal of the American Society for Information Science, 1982
Examines and compares the theoretical and empirical bases for the use of the relative frequency of descriptor use in the design and maintenance of thesauri for information retrieval in the social and natural sciences. Data are presented in two tables and a reference list is included. (JL)
Descriptors: Indexing, Information Retrieval, Natural Sciences, Social Sciences
Peer reviewed Peer reviewed
Dillon, Martin; Federhart, Peggy – Journal of the American Society for Information Science, 1984
Discusses ways to improve quality of retrieval systems that depend on use of truncated words or quasi-word stems as indexing vocabulary. Problems of generalizability and stability of discriminant function analysis for selecting good topical terms in database drawn from abstracts of Harris Survey press releases are addressed. References are cited.…
Descriptors: Classification, Databases, Discriminant Analysis, Information Retrieval
Peer reviewed Peer reviewed
Fedorowicz, Jane – Journal of the American Society for Information Science, 1982
Derives the underlying structure of the Zipf distribution, with emphasis on its application to word frequencies in the inverted files of automatic bibliographic systems, and applies the Zipfian model to the National Library of Medicine's MEDLINE database. An appendix on the Zipfian mean and 12 references are included. (Author/JL)
Descriptors: Citations (References), Databases, Information Retrieval, Mathematical Models
Previous Page | Next Page ยป
Pages: 1  |  2