Descriptor
Mathematical Models | 6 |
Statistical Distributions | 6 |
Subject Index Terms | 6 |
Information Retrieval | 4 |
Databases | 3 |
Automatic Indexing | 2 |
Comparative Analysis | 2 |
Computer System Design | 2 |
Database Design | 2 |
Foreign Countries | 2 |
Higher Education | 2 |
More ▼ |
Author
Wolfram, Dietmar | 2 |
Biru, Tesfaye | 1 |
Buckley, Christopher | 1 |
Fedorowicz, Jane | 1 |
Nelson, Michael J. | 1 |
Salton, Gerard | 1 |
Publication Type
Journal Articles | 6 |
Reports - Research | 6 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Nelson, Michael J. – Journal of Documentation, 1989
Presents a probability model of the occurrence of index terms used to derive discrete distributions which are mixtures of Poisson and negative binomial distributions. These distributions give better fits than the simpler Zipf distribution, have the advantage of being more explanatory, and can incorporate a time parameter if necessary. (25…
Descriptors: Goodness of Fit, Mathematical Models, Probability, Statistical Distributions

Biru, Tesfaye; And Others – Journal of Documentation, 1989
Discusses the effect of including relevance data on the calculation of term discrimination values in bibliographic databases. Algorithms that calculate the ability of index terms to discriminate between relevant and non-relevant documents are described and tested. The results are discussed in terms of the relationship between term frequency and…
Descriptors: Algorithms, Automatic Indexing, Bibliographic Databases, Mathematical Models

Fedorowicz, Jane – Journal of the American Society for Information Science, 1982
Derives the underlying structure of the Zipf distribution, with emphasis on its application to word frequencies in the inverted files of automatic bibliographic systems, and applies the Zipfian model to the National Library of Medicine's MEDLINE database. An appendix on the Zipfian mean and 12 references are included. (Author/JL)
Descriptors: Citations (References), Databases, Information Retrieval, Mathematical Models

Salton, Gerard; Buckley, Christopher – Information Processing and Management, 1988
Summarizes the experimental evidence that indicates that text indexing systems based on the assignment of appropriately weighted single terms produce retrieval results superior to those obtained with more elaborate text representations, and provides baseline single term indexing models with which more elaborate content analysis procedures can be…
Descriptors: Automatic Indexing, Comparative Analysis, Content Analysis, Information Retrieval

Wolfram, Dietmar – Information Processing and Management, 1992
Examines how informetric characteristics of information retrieval (IR) system databases can be used to help system designers decide what type of file structures would provide the best performance. The development of appropriate models describing database contents is highlighted in this first part of a two-part study. (30 references) (LRW)
Descriptors: Computer System Design, Database Design, Databases, Foreign Countries

Wolfram, Dietmar – Information Processing and Management, 1992
This second report on a two-part study on the application of informetrics to information retrieval (IR) system design used models of database contents in a factorial simulation study at the University of Western Ontario. The study explored whether different file structures were better suited for different informetric environments. (32 references)…
Descriptors: Analysis of Variance, Comparative Analysis, Computer Simulation, Computer System Design