Wisniewski, Janusz L. – Information Processing and Management, 1986
Discussion of a new method of index term dictionary compression in an inverted-file-oriented database highlights a technique of word coding, which generates short fixed-length codes obtained from the index terms themselves by analysis of monogram and bigram statistical distributions. Substantial savings in communication channel utilization are…
Descriptors: Algorithms, Database Management Systems, Databases, Information Retrieval

Willett, Peter – Information Processing and Management, 1985
Reports algorithm for calculation of term discrimination values that is sufficiently fast in operation to permit use of exact values. Evidence is presented to show that relationship between term discrimination and term frequency is crucially dependent upon type of inter-document similarity measure used for calculation of discrimination values. (13…
Descriptors: Algorithms, Graphs, Information Retrieval, Information Systems
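For orientation, the discrimination value of a term is conventionally defined as the change in average inter-document similarity (the "space density") when that term is removed from every document vector. The sketch below is a naive, brute-force illustration of that definition, not the fast algorithm the article reports. The cosine measure, the dense-list document representation, and all function names are assumptions made for the example; as the abstract notes, results depend on which similarity measure is chosen.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length weight vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def space_density(docs):
    """Average similarity over all distinct document pairs (needs at least 2 documents)."""
    n = len(docs)
    total = sum(cosine(docs[i], docs[j]) for i in range(n) for j in range(i + 1, n))
    return total / (n * (n - 1) / 2)

def discrimination_values(docs):
    """For each term k: density with term k removed minus density with all terms.

    Positive values mark terms whose presence spreads documents apart
    (good discriminators); negative values mark terms that pull them together.
    """
    base = space_density(docs)
    n_terms = len(docs[0])
    values = []
    for k in range(n_terms):
        reduced = [[w for t, w in enumerate(d) if t != k] for d in docs]
        values.append(space_density(reduced) - base)
    return values

# Hypothetical toy collection: 3 documents over 3 terms; term 0 occurs in every document.
docs = [[1, 1, 0], [1, 0, 1], [1, 1, 1]]
values = discrimination_values(docs)
```

In this toy collection the term that occurs in every document receives a negative value, while the two less frequent terms receive positive values. The brute-force approach recomputes all pairwise similarities once per term, which is the cost a faster exact method would need to avoid.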

Smith, F. J.; Devine, K. – Information Processing and Management, 1985
Zipfian laws for frequency distributions of word pairs and longer phrases are derived from text sample analysis. From crossing of Zipfian curves, it is deduced that number of multi-word phrases that occur frequently in text is surprisingly small, of same order of magnitude as number of individual word-types. (8 references) (EJS)
Descriptors: Algorithms, Graphs, Indexing, Information Retrieval
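As a rough illustration of the kind of measurement behind such rank-frequency laws, the sketch below tabulates rank against frequency for single words and for word pairs, and counts how many distinct types occur at least a given number of times. It is a minimal example under stated assumptions, not the authors' procedure: the function names, the whitespace tokenization, and the frequency threshold are all hypothetical.

```python
from collections import Counter

def ngram_counts(tokens, n):
    """Count contiguous word n-grams (n=1 gives single words, n=2 gives word pairs)."""
    return Counter(zip(*(tokens[i:] for i in range(n))))

def rank_frequency(tokens, n):
    """Return (rank, frequency) pairs, most frequent n-gram first."""
    counts = ngram_counts(tokens, n)
    return [(rank, freq)
            for rank, (_, freq) in enumerate(counts.most_common(), start=1)]

def frequent_types(tokens, n, min_count):
    """Number of distinct n-grams occurring at least min_count times."""
    return sum(1 for freq in ngram_counts(tokens, n).values() if freq >= min_count)

# Hypothetical toy text; a real analysis would use a large text sample.
tokens = "a b a b a c".split()
word_table = rank_frequency(tokens, 1)
pair_table = rank_frequency(tokens, 2)
```

Plotting each table on log-log axes would show how closely a sample follows a Zipfian line, and comparing `frequent_types(tokens, 1, k)` with `frequent_types(tokens, 2, k)` for some threshold `k` gives the kind of word-versus-phrase comparison the abstract describes.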