Showing all 9 results
Peer reviewed
Margulis, Eugene L. – Information Processing and Management, 1993
Reports on the validity of the Multiple Poisson (nP) model of word distribution in full-text document collections. A practical algorithm for determining whether a certain word is distributed according to an nP distribution and the results of a test of this algorithm in three different document collections are described. (14 references) (KRN)
Descriptors: Algorithms, Full Text Databases, Graphs, Information Retrieval
Peer reviewed
Schinke, Robyn; And Others – Journal of Documentation, 1996
Describes the design of a stemming algorithm for searching Latin text databases. The algorithm uses a longest-match approach with some recoding but differs from most stemmers in its use of two separate suffix dictionaries, one for query words and one for database words, which enables users to pursue specific searches for single grammatical forms of words.…
Descriptors: Algorithms, Full Text Databases, Grammar, Latin
Peer reviewed
Sheridan, Paraic; Smeaton, Alan F. – Information Processing and Management, 1992
Describes a process of morpho-syntactic language analysis for information retrieval. Tree Structured Analytics (TSA) used for text representation is summarized; the matching process developed for such structures is outlined with an example appended; and experiments carried out to evaluate the effectiveness of TSA matching are discussed. (26…
Descriptors: Algorithms, Full Text Databases, Information Retrieval, Research Needs
Peer reviewed
Bookstein, Abraham; And Others – Information Processing and Management, 1992
Discusses the problems of compressing a large textual database for storage on CD-ROM. A text-compression algorithm is presented, new algorithms for compression of indices are described, and the ARTFL (American and French Research on the Treasury of the French Language) database is used as an example. (14 references) (LRW)
Descriptors: Algorithms, Coding, Full Text Databases, Indexes
Peer reviewed
Al-Hawamdeh, Suliman; And Others – Online Review, 1991
Describes a project at the University of Sheffield that is investigating the use of nearest-neighbor retrieval algorithms for full-text searching. Nearest-neighbor searching is compared with Boolean retrieval and hypertext, and an experimental text retrieval system called INSTRUCT (Interactive System for Teaching Retrieval Using Computational…
Descriptors: Algorithms, Comparative Analysis, Equations (Mathematics), Foreign Countries
Peer reviewed
Melucci, Massimo – Information Processing & Management, 1998
Presents a probabilistic technique to retrieve passages from texts having a large size or heterogeneous semantic content. Results of experiments comparing the probabilistic technique to one based on a text segmentation algorithm revealed that the passage size affects passage retrieval performance; text organization and query generality may have an…
Descriptors: Algorithms, Comparative Analysis, Full Text Databases, Information Retrieval
Peer reviewed
Couvreur, T. R.; And Others – Journal of the American Society for Information Science, 1994
Discusses the results of modeling the performance of searching large text databases via various parallel hardware architectures and search algorithms. The performance under load and the cost of each configuration are compared, and a common search workload used in the modeling is described. (Contains 26 references.) (LRW)
Descriptors: Algorithms, Comparative Analysis, Computer System Design, Cost Effectiveness
Peer reviewed
Popovic, Mirko; Willett, Peter – Journal of the American Society for Information Science, 1992
Reports on the use of stemming for Slovene language documents and queries in free-text retrieval systems and demonstrates that an appropriate stemming algorithm results in an increase in retrieval effectiveness when compared with nonstemming processing. A comparison is made with stemming of English versions of the same documents and queries. (24…
Descriptors: Algorithms, Comparative Analysis, English, Full Text Databases
Peer reviewed
Tauchert, Wolfgang; And Others – Online Review, 1991
Describes the PADOK-II project in Germany, which was designed to give information on the effects of linguistic algorithms on retrieval in a full-text database, the German Patent Information System (GPI). Relevance assessments are discussed, statistical evaluations are described, and searches are compared for the full-text section versus the…
Descriptors: Abstracts, Algorithms, Bibliographic Databases, Comparative Analysis