
Margulis, Eugene L. – Information Processing and Management, 1993
Reports on the validity of the Multiple Poisson (nP) model of word distribution in full-text document collections. A practical algorithm for determining whether a certain word is distributed according to an nP distribution and the results of a test of this algorithm in three different document collections are described. (14 references) (KRN)
Descriptors: Algorithms, Full Text Databases, Graphs, Information Retrieval

Bookstein, A.; Klein, S. T. – Information Processing and Management, 1990
Describes the theoretical framework of a retrieval system that uses data structures called bitmaps to improve search algorithms for medium-sized systems. Information retrieval systems that contain the full text of the documents are discussed, and the use of inverted files such as concordances for small texts is explained. (24 references) (LRW)
Descriptors: Algorithms, Data Processing, Full Text Databases, Indexes

Fraenkel, Aviezri S.; Klein, Shmuel T. – Journal of the American Society for Information Science, 1999
Investigates methods for the correct and efficient handling of annotations in a full-text information-retrieval system in a hypertext environment. Goals were to increase the number of relevant items retrieved, reduce the number of non-relevant items, permit selective retrieval, and explore the notion of distance between words as given in the distance-evaluation algorithm.…
Descriptors: Abstracts, Algorithms, Full Text Databases, Hypermedia

Schinke, Robyn; And Others – Journal of Documentation, 1996
Describes the design of a stemming algorithm for searching Latin text databases. The algorithm uses a longest-match approach with some recoding but differs from most stemmers in using two separate suffix dictionaries for processing query and database words, which enables users to pursue specific searches for single grammatical forms of words.…
Descriptors: Algorithms, Full Text Databases, Grammar, Latin

Sheridan, Paraic; Smeaton, Alan F. – Information Processing and Management, 1992
Describes a process of morpho-syntactic language analysis for information retrieval. Tree Structured Analytics (TSA) used for text representation is summarized; the matching process developed for such structures is outlined with an example appended; and experiments carried out to evaluate the effectiveness of TSA matching are discussed. (26…
Descriptors: Algorithms, Full Text Databases, Information Retrieval, Research Needs

Bookstein, Abraham; And Others – Information Processing and Management, 1992
Discusses the problems of compressing a large textual database for storage on CD-ROM. A text-compression algorithm is presented, new algorithms for compression of indices are described, and the ARTFL (American and French Research on the Treasury of the French Language) database is used as an example. (14 references) (LRW)
Descriptors: Algorithms, Coding, Full Text Databases, Indexes

Baeza-Yates, Ricardo; And Others – Information Systems, 1996
Discusses indexes for text databases and presents an efficient implementation of an index for text searching called the PAT array, or suffix array, for databases stored on secondary storage devices such as magnetic or optical disks. Additional hierarchical index structures and searching algorithms are proposed that improve searching time, and…
Descriptors: Algorithms, Full Text Databases, Indexes, Information Storage
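
Note: for readers unfamiliar with the structure named in the record above, the following is a minimal in-memory sketch of a suffix array and a binary-search lookup over it. It is a generic textbook illustration, not the implementation described by Baeza-Yates and others: it ignores secondary storage and the hierarchical index structures the abstract mentions, and the function names build_suffix_array and find_occurrences are hypothetical.

```python
def build_suffix_array(text):
    """Return the start positions of all suffixes of text, in lexicographic order.

    Naive construction by sorting suffix slices; fine for short texts,
    far too slow and memory-hungry for a real full-text database.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, suffix_array, pattern):
    """Return the sorted positions in text where pattern occurs.

    All suffixes that begin with pattern are adjacent in the suffix array,
    so two binary searches locate the boundaries of that run.
    """
    m = len(pattern)

    def prefix(k):
        # First m characters of the k-th smallest suffix.
        start = suffix_array[k]
        return text[start:start + m]

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) < pattern:
            lo = mid + 1
        else:
            hi = mid
    first = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(suffix_array[first:lo])


if __name__ == "__main__":
    sample = "banana"
    sa = build_suffix_array(sample)
    print(sa)
    print(find_occurrences(sample, sa, "ana"))
```

Each lookup costs a number of string comparisons logarithmic in the text length, which is why the structure suits searching for arbitrary substrings rather than only whole indexed words.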

Al-Hawamdeh, Suliman; And Others – Online Review, 1991
Describes a project at the University of Sheffield that is investigating the use of nearest-neighbor retrieval algorithms for full-text searching. Nearest-neighbor searching is compared with Boolean retrieval and hypertext, and an experimental text retrieval system called INSTRUCT (Interactive System for Teaching Retrieval Using Computational…
Descriptors: Algorithms, Comparative Analysis, Equations (Mathematics), Foreign Countries

Melucci, Massimo – Information Processing & Management, 1998
Presents a probabilistic technique to retrieve passages from texts that are large or have heterogeneous semantic content. Results of experiments comparing the probabilistic technique to one based on a text segmentation algorithm revealed that the passage size affects passage retrieval performance; text organization and query generality may have an…
Descriptors: Algorithms, Comparative Analysis, Full Text Databases, Information Retrieval

Couvreur, T. R.; And Others – Journal of the American Society for Information Science, 1994
Discusses the results of modeling the performance of searching large text databases via various parallel hardware architectures and search algorithms. The performance under load and the cost of each configuration are compared, and a common search workload used in the modeling is described. (26 references) (LRW)
Descriptors: Algorithms, Comparative Analysis, Computer System Design, Cost Effectiveness

Popovic, Mirko; Willett, Peter – Journal of the American Society for Information Science, 1992
Reports on the use of stemming for Slovene language documents and queries in free-text retrieval systems and demonstrates that an appropriate stemming algorithm results in an increase in retrieval effectiveness when compared with nonstemming processing. A comparison is made with stemming of English versions of the same documents and queries. (24…
Descriptors: Algorithms, Comparative Analysis, English, Full Text Databases

Tauchert, Wolfgang; And Others – Online Review, 1991
Describes the PADOK-II project in Germany, which was designed to give information on the effects of linguistic algorithms on retrieval in a full-text database, the German Patent Information System (GPI). Relevance assessments are discussed, statistical evaluations are described, and searches are compared for the full-text section versus the…
Descriptors: Abstracts, Algorithms, Bibliographic Databases, Comparative Analysis

Woodruff, Allison Gyle; Plaunt, Christian – Journal of the American Society for Information Science, 1994
Presents an algorithm that automatically extracts words and phrases containing geographic place names or characteristics from a text document and uses them as input to database functions that use spatial reasoning to approximate statistically the geoposition referenced in the text. Preliminary results and future work are discussed. (28 references)…
Descriptors: Algorithms, Automatic Indexing, Computer System Design, Evaluation