Showing all 4 results
Peer reviewed
Robertson, Alexander M.; Willett, Peter – Journal of Documentation, 1998
An n-gram is a string of characters, usually adjacent, extracted from a section of continuous text. N-grams can be used in spelling error detection and correction, query expansion, information retrieval, dictionary search, text compression, and language identification applications. This article provides an introduction to the use of n-grams in textual…
Descriptors: Databases, Dictionaries, Error Correction, Information Retrieval
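The n-gram definition in the abstract above can be illustrated with a minimal Python sketch. It extracts adjacent character n-grams from a string and, as one possible application to spelling correction, compares two words by the overlap of their n-gram sets. The function names `char_ngrams` and `dice_similarity` are hypothetical, and the choice of the Dice coefficient is an assumption made for illustration; the abstract does not say which matching measure the article uses.

```python
def char_ngrams(text: str, n: int = 3) -> list[str]:
    """Return the adjacent character n-grams of `text`, in order of occurrence."""
    if n < 1:
        raise ValueError("n must be at least 1")
    # A string of length L has L - n + 1 adjacent n-grams (none if L < n).
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def dice_similarity(a: str, b: str, n: int = 2) -> float:
    """Dice coefficient over the sets of n-grams of two strings.

    Assumption: Dice is used here only as a familiar overlap measure,
    not as the method described in the article.
    """
    grams_a = set(char_ngrams(a, n))
    grams_b = set(char_ngrams(b, n))
    if not grams_a and not grams_b:
        return 0.0  # neither string is long enough to yield an n-gram
    shared = len(grams_a & grams_b)
    return 2 * shared / (len(grams_a) + len(grams_b))


if __name__ == "__main__":
    print(char_ngrams("retrieval", 3))
    print(dice_similarity("color", "colour"))
```

Under these assumptions, a misspelled query term could be matched against dictionary entries by ranking them on this similarity score, since spelling variants tend to share most of their n-grams.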
Peer reviewed
Stewart, Mark; Willett, Peter – Journal of Documentation, 1987
Describes the simulation of a nearest neighbor searching algorithm for document retrieval using a pool of microprocessors. Three techniques are described which allow parallel searching of a binary search tree as well as a PASCAL-based system, PASSIM, which can simulate these techniques. Fifty-six references are provided. (Author/LRW)
Descriptors: Algorithms, Computer Simulation, Correlation, Documentation
Peer reviewed
Willett, Peter – Information Processing and Management, 1985
Reports algorithm for calculation of term discrimination values that is sufficiently fast in operation to permit use of exact values. Evidence is presented to show that relationship between term discrimination and term frequency is crucially dependent upon type of inter-document similarity measure used for calculation of discrimination values. (13…
Descriptors: Algorithms, Graphs, Information Retrieval, Information Systems
Peer reviewed
Pogue, Christine; Willett, Peter – Online Review, 1984
Describes preliminary investigation of the use of International Computers Limited's Distributed Array Processor (DAP) for parallel searching of large serial files of documents. DAP hardware and software, test collections, measurement of DAP performance, search algorithms, experimental results, and DAP suitability for interactive searching are…
Descriptors: Algorithms, Comparative Analysis, Computer Software, Digital Computers