Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Source
Information Processing and… | 6 |
Journal of Documentation | 4 |
Journal of the American… | 4 |
Education for Information | 1 |
Online Review | 1 |
Program: Electronic Library… | 1 |
Author
Publication Type
Journal Articles | 17 |
Reports - Research | 11 |
Information Analyses | 3 |
Reports - Evaluative | 3 |
Opinion Papers | 2 |
Reports - Descriptive | 2 |
Reports - General | 1 |
Education Level
Audience
Researchers | 5 |
Location
United Kingdom (England) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Willett, Peter – Program: Electronic Library and Information Systems, 2006
Purpose: In 1980, Porter presented a simple algorithm for stemming English language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject domains. Design/methodology/approach: Review of literature and research involving use…
Descriptors: Mathematics, Information Retrieval, Programming, English

Willett, Peter – Journal of the American Society for Information Science, 1984
Describes a cluster-based information retrieval procedure that can significantly reduce the computational requirements of the single linkage method, while still maintaining the retrieval effectiveness of the resulting classifications. Use of nearest neighbors, experimental details, and results and conclusions are highlighted. Fourteen references…
Descriptors: Cluster Analysis, Cluster Grouping, Information Retrieval, Relevance (Information Retrieval)

Shaw, Rachel J.; Willett, Peter – Information Processing and Management, 1993
Examines research on the observed values of retrieval effectiveness obtained in searches of files of nearest neighbor document clusters. Results show interdocument similarities used to generate nearest-neighbor clusters are significantly different from randomly generated clusters. (Contains 14 references.) (EAM)
Descriptors: Bibliographic Records, Cluster Analysis, Comparative Analysis, Information Retrieval

Popovic, Mirko; Willett, Peter – Journal of the American Society for Information Science, 1992
Reports on the use of stemming for Slovene language documents and queries in free-text retrieval systems and demonstrates that an appropriate stemming algorithm results in an increase in retrieval effectiveness when compared with nonstemming processing. A comparison is made with stemming of English versions of the same documents and queries. (24…
Descriptors: Algorithms, Comparative Analysis, English, Full Text Databases

Peat, Helen J.; Willett, Peter – Journal of the American Society for Information Science, 1991
Identifies limitations in the use of term co-occurrence data as a basis for automatic query expansion in natural language document retrieval systems. The use of similarity coefficients to calculate the degree of similarity between pairs of terms is explained, and frequency and discriminatory characteristics for nearest neighbors of query terms are…
Descriptors: Databases, Information Retrieval, Online Searching, Online Systems

Pogue, Christine; Willett, Peter – Online Review, 1984
Describes preliminary investigation of the use of International Computers Limited's Distributed Array Processor (DAP) for parallel searching of large serial files of documents. DAP hardware and software, test collections, measurement of DAP performance, search algorithms, experimental results, and DAP suitability for interactive searching are…
Descriptors: Algorithms, Comparative Analysis, Computer Software, Digital Computers

El-Hamdouchi, Abdelmoula; Willett, Peter – Information Processing and Management, 1988
Describes an algorithm for the calculation of term discrimination values that may be used when the interdocument similarity measure used is the cosine coefficient and when the document representations have been weighted using one particular term weighting scheme. (7 references) (Author/CLB)
Descriptors: Algorithms, Automatic Indexing, Computational Linguistics, Discriminant Analysis

Robertson, Alexander M.; Willett, Peter – Journal of Documentation, 1996
Describes a genetic algorithm (GA) that assigns weights to query terms in a ranked-output document retrieval system. Experiments showed the GA often found weights slightly superior to those produced by deterministic weighting (F4). Many times, however, the two methods gave the same results and sometimes the F4 results were superior, indicating…
Descriptors: Algorithms, Comparative Analysis, Information Retrieval, Online Searching

Willett, Peter; Wood, Frances E. – Education for Information, 1989
Describes the development and functions of a text retrieval program that makes extensive use of the best match model of document retrieval rather than the Boolean model. The use of the program in teaching and research at the University of Sheffield is summarized. (24 references) (CLB)
Descriptors: Bibliographic Databases, Foreign Countries, Information Retrieval, Online Searching

Harding, Alan F.; Willett, Peter – Journal of the American Society for Information Science, 1980
Demonstrates that the process of comparing each document in an automated system with all others during the classification procedure may be avoided by the use of an inverted file. (FM)
Descriptors: Automatic Indexing, Classification, Cluster Grouping, Information Retrieval

Willett, Peter – Journal of Documentation, 1979
Describes the use of fixed-length character strings for controlling the size of indexing vocabularies in information retrieval systems. An evaluation of digram and trigram encoding, using the Cranfield document test collection, is presented and the results are compared with the hashing and the right-hand truncation of the terms. (Author)
Descriptors: Comparative Analysis, Indexes, Information Retrieval, Online Systems

Robertson, Alexander M.; Willett, Peter – Journal of Documentation, 1998
An n-gram is a string of characters, usually adjacent, extracted from a section of continuous text that can be used in spelling error detection and correction, query expansion, information retrieval, dictionary search, text compression, and language identification applications. This article provides an introduction to the use of n-grams in textual…
Descriptors: Databases, Dictionaries, Error Correction, Information Retrieval

Stewart, Mark; Willett, Peter – Journal of Documentation, 1987
Describes the simulation of a nearest neighbor searching algorithm for document retrieval using a pool of microprocessors. Three techniques are described which allow parallel searching of a binary search tree as well as a PASCAL-based system, PASSIM, which can simulate these techniques. Fifty-six references are provided. (Author/LRW)
Descriptors: Algorithms, Computer Simulation, Correlation, Documentation

Willett, Peter – Information Processing and Management, 1988
Reviews recent research into the use of hierarchic agglomerative clustering methods for document retrieval. The topics discussed include the calculation of interdocument similarities, algorithms used to implement clustering methods on large databases, validity testing of document hierarchies, appropriate search strategies, and other applications…
Descriptors: Algorithms, Bibliometrics, Cluster Analysis, Comparative Analysis

Lynch, Michael F.; Willett, Peter – Information Processing and Management, 1987
Discusses research into chemical information and document retrieval systems at the University of Sheffield. Highlights include the use of cluster analysis methods for document retrieval and drug design, representation and searching of files of generic chemical structures, and the application of parallel computer hardware to information retrieval.…
Descriptors: Chemistry, Cluster Analysis, Developed Nations, Documentation
Previous Page | Next Page ยป
Pages: 1 | 2