ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	1

Descriptor

Information Retrieval	17
Search Strategies	9
Algorithms	8
Comparative Analysis	6
Online Searching	6
Information Systems	5
Online Systems	5
Relevance (Information…	5
Subject Index Terms	5
Cluster Analysis	4
Tables (Data)	4
Automatic Indexing	3
Documentation	3
Foreign Countries	3
Classification	2
Cluster Grouping	2
Databases	2
English	2
Bibliographic Databases	1
Bibliographic Records	1
Bibliometrics	1
Chemistry	1
Computational Linguistics	1
Computer Simulation	1
Computer Software	1
More ▼

Source

Information Processing and…	6
Journal of Documentation	4
Journal of the American…	4
Education for Information	1
Online Review	1
Program: Electronic Library…	1

Author

Willett, Peter	17
Robertson, Alexander M.	2
El-Hamdouchi, Abdelmoula	1
Harding, Alan F.	1
Lynch, Michael F.	1
Peat, Helen J.	1
Pogue, Christine	1
Popovic, Mirko	1
Shaw, Rachel J.	1
Stewart, Mark	1
Wood, Frances E.	1
More ▼

Publication Type

Journal Articles	17
Reports - Research	11
Information Analyses	3
Reports - Evaluative	3
Opinion Papers	2
Reports - Descriptive	2
Reports - General	1

Education Level

Audience

Researchers

Location

United Kingdom (England)

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

The Porter Stemming Algorithm: Then and Now

Peer reviewed

Direct link

Willett, Peter – Program: Electronic Library and Information Systems, 2006

Purpose: In 1980, Porter presented a simple algorithm for stemming English language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject domains. Design/methodology/approach: Review of literature and research involving use…

Descriptors: Mathematics, Information Retrieval, Programming, English

A Note on the Use of Nearest Neighbors for Implementing Single Linkage Document Classifications.

Peer reviewed

Willett, Peter – Journal of the American Society for Information Science, 1984

Describes a cluster-based information retrieval procedure that can significantly reduce the computational requirements of the single linkage method, while still maintaining the retrieval effectiveness of the resulting classifications. Use of nearest neighbors, experimental details, and results and conclusions are highlighted. Fourteen references…

Descriptors: Cluster Analysis, Cluster Grouping, Information Retrieval, Relevance (Information Retrieval)

On the Non-Random Nature of Nearest-Neighbour Document Clusters.

Peer reviewed

Shaw, Rachel J.; Willett, Peter – Information Processing and Management, 1993

Examines research on the observed values of retrieval effectiveness obtained in searches of files of nearest neighbor document clusters. Results show interdocument similarities used to generate nearest-neighbor clusters are significantly different from randomly generated clusters. (Contains 14 references.) (EAM)

Descriptors: Bibliographic Records, Cluster Analysis, Comparative Analysis, Information Retrieval

The Effectiveness of Stemming for Natural-Language Access to Slovene Textual Data.

Peer reviewed

Popovic, Mirko; Willett, Peter – Journal of the American Society for Information Science, 1992

Reports on the use of stemming for Slovene language documents and queries in free-text retrieval systems and demonstrates that an appropriate stemming algorithm results in an increase in retrieval effectiveness when compared with nonstemming processing. A comparison is made with stemming of English versions of the same documents and queries. (24…

Descriptors: Algorithms, Comparative Analysis, English, Full Text Databases

The Limitations of Term Co-Occurrence Data for Query Expansion in Document Retrieval Systems.

Peer reviewed

Peat, Helen J.; Willett, Peter – Journal of the American Society for Information Science, 1991

Identifies limitations in the use of term co-occurrence data as a basis for automatic query expansion in natural language document retrieval systems. The use of similarity coefficients to calculate the degree of similarity between pairs of terms is explained, and frequency and discriminatory characteristics for nearest neighbors of query terms are…

Descriptors: Databases, Information Retrieval, Online Searching, Online Systems

An Evaluation of Document Retrieval from Serial Files Using the ICL Distributed Array Processor.

Peer reviewed

Pogue, Christine; Willett, Peter – Online Review, 1984

Describes preliminary investigation of the use of International Computers Limited's Distributed Array Processor (DAP) for parallel searching of large serial files of documents. DAP hardware and software, test collections, measurement of DAP performance, search algorithms, experimental results, and DAP suitability for interactive searching are…

Descriptors: Algorithms, Comparative Analysis, Computer Software, Digital Computers

An Improved Algorithm for the Calculation of Exact Term Discrimination Values.

Peer reviewed

El-Hamdouchi, Abdelmoula; Willett, Peter – Information Processing and Management, 1988

Describes an algorithm for the calculation of term discrimination values that may be used when the interdocument similarity measure used is the cosine coefficient and when the document representations have been weighted using one particular term weighting scheme. (7 references) (Author/CLB)

Descriptors: Algorithms, Automatic Indexing, Computational Linguistics, Discriminant Analysis

An Upperbound to the Performance of Ranked-Output Searching: Optimal Weighting of Query Terms Using A Genetic Algorithm.

Peer reviewed

Robertson, Alexander M.; Willett, Peter – Journal of Documentation, 1996

Describes a genetic algorithm (GA) that assigns weights to query terms in a ranked-output document retrieval system. Experiments showed the GA often found weights slightly superior to those produced by deterministic weighting (F4). Many times, however, the two methods gave the same results and sometimes the F4 results were superior, indicating…

Descriptors: Algorithms, Comparative Analysis, Information Retrieval, Online Searching

Use of the INSTRUCT Text Retrieval Program at the Department of Information Studies, University of Sheffield.

Peer reviewed

Willett, Peter; Wood, Frances E. – Education for Information, 1989

Describes the development and functions of a text retrieval program that makes extensive use of the best match model of document retrieval rather than the Boolean model. The use of the program in teaching and research at the University of Sheffield is summarized. (24 references) (CLB)

Descriptors: Bibliographic Databases, Foreign Countries, Information Retrieval, Online Searching

Indexing Exhaustivity and the Computation of Similarity Matrices.

Peer reviewed

Harding, Alan F.; Willett, Peter – Journal of the American Society for Information Science, 1980

Demonstrates that the process of comparing each document in an automated system with all others during the classification procedure may be avoided by the use of an inverted file. (FM)

Descriptors: Automatic Indexing, Classification, Cluster Grouping, Information Retrieval

Document Retrieval Experiments Using Indexing Vocabularies of Varying Size. II. Hashing, Truncation, Digram and Trigram Encoding of Index Terms.

Peer reviewed

Willett, Peter – Journal of Documentation, 1979

Describes the use of fixed-length character strings for controlling the size of indexing vocabularies in information retrieval systems. An evaluation of digram and trigram encoding, using the Cranfield document test collection, is presented and the results are compared with the hashing and the right-hand truncation of the terms. (Author)

Descriptors: Comparative Analysis, Indexes, Information Retrieval, Online Systems

Applications of N-Grams in Textual Information Systems.

Peer reviewed

Robertson, Alexander M.; Willett, Peter – Journal of Documentation, 1998

An n-gram is a string of characters, usually adjacent, extracted from a section of continuous text that can be used in spelling error detection and correction, query expansion, information retrieval, dictionary search, text compression, and language identification applications. This article provides an introduction to the use of n-grams in textual…

Descriptors: Databases, Dictionaries, Error Correction, Information Retrieval

Nearest Neighbor Searching in Binary Search Trees: Simulation of a Multiprocessor System.

Peer reviewed

Stewart, Mark; Willett, Peter – Journal of Documentation, 1987

Describes the simulation of a nearest neighbor searching algorithm for document retrieval using a pool of microprocessors. Three techniques are described which allow parallel searching of a binary search tree as well as a PASCAL-based system, PASSIM, which can simulate these techniques. Fifty-six references are provided. (Author/LRW)

Descriptors: Algorithms, Computer Simulation, Correlation, Documentation

Recent Trends in Hierarchic Document Clustering: A Critical Review.

Peer reviewed

Willett, Peter – Information Processing and Management, 1988

Reviews recent research into the use of hierarchic agglomerative clustering methods for document retrieval. The topics discussed include the calculation of interdocument similarities, algorithms used to implement clustering methods on large databases, validity testing of document hierarchies, appropriate search strategies, and other applications…

Descriptors: Algorithms, Bibliometrics, Cluster Analysis, Comparative Analysis

Current Research into Chemical and Textual Information Retrieval at the Department of Information Studies, University of Sheffield.

Peer reviewed

Lynch, Michael F.; Willett, Peter – Information Processing and Management, 1987

Discusses research into chemical information and document retrieval systems at the University of Sheffield. Highlights include the use of cluster analysis methods for document retrieval and drug design, representation and searching of files of generic chemical structures, and the application of parallel computer hardware to information retrieval.…

Descriptors: Chemistry, Cluster Analysis, Developed Nations, Documentation

Previous Page | Next Page »

Pages: 1 | 2