Showing all 12 results
Peer reviewed
Bailey, Peter; Craswell, Nick; Hawking, David – Information Processing & Management, 2003
Describes a test collection that was developed as a multi-purpose testbed for experiments on the Web in distributed information retrieval, hyperlink algorithms, and conventional ad hoc retrieval. Discusses inter-server connectivity, integrity of server holdings, inclusion of documents related to a wide spread of likely queries, and distribution of…
Descriptors: Algorithms, Hypermedia, Information Retrieval, World Wide Web
Peer reviewed
Savoy, Jacques; Picard, Justin – Information Processing & Management, 2001
Discusses the role of search engines in Web usability and analyzes and evaluates the retrieval effectiveness of various indexing and searching strategies on a new Web text collection. Highlights include preprocessing techniques that might improve retrieval effectiveness; and hyperlinks as useful sources of evidence in improving retrieval…
Descriptors: Algorithms, Indexing, Information Retrieval, Search Strategies
Peer reviewed
Miyamoto, Sadaaki – Information Processing & Management, 2003
Proposes a fuzzy multiset model for information clustering with application to information retrieval on the World Wide Web. Highlights include search engines; term clustering; document clustering; algorithms for calculating cluster centers; theoretical properties concerning clustering algorithms; and examples to show how the algorithms work.…
Descriptors: Algorithms, Information Retrieval, Mathematical Formulas, Models
Balas, Janet L. – Computers in Libraries, 1999
Reviews Web-search tools available to librarians. Discusses the Clever project (part of the Computer Science Principles and Methodologies Department at the IBM Almaden Research Center, Silicon Valley, California), an algorithm that finds authoritative sources on the Web; Google technology, based on links between Web pages; Ask Jeeves, a…
Descriptors: Algorithms, Information Retrieval, Information Sources, Library Services
Peer reviewed
Davis, Charles H.; McKim, Geoffrey W. – Journal of the American Society for Information Science, 1999
Describes SWEAR (Systematic Weighting and Ranking), a powers-of-two algorithm for searching the World Wide Web or any large database; it automatically creates discrete, well-defined result sets and displays them in decreasing order of likely relevance. Also discusses fuzzy sets. (Author/LRW)
Descriptors: Algorithms, Databases, Information Retrieval, Relevance (Information Retrieval)
Lempel, Ronny; Moran, Shlomo – Proceedings of the ASIST Annual Meeting, 2002
Discusses the use of link structure analysis in World Wide Web information retrieval and in the retrieval and ranking algorithms of search engines. Presents two techniques that bias co-citation-based link analyses towards favorable Web pages and away from undesired pages, and that are then integrated into existing link-analyzing algorithms. (Author/LRW)
Descriptors: Algorithms, Citation Analysis, Information Retrieval, Search Engines
Peer reviewed
Kim, Deok-Hwan; Chung, Chin-Wan – Information Processing & Management, 2003
Discusses the collection fusion problem of image databases, which concerns retrieving relevant images by content-based retrieval from image databases distributed on the Web. Focuses on a metaserver which selects image databases supporting similarity measures and proposes a new algorithm which exploits a probabilistic technique using Bayesian…
Descriptors: Algorithms, Content Analysis, Databases, Information Retrieval
Peer reviewed
Chen, Hsinchun; Chau, Michael – Annual Review of Information Science and Technology (ARIST), 2004
Presents an overview of machine learning research and reviews methods used for evaluating machine learning systems. Ways that machine-learning algorithms were used in traditional information retrieval systems in the "pre-Web" era are described, and the field of Web mining and how machine learning has been used in different Web mining…
Descriptors: Algorithms, Evaluation Methods, Information Retrieval, Information Science
Peer reviewed
Cohen, Sara; Kanza, Yaron; Kogan, Yakov; Sagiv, Yehoshua; Nutt, Werner; Serebrenik, Alexander – Journal of the American Society for Information Science and Technology, 2002
Describes EquiX, a search language for XML that combines querying with searching to query the data and the meta-data content of Web pages. Topics include search engines; a data model for XML documents; search query syntax; search query semantics; an algorithm for evaluating a query on a document; and indexing EquiX queries. (LRW)
Descriptors: Algorithms, Evaluation Methods, Indexing, Information Retrieval
Peer reviewed
Boley, Daniel; Gini, Maria; Hastings, Kyle; Mobasher, Bamshad; Moore, Jerry – Internet Research, 1998
Describes WebACE, the architecture of a client-side agent that automatically explores Web documents and classifies them into clusters, and discusses the details of the algorithms within its key components. Highlights principal direction divisive partitioning (PDDP), a scalable hierarchical clustering algorithm; compares it to other clustering methods; and…
Descriptors: Algorithms, Automation, Classification, Cluster Grouping
Peer reviewed
Chen, Ping-Wen; Chang, Shi-Kuo – Telematics and Informatics, 1997
Presents a World Wide Web page model that reacts to pre-defined events and automatically performs actions such as "prefetching." Discusses the active index, conceptual page model, design of the client system, status of the current implementation, prefetching algorithm, an experiment with the algorithm, experimental results of the active index…
Descriptors: Algorithms, Computer Interfaces, Computer Software Development, Expert Systems
Peer reviewed
Chen, Hsinchun; Chung, Yi-Ming; Ramsey, Marshall; Yang, Christopher C. – Journal of the American Society for Information Science, 1998
This study tested two Web personal spiders (i.e., agents that take users' requests and perform real-time customized searches) based on best-first search and genetic-algorithm techniques. The results of the two were comparable and complementary, although the genetic algorithm obtained a higher recall value. The Java-based interface was found to be necessary…
Descriptors: Algorithms, Artificial Intelligence, Computer Interfaces, Computer Software Evaluation