Descriptor
Source
Information Processing &… | 46 |
Author
Aoe, Jun-ichi | 3 |
Guerrero Bote, Vicente P. | 2 |
Koyama, Masafumi | 2 |
Moya Anegon, Felix de | 2 |
Okada, Makoto | 2 |
Savoy, Jacques | 2 |
Shishibori, Masami | 2 |
Storer, James A. | 2 |
Abrahams, Julia | 1 |
Ahmed, Salahuddin | 1 |
Ando, Kazuaki | 1 |
More ▼ |
Publication Type
Journal Articles | 46 |
Reports - Descriptive | 27 |
Reports - Evaluative | 10 |
Reports - Research | 10 |
Speeches/Meeting Papers | 6 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Jung, Minsoo; Shishibori, Masami; Tanaka, Yasuhiro; Aoe, Jun-ichi – Information Processing & Management, 2002
Discussion of information retrieval focuses on the use of binary trees and how to compact it to use less memory and take less time. Explains retrieval algorithms and describes data structure and hierarchical structure. (LRW)
Descriptors: Algorithms, Information Retrieval

Fox, Brian; Fox, Christopher J. – Information Processing & Management, 2002
Discussion of word stemming for text processing and information retrieval focuses on an algorithm for generating stemmers from text stemmer specification files. Highlights include the use of finite state machines for stemming; stemmer specification file format; stemmer generation and driver algorithms; and stemmer timing comparisons. (Author/LRW)
Descriptors: Algorithms, Information Retrieval, Word Processing

Shieh, Wann-Yun; Chen, Tien-Fu; Shann, Jean Jyh-Jiun; Chung, Chung-Ping – Information Processing & Management, 2003
Discusses the use of inverted files in information retrieval systems and proposes a document identifier reassignment method to reduce the average gap values in an inverted file. Highlights include the d-gap technique; document similarity; heuristic algorithms; file compression; and performance evaluation from a simulation environment. (LRW)
Descriptors: Algorithms, Evaluation Methods, Information Retrieval, Simulation

Ozkarahan, Esen – Information Processing & Management, 1995
This study develops an integrated conceptual representation scheme for multimedia documents that are viewed to comprise an object-oriented database; the necessary abstractions for the conceptual model and extensions to the relational model used as the search structure; a retrieval model that includes associative, semantic and media-specific…
Descriptors: Algorithms, Information Retrieval, Models, Multimedia Materials

Bailey, Peter; Craswell, Nick; Hawking, David – Information Processing & Management, 2003
Describes a test collection that was developed as a multi-purpose testbed for experiments on the Web in distributed information retrieval, hyperlink algorithms, and conventional ad hoc retrieval. Discusses inter-server connectivity, integrity of server holdings, inclusion of documents related to a wide spread of likely queries, and distribution of…
Descriptors: Algorithms, Hypermedia, Information Retrieval, World Wide Web

Abrahams, Julia – Information Processing & Management, 1994
Discusses the minimum average codeword length coding under the constraint that the codewords are monotonically nondecreasing in length. Bounds on the average length of an optimal monotonic code are derived, and sufficient conditions are given such that algorithms for optimal alphabetic codes can be used to find the optimal monotonic code. (six…
Descriptors: Algorithms, Coding, Illustrations, Information Theory

Guerrero Bote, Vicente P.; Moya Anegon, Felix de; Herrero Solana, Victor – Information Processing & Management, 2002
Discussion of the classification of documents from bibliographic databases focuses on a method of vectorizing reference documents from LISA (Library and Information Science Abstracts) which permits their topological organization using Kohonen's algorithm. Analyzes possibilities of this type of neural network with respect to the development of…
Descriptors: Algorithms, Bibliographic Databases, Classification, Information Retrieval

Morato, Jorge; Llorens, J.; Genova, G.; Moreiro, J. A. – Information Processing & Management, 2003
Discusses the inclusion of contextual information in indexing and retrieval systems to improve results and the ability to carry out text analysis by means of linguistic knowledge. Presents research that investigated whether discourse variables have an impact on information and retrieval and classification algorithms. (Author/LRW)
Descriptors: Algorithms, Classification, Indexing, Information Retrieval

Koyama, Masafumi; Morita, Kazuhiro; Fuketa, Masao; Aoe, Jun-Ichi – Information Processing & Management, 1998
Presents a faster method for determining hierarchical relationships in information retrieval by using trie structures instead of a linear storage of a concept code. Highlights include case structures, a knowledge representation for natural-language understanding with semantic constraints; a compression algorithm of tries; and evaluation.…
Descriptors: Algorithms, Evaluation Methods, Information Retrieval, Knowledge Representation

Nieto Sanchez, Salvador; Triantaphyllou, Evangelos; Kraft, Donald – Information Processing & Management, 2002
Proposes a new approach for classifying text documents into two disjoint classes. Highlights include a brief overview of document clustering; a data mining approach called the One Clause at a Time (OCAT) algorithm which is based on mathematical logic; vector space model (VSM); and comparing the OCAT to the VSM. (Author/LRW)
Descriptors: Algorithms, Cluster Grouping, Comparative Analysis, Mathematical Logic

Bookstein, Abraham; Klein, Shmuel T.; Raita, Timo – Information Processing & Management, 1997
Discussion of text compression focuses on a method to reduce the amount of storage needed to represent a Markov model with an extended alphabet, by applying a clustering scheme that brings together similar states. Highlights include probability vectors; algorithms; implementation details; and experimental data with natural languages. (Author/LRW)
Descriptors: Algorithms, Computer Science, Markov Processes, Models

Mostafa, J.; Lam, W. – Information Processing & Management, 2000
Presents a multilevel model of the information filtering process that permits document classification. Evaluates a document classification approach based on a supervised learning algorithm, measures the accuracy of the algorithm in a neural network that was trained to classify medical documents on cell biology, and discusses filtering…
Descriptors: Algorithms, Classification, Cytology, Evaluation Methods

Savoy, Jacques; Picard, Justin – Information Processing & Management, 2001
Discusses the role of search engines in Web usability and analyzes and evaluates the retrieval effectiveness of various indexing and searching strategies on a new Web text collection. Highlights include preprocessing techniques that might improve retrieval effectiveness; and hyperlinks as useful sources of evidence in improving retrieval…
Descriptors: Algorithms, Indexing, Information Retrieval, Search Strategies

Okada, Makoto; Ando, Kazuaki; Lee, Samuel Sangkon; Hayashi, Yoshitaka; Aoe, Jun-ichi – Information Processing & Management, 2001
Discusses information retrieval systems and extracting appropriate keywords from documents and proposes an effective substring search method by extending a pattern matching machine for multi-keywords called delayed keyword extraction (DKE). Also proposes a construction algorithm of the retrieval structure for speeding up the substring search.…
Descriptors: Algorithms, Information Retrieval, Keywords, Search Strategies

Tan, Chade-Meng; Wang, Yuan-Fang; Lee, Chan-Do – Information Processing & Management, 2002
Presents an efficient text categorization (or text classification) algorithm for document retrieval of natural language texts that generates bigrams (two-word phrases) and uses the information gain metric, combined with various frequency thresholds. Experimental results suggest that the bigrams can substantially raise the quality of feature sets.…
Descriptors: Algorithms, Classification, Information Retrieval, Natural Language Processing