Source
Information Processing &… | 12 |
Author
Kim, Dongseok | 2 |
Lee, Gary Geunbae | 2 |
Aoe, Jun-ichi | 1 |
Atlam, El-Sayed | 1 |
Cha, Jeongwon | 1 |
Chan, Benjamin | 1 |
Flood, James | 1 |
Frew, Brian | 1 |
Fuketa, Masao | 1 |
Hersh, William | 1 |
Jung, Hanmin | 1 |
Publication Type
Journal Articles | 12 |
Reports - Research | 10 |
Reports - Descriptive | 5 |

Tan, Chade-Meng; Wang, Yuan-Fang; Lee, Chan-Do – Information Processing & Management, 2002
Presents an efficient text categorization (or text classification) algorithm for document retrieval of natural language texts that generates bigrams (two-word phrases) and uses the information gain metric, combined with various frequency thresholds. Experimental results suggest that the bigrams can substantially raise the quality of feature sets.…
Descriptors: Algorithms, Classification, Information Retrieval, Natural Language Processing
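
The abstract above names three ingredients: bigram generation, a frequency threshold, and the information gain metric. A minimal sketch of how they might fit together follows. It is not the authors' algorithm; the function names, the whitespace tokenizer, the `min_df` document-frequency threshold, and `top_k` are all assumptions made for illustration.

```python
import math
from collections import Counter


def bigrams(tokens):
    """Return adjacent two-word phrases from a list of tokens."""
    return [f"{a} {b}" for a, b in zip(tokens, tokens[1:])]


def entropy(counts):
    """Shannon entropy (bits) of a class-count distribution."""
    counts = list(counts)
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c)


def information_gain(docs, labels, feature):
    """Information gain of a present/absent feature w.r.t. the labels."""
    n = len(docs)
    base = entropy(Counter(labels).values())
    with_f = [y for d, y in zip(docs, labels) if feature in d]
    without_f = [y for d, y in zip(docs, labels) if feature not in d]
    conditional = 0.0
    for part in (with_f, without_f):
        if part:
            conditional += len(part) / n * entropy(Counter(part).values())
    return base - conditional


def select_bigrams(texts, labels, min_df=2, top_k=5):
    """Keep bigrams seen in at least min_df documents, ranked by gain.

    min_df and top_k are hypothetical parameters, not values from the paper.
    """
    docs = [set(bigrams(t.lower().split())) for t in texts]
    doc_freq = Counter(b for d in docs for b in d)
    candidates = [b for b, c in doc_freq.items() if c >= min_df]
    scored = [(information_gain(docs, labels, b), b) for b in candidates]
    # Highest gain first; ties broken alphabetically for a stable order.
    return sorted(scored, key=lambda s: (-s[0], s[1]))[:top_k]


if __name__ == "__main__":
    texts = [
        "machine learning is fun",
        "machine learning works",
        "stock market falls",
        "stock market rises",
    ]
    labels = ["cs", "cs", "fin", "fin"]
    for gain, phrase in select_bigrams(texts, labels):
        print(f"{phrase}: {gain:.3f}")
```

In this toy corpus only "machine learning" and "stock market" pass the threshold, and each separates the two classes perfectly, so both receive the maximum gain of one bit.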

Atlam, El-Sayed; Fuketa, Masao; Morita, Kazuhiro; Aoe, Jun-ichi – Information Processing & Management, 2000
Discussion of natural language processing focuses on term weighting in information retrieval. Presents a new weighting method that depends on low frequency terms, called negative weighted inverse verb frequency, and discusses case frames, word similarity, similarity measurement, and recall and precision improvement. (LRW)
Descriptors: Information Retrieval, Mathematical Formulas, Measurement Techniques, Natural Language Processing

Miller, Uri – Information Processing & Management, 1997
Discusses general problems of thesaurus construction theory and practice. Highlights include lexical control and its tools in various databases; natural language versus conceptual networks; systems approach; thesaurus versus classification, including associative relations; and thesaurus role in information storage and retrieval. (110 references)…
Descriptors: Classification, Databases, Information Retrieval, Information Storage

Hersh, William; Turpin, Andrew; Price, Susan; Kraemer, Dale; Olson, Daniel; Chan, Benjamin; Sacherek, Lynetta – Information Processing & Management, 2001
Describes research conducted at the TREC (Text Retrieval Conference) interactive track that compared Boolean and natural language searching, showing they achieved comparable results; and assessed the validity of batch-oriented retrieval evaluations, showing that the results from batch evaluations were not comparable to those obtained in…
Descriptors: Comparative Analysis, Evaluation Methods, Information Retrieval, Natural Language Processing

Shim, Junhyeok; Kim, Dongseok; Cha, Jeongwon; Lee, Gary Geunbae; Seo, Jungyun – Information Processing & Management, 2002
Discussion of natural language processing focuses on a multi-strategic integrated text preprocessing method for difficult problems of sentence boundary disambiguation and word boundary disambiguation of Web texts. Describes an evaluation of the method using Korean Web document collections. (Author/LRW)
Descriptors: Evaluation Methods, Korean, Mathematical Formulas, Natural Language Processing

McKeown, Kathleen; And Others – Information Processing & Management, 1995
Presents an approach to summarization that combines information from multiple facts into a single sentence using linguistic constructions. Describes two applications: one produces summaries of basketball games, and the other contains summaries of telephone network planning activity. Both summarize input data as opposed to full text. Discusses…
Descriptors: Basketball, Communications, Computational Linguistics, Information Sources

Kim, Dongseok; Jung, Hanmin; Lee, Gary Geunbae – Information Processing & Management, 2003
Presents a new extraction pattern, modified Document Type Definition (mDTD), which relies on analytical interpretation to identify extraction targets from the contents of Web documents. Experiments with 330 Korean and 220 English Web documents on audio and video shopping sites yielded an average extraction precision of 91.3% for Korean and 81.9%…
Descriptors: Computer System Design, English, Information Retrieval, Korean

Turtle, Howard; Flood, James – Information Processing & Management, 1995
Discusses two query evaluation strategies used in large text retrieval systems: (1) term-at-a-time; and (2) document-at-a-time. Describes optimization techniques that can reduce query evaluation costs. Presents simulation results that compare the performance of these optimization techniques when applied to natural language query evaluation. (JMV)
Descriptors: Access to Information, Comparative Analysis, Cost Effectiveness, Evaluation Methods
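
The two strategies named in the abstract above differ only in the order in which postings are visited. The sketch below contrasts them on a toy inverted index. The index contents, the additive scoring, and the function names are assumptions made for illustration, and none of the optimization techniques studied in the article are implemented.

```python
import heapq
from collections import defaultdict
from itertools import groupby

# Hypothetical toy index: term -> postings sorted by document id,
# each posting a (doc_id, weight) pair.
INDEX = {
    "natural": [(1, 0.5), (3, 0.2)],
    "language": [(1, 0.4), (2, 0.7), (3, 0.1)],
    "query": [(2, 0.3)],
}


def term_at_a_time(index, terms):
    """Process one whole postings list per term, keeping partial scores
    for every document seen so far in an accumulator table."""
    accumulators = defaultdict(float)
    for term in terms:
        for doc_id, weight in index.get(term, []):
            accumulators[doc_id] += weight
    return dict(accumulators)


def document_at_a_time(index, terms):
    """Walk all postings lists in parallel in document-id order, so each
    document's score is complete before the next document is touched."""
    lists = [index[t] for t in terms if t in index]
    merged = heapq.merge(*lists)  # relies on postings sorted by doc_id
    scores = {}
    for doc_id, group in groupby(merged, key=lambda p: p[0]):
        scores[doc_id] = sum(weight for _, weight in group)
    return scores


if __name__ == "__main__":
    q = ["natural", "language", "query"]
    print(term_at_a_time(INDEX, q))
    print(document_at_a_time(INDEX, q))
```

Both functions yield the same scores. The practical difference is in memory and access pattern: term-at-a-time needs an accumulator per candidate document, while document-at-a-time needs all postings lists open at once but only one running score.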

Strzalkowski, Tomek – Information Processing & Management, 1995
Describes an information retrieval system in which advanced natural language processing is used to enhance the effectiveness of term-based document retrieval by preprocessing the documents; discovering interterm dependencies and building a conceptual hierarchy specific to the database domain; and processing the user's natural language requests into…
Descriptors: Databases, Information Processing, Information Retrieval, Information Seeking

Maybury, Mark T. – Information Processing & Management, 1995
Describes and evaluates a system that selects key information from an event database by reasoning about event frequencies, frequencies of relations between events, and domain-specific importance measures. The system aggregates similar information and plans a summary tailored to a stereotypical user. (AEF)
Descriptors: Abstracting, Data Processing, Databases, Electronic Text

Losee, Robert M. – Information Processing & Management, 2001
Increasing information retrieval performance using phrases and part-of-speech (POS) information is one example of a type of decision-making performance that is improved when using this linguistic information. The relative effectiveness of using multi-term phrases as opposed to individual terms is shown, as well as the relative worth of POS tagged…
Descriptors: Decision Making, Form Classes (Languages), Improvement, Information Retrieval

Rowe, Neil C.; Frew, Brian – Information Processing & Management, 1998
Explores an indirect method of locating, for indexing purposes, the likely explicit and implicit captions of photographs, using multimodal clues including the specific words used, syntax, surrounding layout of the Web page, and general appearance of the associated image. The MARIE-3 system thus avoids full image processing and full natural-language…
Descriptors: Captions, Computer System Design, Indexing, Information Processing