ERIC - Search Results

Showing 1 to 15 of 19 results

Text Segmentation for Chinese Spell Checking.

Peer reviewed

Lee, Kin Hong; Lu, Qin; Ng, Mau Kit Michael – Journal of the American Society for Information Science, 1999

Discussion of spell checking for Chinese words proposes a Block-of-Combinations (BOC) text-segmentation method based on frequency of word usage to reduce the word combinations from exponential growth to linear growth. Suggests user interaction to make the segmentation more suitable for spell checking. (Author/LRW)

Descriptors: Chinese, Ideography, Word Frequency

A Study of Statistical Measures for Predicting Terms Used to Index Documents

Peer reviewed

Rosenberg, Victor – Journal of the American Society for Information Science, 1971

A statistical measure is developed for predicting the terms from a restricted vocabulary that will be used to index a document, given that one of the index terms is known. (Author)

Descriptors: Computational Linguistics, Indexing, Statistical Analysis, Subject Index Terms

Automatic Text Analysis Based on Transition Phenomena of Word Occurrences

Peer reviewed

Pao, Miranda Lee – Journal of the American Society for Information Science, 1978

Describes a method of selecting index terms directly from a word frequency list, an idea originally suggested by Goffman. Results of the analysis of word frequencies of two articles seem to indicate that the automated selection of index terms from a frequency list holds some promise for automatic indexing. (Author/MBR)

Descriptors: Automatic Indexing, Comparative Analysis, Experiments, Indexing

A Relationship between Lotka's Law, Bradford's Law, and Zipf's Law.

Peer reviewed

Chen, Ye-Sho; Leimkuhler, Ferdinand F. – Journal of the American Society for Information Science, 1986

A common functional relationship among Lotka's law, Bradford's law, and Zipf's law is derived. The proof takes explicit account of the sequences of observed values of the variables by means of an index. This approach results in a more realistic and precise formulation of each law. (Author/EM)

Descriptors: Comparative Analysis, Goodness of Fit, Information Theory, Mathematical Models

An Experiment in Index Term Frequency

Peer reviewed

Svenonius, Elaine – Journal of the American Society for Information Science, 1972

Asks which index terms assigned to documents function most effectively in retrieval: the most used (popular) terms or those used relatively infrequently. The retrieval experiment uses the Cranfield-Salton data. (14 references) (Author)

Descriptors: Indexing, Information Processing, Relevance (Information Retrieval), Subject Index Terms

A Model for Word Clustering.

Peer reviewed

Thom, James A.; Zobel, Justin – Journal of the American Society for Information Science, 1992

Discusses models for the distribution of words in text and proposes a new model based on clustering that can be used to estimate the probability that a document contains a particular word as well as the number of distinct words in a document. Zipf's law and the Poisson approximation are also discussed. (18 references) (LRW)

Descriptors: Cluster Grouping, Mathematical Formulas, Models, Probability

Literature-Based Discovery by Lexical Statistics.

Peer reviewed

Lindsay, Robert K.; Gordon, Michael D. – Journal of the American Society for Information Science, 1999

Reports the results of experiments with MEDLINE that used lexical statistics such as word-frequency counts to discover hidden connections in medical literature. Discusses problems with relying on bibliographic citations or standard indexing methods to establish a relationship between topics that might profitably be explored by scientific research.…

Descriptors: Citations (References), Computational Linguistics, Indexing, Medical Research

The Measurement of Term Importance in Automatic Indexing.

Peer reviewed

Salton, G.; And Others – Journal of the American Society for Information Science, 1981

Reviews major term-weighting theories, presents methods for estimating the relevance properties of terms based on their frequency characteristics in a document collection, and compares weighting systems using term relevance properties with more conventional frequency-based methodologies. Eighteen references are cited. (Author/FM)

Descriptors: Automatic Indexing, Bibliographies, Information Retrieval, Methods

Split Size-Rank Models for the Distribution of Index Terms.

Peer reviewed

Nelson, Michael J.; Tague, Jean M. – Journal of the American Society for Information Science, 1985

Proposes a split model for index term distribution in a document set that uses a rank function for high-frequency terms and a size function for low-frequency terms; the point of transition is determined either empirically or by rule. Distributions to describe index term exhaustivity and term co-occurrence are considered briefly. (36 references) (EJS)

Descriptors: Databases, Indexing, Information Retrieval, Models

A Model for Estimating the Occurrence of Same-Frequency Words and the Boundary between High- and Low-Frequency Words in Texts.

Peer reviewed

Sun, Qinglan; Shaw, Debora; Davis, Charles H. – Journal of the American Society for Information Science, 1999

Proposes a model, based on a "maximum ranking method," for more simply estimating the frequency of any same-frequency words and identifying the boundary point between high-frequency and low-frequency words in a text. This model was used successfully with English and Chinese texts, demonstrating that the frequency of words and number of…

Descriptors: Chinese, Electronic Text, English, Information Science

Automatic Query Formulations in Information Retrieval.

Peer reviewed

Salton, G.; And Others – Journal of the American Society for Information Science, 1983

Introduces methods designed to reduce the role of search intermediaries by automatically generating Boolean search formulations, using term frequency considerations, from natural language statements provided by system patrons. Experimental results are supplied and methods are described for applying the automatic query formulation process in practice.…

Descriptors: Information Retrieval, Online Systems, Relevance (Information Retrieval), Search Strategies

The Use of Discriminant Analysis to Select Content-Bearing Words.

Peer reviewed

Dillon, Martin; Federhart, Peggy – Journal of the American Society for Information Science, 1982

Presents a method for identifying indexing terms from word stems, using discriminant analysis to distinguish terms which refer to topics from terms which do not refer to topics. A test of the method on the Harris Survey Question database is discussed. Included are 11 data tables and a reference list. (Author/JL)

Descriptors: Automatic Indexing, Classification, Discriminant Analysis, Information Retrieval

Retrieval Languages of Social Sciences and Natural Sciences: A Statistical Investigation.

Peer reviewed

Kim, Chai – Journal of the American Society for Information Science, 1982

Examines and compares the theoretical and empirical bases for the use of the relative frequency of descriptor use in the design and maintenance of thesauri for information retrieval in the social and natural sciences. Data are presented in two tables and a reference list is included. (JL)

Descriptors: Indexing, Information Retrieval, Natural Sciences, Social Sciences

Statistical Recognition of Content Terms in General Text.

Peer reviewed

Dillon, Martin; Federhart, Peggy – Journal of the American Society for Information Science, 1984

Discusses ways to improve the quality of retrieval systems that depend on the use of truncated words or quasi-word stems as indexing vocabulary. Addresses problems of generalizability and stability of discriminant function analysis for selecting good topical terms in a database drawn from abstracts of Harris Survey press releases. References are cited.…

Descriptors: Classification, Databases, Discriminant Analysis, Information Retrieval

A Zipfian Model of an Automatic Bibliographic System: An Application to MEDLINE.

Peer reviewed

Fedorowicz, Jane – Journal of the American Society for Information Science, 1982

Derives the underlying structure of the Zipf distribution, with emphasis on its application to word frequencies in the inverted files of automatic bibliographic systems, and applies the Zipfian model to the National Library of Medicine's MEDLINE database. An appendix on the Zipfian mean and 12 references are included. (Author/JL)

Descriptors: Citations (References), Databases, Information Retrieval, Mathematical Models
