NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 16 results Save | Export
Biyi Wen – ProQuest LLC, 2024
This dissertation is an inquiry into the history of Chinese computer-based word processing technologies from 1958 to 1997. My purpose in looking into the history is to answer, how the development and usage of word-processing technologies have contributed to knowledge production, by shaping the understanding of what information technologies can do.…
Descriptors: History, Chinese, Word Processing, Computer Software
Arielle A. Gaither – ProQuest LLC, 2021
Bag-of-words is a commonly used text representation method for many text classification applications. However, bag-of-words representation fails to consider the context of the text because it only examines text documents based on the presence of individual words and explores relationships between texts with similar word choices (Bengfort, 2018).…
Descriptors: Journal Articles, Classification, Language Usage, Databases
Mehay, Dennis Nolan – ProQuest LLC, 2012
Machine translation (MT) systems attempt to translate texts from one language into another by translating words from a "source language" and rearranging them into fluent utterances in a "target language." When the two languages organize concepts in very different ways, knowledge of their general sentence structure, or…
Descriptors: Syntax, Computational Linguistics, Translation, Grammar
Guo, Lifan – ProQuest LLC, 2013
As we witness the prosperity of the social media in the past few years, and feel the explosion of "user-generated content" on the Internet, there is little question that we have entered an era of Big Data. Those social media sites, such as Facebook, LinkedIn, Quora and Twitter have been important sources for a wide spectrum of users.…
Descriptors: Semantics, Social Media, Social Networks, Web Sites
Ture, Ferhan – ProQuest LLC, 2013
With the adoption of web services in daily life, people have access to tremendous amounts of information, beyond any human's reading and comprehension capabilities. As a result, search technologies have become a fundamental tool for accessing information. Furthermore, the web contains information in multiple languages, introducing another barrier…
Descriptors: Translation, Second Languages, Written Language, Computational Linguistics
Pfaff, Jann – ProQuest LLC, 2013
Defining fall risk factors and predicting fall risk status among patients in acute care has been a topic of research for decades. With increasing pressure on hospitals to provide quality care and prevent hospital-acquired conditions, the search for effective fall prevention interventions continues. Hundreds of risk factors for falls in acute care…
Descriptors: Patients, Risk, Injuries, Prediction
Christopherson, Laura L. – ProQuest LLC, 2013
Increasing use of the Internet has led to a proliferation of online communication and information sharing media. These media, each with its own set of affordances and limitations, are thought to encourage new ways to communicate. Interlocutors refashion general English into abbreviated and often pictographic representations of existing concepts.…
Descriptors: Spelling, Information Technology, Computer Mediated Communication, Synchronous Communication
Boxwell, Stephen A. – ProQuest LLC, 2011
Treebanks are a necessary prerequisite for many NLP tasks, including, but not limited to, semantic role labeling. For many languages, however, treebanks are either nonexistent or too small to be useful. Time-critical applications may require rapid deployment of natural language software for a new critical language--much faster than the development…
Descriptors: Natural Language Processing, Training, Programming, Computer System Design
Kim, Jaewook – ProQuest LLC, 2011
One of the most critical steps to integrating heterogeneous e-Business applications using different XML schemas is schema matching, which is known to be costly and error-prone. Many automatic schema matching approaches have been proposed, but the challenge is still daunting because of the complexity of schemas and immaturity of technologies in…
Descriptors: Information Technology, Information Retrieval, Programming Languages, Programming
Jonnalagadda, Siddhartha – ProQuest LLC, 2011
In the current millennium, extensive use of computers and the internet caused an exponential increase in information. Few research areas are as important as information extraction, which primarily involves extracting concepts and the relations between them from free text. Limitations in the size of training data, lack of lexicons and lack of…
Descriptors: Sentences, Semantics, Biomedicine, Information Retrieval
Talukdar, Partha Pratim – ProQuest LLC, 2010
The variety and complexity of potentially-related data resources available for querying--webpages, databases, data warehouses--has been growing ever more rapidly. There is a growing need to pose integrative queries "across" multiple such sources, exploiting foreign keys and other means of interlinking data to merge information from diverse…
Descriptors: Information Needs, Databases, Data, Data Analysis
Heintz, Ilana – ProQuest LLC, 2010
The goal of this dissertation is to introduce a method for deriving morphemes from Arabic words using stem patterns, a feature of Arabic morphology. The motivations are three-fold: modeling with morphemes rather than words should help address the out-of-vocabulary problem; working with stem patterns should prove to be a cross-dialectally valid…
Descriptors: Semitic Languages, Dialects, Vowels, Morphemes
Lu, Hsin-Min – ProQuest LLC, 2010
Deep penetration of personal computers, data communication networks, and the Internet has created a massive platform for data collection, dissemination, storage, and retrieval. Large amounts of textual data are now available at a very low cost. Valuable information, such as consumer preferences, new product developments, trends, and opportunities,…
Descriptors: Classification, Internet, Information Retrieval, Information Technology
Boyd-Graber, Jordan – ProQuest LLC, 2010
Topic models like latent Dirichlet allocation (LDA) provide a framework for analyzing large datasets where observations are collected into groups. Although topic modeling has been fruitfully applied to problems social science, biology, and computer vision, it has been most widely used to model datasets where documents are modeled as exchangeable…
Descriptors: Language Patterns, Semantics, Linguistics, Multilingualism
Davault, Julius M., III. – ProQuest LLC, 2009
One of the problems associated with automatic thesaurus construction is with determining the semantic relationship between word pairs. Quasi-synonyms provide a type of equivalence relationship: words are similar only for purposes of information retrieval. Determining such relationships in a thesaurus is hard to achieve automatically. The term…
Descriptors: Semantics, Information Retrieval, Computational Linguistics, Reference Materials
Previous Page | Next Page ยป
Pages: 1  |  2