ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	16

Descriptor

Computational Linguistics	16
Information Technology	16
Information Retrieval	8
Computer Software	6
Natural Language Processing	6
Programming	6
Semantics	6
Computer System Design	5
Models	4
Classification	3
Data Analysis	3
Databases	3
Vocabulary	3
Business	2
Chinese	2
Comparative Analysis	2
Content Analysis	2
Data	2
Electronic Publishing	2
Guidelines	2
Internet	2
Language Processing	2
Language Usage	2
Multilingualism	2
Risk	2

Source

ProQuest LLC

Author

Arielle A. Gaither	1
Biyi Wen	1
Boxwell, Stephen A.	1
Boyd-Graber, Jordan	1
Christopherson, Laura L.	1
Davault, Julius M., III.	1
Fan, Hui-Mei	1
Guo, Lifan	1
Heintz, Ilana	1
Jonnalagadda, Siddhartha	1
Kim, Jaewook	1
Lu, Hsin-Min	1
Mehay, Dennis Nolan	1
Pfaff, Jann	1
Talukdar, Partha Pratim	1
Ture, Ferhan	1

Publication Type

Dissertations/Theses -…

Education Level

Adult Education	1
Higher Education	1
Postsecondary Education	1

Location

China

Showing 1 to 15 of 16 results

textBox: An Intermedia History of Chinese Word Processing

Biyi Wen – ProQuest LLC, 2024

This dissertation is an inquiry into the history of Chinese computer-based word processing technologies from 1958 to 1997. My purpose in looking into the history is to answer how the development and usage of word-processing technologies have contributed to knowledge production by shaping the understanding of what information technologies can do.…

Descriptors: History, Chinese, Word Processing, Computer Software

Context Matters: Employing Word Embeddings to Improve Text Classifier Performance on Peer-Reviewed Academic Journal Abstracts--A Test Case

Arielle A. Gaither – ProQuest LLC, 2021

Bag-of-words is a commonly used text representation method for many text classification applications. However, bag-of-words representation fails to consider the context of the text because it only examines text documents based on the presence of individual words and explores relationships between texts with similar word choices (Bengfort, 2018).…

Descriptors: Journal Articles, Classification, Language Usage, Databases

Bean Soup Translation: Flexible, Linguistically-Motivated Syntax for Machine Translation

Mehay, Dennis Nolan – ProQuest LLC, 2012

Machine translation (MT) systems attempt to translate texts from one language into another by translating words from a "source language" and rearranging them into fluent utterances in a "target language." When the two languages organize concepts in very different ways, knowledge of their general sentence structure, or…

Descriptors: Syntax, Computational Linguistics, Translation, Grammar

Semantically Enhanced Topic Modeling and Its Applications in Social Media

Guo, Lifan – ProQuest LLC, 2013

As we witness the prosperity of the social media in the past few years, and feel the explosion of "user-generated content" on the Internet, there is little question that we have entered an era of Big Data. Those social media sites, such as Facebook, LinkedIn, Quora and Twitter have been important sources for a wide spectrum of users.…

Descriptors: Semantics, Social Media, Social Networks, Web Sites

Searching to Translate and Translating to Search: When Information Retrieval Meets Machine Translation

Ture, Ferhan – ProQuest LLC, 2013

With the adoption of web services in daily life, people have access to tremendous amounts of information, beyond any human's reading and comprehension capabilities. As a result, search technologies have become a fundamental tool for accessing information. Furthermore, the web contains information in multiple languages, introducing another barrier…

Descriptors: Translation, Second Languages, Written Language, Computational Linguistics

Representing and Retrieving Patients' Falls Risk Factors and Risk for Falls among Adults in Acute Care through the Electronic Health Record

Pfaff, Jann – ProQuest LLC, 2013

Defining fall risk factors and predicting fall risk status among patients in acute care has been a topic of research for decades. With increasing pressure on hospitals to provide quality care and prevent hospital-acquired conditions, the search for effective fall prevention interventions continues. Hundreds of risk factors for falls in acute care…

Descriptors: Patients, Risk, Injuries, Prediction

OMG! L2spell Online: The Creative Vocabulary of Cyberlanguage s(~_^)--b

Christopherson, Laura L. – ProQuest LLC, 2013

Increasing use of the Internet has led to a proliferation of online communication and information sharing media. These media, each with its own set of affordances and limitations, are thought to encourage new ways to communicate. Interlocutors refashion general English into abbreviated and often pictographic representations of existing concepts.…

Descriptors: Spelling, Information Technology, Computer Mediated Communication, Synchronous Communication

A CCG-Based Method for Training a Semantic Role Labeler in the Absence of Explicit Syntactic Training Data

Boxwell, Stephen A. – ProQuest LLC, 2011

Treebanks are a necessary prerequisite for many NLP tasks, including, but not limited to, semantic role labeling. For many languages, however, treebanks are either nonexistent or too small to be useful. Time-critical applications may require rapid deployment of natural language software for a new critical language--much faster than the development…

Descriptors: Natural Language Processing, Training, Programming, Computer System Design

A Semantic Analysis of XML Schema Matching for B2B Systems Integration

Kim, Jaewook – ProQuest LLC, 2011

One of the most critical steps to integrating heterogeneous e-Business applications using different XML schemas is schema matching, which is known to be costly and error-prone. Many automatic schema matching approaches have been proposed, but the challenge is still daunting because of the complexity of schemas and immaturity of technologies in…

Descriptors: Information Technology, Information Retrieval, Programming Languages, Programming

An Effective Approach to Biomedical Information Extraction with Limited Training Data

Jonnalagadda, Siddhartha – ProQuest LLC, 2011

In the current millennium, extensive use of computers and the internet caused an exponential increase in information. Few research areas are as important as information extraction, which primarily involves extracting concepts and the relations between them from free text. Limitations in the size of training data, lack of lexicons and lack of…

Descriptors: Sentences, Semantics, Biomedicine, Information Retrieval

Graph-Based Weakly-Supervised Methods for Information Extraction & Integration

Talukdar, Partha Pratim – ProQuest LLC, 2010

The variety and complexity of potentially-related data resources available for querying--webpages, databases, data warehouses--has been growing ever more rapidly. There is a growing need to pose integrative queries "across" multiple such sources, exploiting foreign keys and other means of interlinking data to merge information from diverse…

Descriptors: Information Needs, Databases, Data, Data Analysis

Arabic Language Modeling with Stem-Derived Morphemes for Automatic Speech Recognition

Heintz, Ilana – ProQuest LLC, 2010

The goal of this dissertation is to introduce a method for deriving morphemes from Arabic words using stem patterns, a feature of Arabic morphology. The motivations are three-fold: modeling with morphemes rather than words should help address the out-of-vocabulary problem; working with stem patterns should prove to be a cross-dialectally valid…

Descriptors: Semitic Languages, Dialects, Vowels, Morphemes

Surveillance in the Information Age: Text Quantification, Anomaly Detection, and Empirical Evaluation

Lu, Hsin-Min – ProQuest LLC, 2010

Deep penetration of personal computers, data communication networks, and the Internet has created a massive platform for data collection, dissemination, storage, and retrieval. Large amounts of textual data are now available at a very low cost. Valuable information, such as consumer preferences, new product developments, trends, and opportunities,…

Descriptors: Classification, Internet, Information Retrieval, Information Technology

Linguistic Extensions of Topic Models

Boyd-Graber, Jordan – ProQuest LLC, 2010

Topic models like latent Dirichlet allocation (LDA) provide a framework for analyzing large datasets where observations are collected into groups. Although topic modeling has been fruitfully applied to problems in social science, biology, and computer vision, it has been most widely used to model datasets where documents are modeled as exchangeable…

Descriptors: Language Patterns, Semantics, Linguistics, Multilingualism

Resolving Quasi-Synonym Relationships in Automatic Thesaurus Construction Using Fuzzy Rough Sets and an Inverse Term Frequency Similarity Function

Davault, Julius M., III. – ProQuest LLC, 2009

One of the problems associated with automatic thesaurus construction is with determining the semantic relationship between word pairs. Quasi-synonyms provide a type of equivalence relationship: words are similar only for purposes of information retrieval. Determining such relationships in a thesaurus is hard to achieve automatically. The term…

Descriptors: Semantics, Information Retrieval, Computational Linguistics, Reference Materials
