Arielle A. Gaither – ProQuest LLC, 2021
Bag-of-words is a commonly used text representation method for many text classification applications. However, bag-of-words representation fails to consider the context of the text because it only examines text documents based on the presence of individual words and explores relationships between texts with similar word choices (Bengfort, 2018).…
Descriptors: Journal Articles, Classification, Language Usage, Databases
Antonio, Abigail F.; Bacang, Bernardita G.; Rillo, Richard M.; Alieto, Ericson O.; Caspillo, Warrelen D. C. – Online Submission, 2019
This study is one of the pioneers in investigating and analyzing the orthographical conventions/norms of the outer circle Asian Englishes using one of the largest databases of English corpus, the Global Web-based English (GloWbE). This study extends the analysis of the current orthographical norms of the new varieties to their colonial parents.…
Descriptors: Language Variation, English (Second Language), Computational Linguistics, Databases
Adorján, Mária – Research-publishing.net, 2020
Many language teachers use Information and Communications Technology (ICT) in their classrooms to create tasks, quizzes, or polls with general online learning platforms. Few teachers have experience, however, of incorporating online corpus tools in their teaching or assessment practices. This paper will explore how autonomous learning can be…
Descriptors: Computational Linguistics, Databases, Second Language Learning, Second Language Instruction
Talukdar, Partha Pratim – ProQuest LLC, 2010
The variety and complexity of potentially-related data resources available for querying--webpages, databases, data warehouses--has been growing ever more rapidly. There is a growing need to pose integrative queries "across" multiple such sources, exploiting foreign keys and other means of interlinking data to merge information from diverse…
Descriptors: Information Needs, Databases, Data, Data Analysis
Marchand, Yannick; Adsett, Connie R.; Damper, Robert I. – Language and Speech, 2009
Automatic syllabification of words is challenging, not least because the syllable is not easy to define precisely. Consequently, no accepted standard algorithm for automatic syllabification exists. There are two broad approaches: rule-based and data-driven. The rule-based method effectively embodies some theoretical position regarding the…
Descriptors: Syllables, Databases, English, Computational Linguistics
Fan, Hui-Mei – ProQuest LLC, 2010
The present study is based on the theoretical assumptions that frequency of characters and their structural components, as well as the frequency types of structural components, are important to enable learners of Chinese as a foreign language (CFL) to discover the underlying structure of Chinese characters. In the CFL context, since reliable…
Descriptors: Textbooks, Phonetics, Semantics, Vocabulary
Zhiwei, Feng – 1995
Trends and developments in computer applications in Chinese language research are described, focusing on these areas: input of Chinese characters and Chinese corpus; automatic segmentation of Chinese written text in corpus; development of a grammar knowledge base for Chinese words to be used as a resource for text segmentation and corpus…
Descriptors: Chinese, Computational Linguistics, Computer Software, Databases