NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Xu, Wei – ProQuest LLC, 2014
Our language changes very rapidly, accompanying political, social and cultural trends, as well as the evolution of science and technology. The Internet, especially the social media, has accelerated this process of change. This poses a severe challenge for both human beings and natural language processing (NLP) systems, which usually only model a…
Descriptors: Data Analysis, Language Variation, Natural Language Processing, Computational Linguistics
Peer reviewed Peer reviewed
Direct linkDirect link
Rett, Jessica; Hyams, Nina – Language Acquisition: A Journal of Developmental Linguistics, 2014
This article presents several empirical studies of syntactically encoded evidentiality in English. The first part of our study consists of an adult online experiment that confirms claims in Asudeh & Toivonen (2012) that raised Perception Verb Similatives (PVSs; e.g. "John looks like he is sick") encode direct evidentiality. We then…
Descriptors: Syntax, Databases, Grammar, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
De Clerck, Bernard; Colleman, Timothy – Language Sciences, 2013
In this paper a case of synchronic layering is examined in which Dutch "massa" ("mass") and plural "massa's" ("masses") are attested with lexical uses as a collective noun, quantifying uses ("a large quantity of") and intensifying uses ("very")--with plural "massa's" only--in some Flemish varieties of Dutch. Against the background of…
Descriptors: Indo European Languages, Morphology (Languages), Nouns, Language Variation
Boulton, Alex – European Association for Computer-Assisted Language Learning (EUROCALL), 2012
Corpora have multiple affordances, not least for use by teachers and learners of a foreign language (L2) in what has come to be known as "data-driven learning" or DDL. The corpus and concordance interface were originally conceived by and for linguists, so other users need to adopt the role of "language researcher" to make the most of them. Despite…
Descriptors: Computational Linguistics, Second Language Instruction, Second Language Learning, Data
Yamangil, Elif – ProQuest LLC, 2013
The past two decades have shown an unexpected effectiveness of "Web-scale" data in natural language processing. Even the simplest models, when paired with unprecedented amounts of unstructured and unlabeled Web data, have been shown to outperform sophisticated ones. It has been argued that the effectiveness of Web-scale data has…
Descriptors: Models, Natural Language Processing, Computational Linguistics, Bayesian Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Crasborn, Onno – Sign Language Studies, 2010
Recent technologies in the area of video and Internet are allowing the creation and online publication of large signed language corpora. Primarily addressing the needs of linguists and other researchers, because of their unique character in history these data collections are also made accessible online for a general audience. This "open access"…
Descriptors: Sign Language, Internet, Researchers, Computational Linguistics
Peer reviewed Peer reviewed
Direct linkDirect link
Chang, Ching-Fen; Kuo, Chih-Hua – English for Specific Purposes, 2011
There has been increasing interest in the possible applications of corpora to both linguistic research and pedagogy. This study takes a corpus-based, genre-analytic approach to discipline-specific materials development. Combining corpus analysis with genre analysis makes it possible to develop teaching materials that are not only authentic but…
Descriptors: Feedback (Response), Graduate Students, Writing (Composition), Language Research
Peer reviewed Peer reviewed
Direct linkDirect link
Park, Kwanghyun; Kinginger, Celeste – Language Learning & Technology, 2010
The advance of digital video technology in the past two decades facilitates empirical investigation of learning in real time. The focus of this paper is the combined use of real-time digital video and a networked linguistic corpus for exploring the ways in which these technologies enhance our capability to investigate the cognitive process of…
Descriptors: Computer Assisted Instruction, Learning Processes, Computational Linguistics, Video Technology
Peer reviewed Peer reviewed
Direct linkDirect link
Shei, Chi-Chiang – Computer Assisted Language Learning, 2008
Formulaic speech has been notoriously difficult to define and identify despite its crucial importance to native-like fluency and idiomaticity. In this article, I introduce a way of identifying phraseological units in a running text. I am interested in recurrent fragments like "charged with crimes against humanity" in texts which involve multiple…
Descriptors: Sentences, Search Engines, Internet, Language Teachers
Erjavec, Tomaz – 1995
This paper presents an introduction to language engineering software, especially for computerized language and text corpora. The focus of the paper is on small and relatively independent pieces of software designed for specific, often low-level language analysis tasks, and on tools in the public domain. Discussion begins with the application of…
Descriptors: Computational Linguistics, Computer Software, Discourse Analysis, Foreign Countries
Kruyt, J. G.; Raaijmakers, S. A.; van der Kamp, P. H. J.; van Strien, R. J. – 1995
Corpora of present-day Dutch developed by the Institute for Dutch Lexicology include two linguistically annotated corpora that can be accessed via Internet: a 5-million word corpus covering a variety of topics and text types, and a 27-million word newspaper corpus. The texts of both were acquired in machine-readable form and have been lemmatized…
Descriptors: Access to Information, Computational Linguistics, Computer Software, Discourse Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Braun, Sabine – ReCALL, 2005
The potential of corpora for language learning and teaching has been widely acknowledged and their ready availability on the Web has facilitated access for a broad range of users, including language teachers and learners. However, the integration of corpora into general language learning and teaching practice has so far been disappointing. In this…
Descriptors: Second Language Instruction, Second Language Learning, Educational Technology, Internet