Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 12 |
Descriptor
Source
Author
Publication Type
Reports - Descriptive | 23 |
Journal Articles | 19 |
Information Analyses | 4 |
Speeches/Meeting Papers | 3 |
Opinion Papers | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 1 |
Audience
Practitioners | 1 |
Researchers | 1 |
Teachers | 1 |
Location
China | 2 |
Italy | 1 |
Ukraine | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
MacArthur Communicative… | 1 |
What Works Clearinghouse Rating
Vera Kempe; Patricia J. Brooks; Steven Gillis – Language Teaching Research Quarterly, 2024
The Child Language Data Exchange System (CHILDES), created by Brain MacWhinney and Catherine Snow in 1984, is one of the earliest Open Science and data sharing initiatives in child language development research, and probably in developmental psychology and the behavioral sciences more generally. It is the cornerstone of TalkBank--a repository of…
Descriptors: Databases, Child Language, Language Acquisition, Language Research
Stefania Spina; Irene Fioravanti; Luciana Forti; Fabio Zanda – Second Language Research, 2024
This article introduces the CELI corpus, a new learner corpus of written Italian consisting of ca. 600,000 tokens, evenly distributed among CEFR (Common European Framework of Reference for Languages) proficiency levels B1, B2, C1 and C2. The collected texts derive from the language certification exams administered by the University for Foreigners…
Descriptors: Computational Linguistics, Second Language Instruction, Second Language Learning, Best Practices
Jeaco, Stephen – International Journal of Computer-Assisted Language Learning and Teaching, 2019
One of the greatest impacts of corpus linguistics on language teaching has been in the recognition of the importance of collocation. A very influential guide for language teachers with regard to teaching collocation has been the Lexical Approach. Activities pointing students to rich collocational information in monolingual dictionaries, in texts…
Descriptors: Phrase Structure, Second Language Learning, Second Language Instruction, Teaching Methods
Frankenberg-Garcia, Ana; Lew, Robert; Roberts, Jonathan C.; Rees, Geraint Paul; Sharma, Nirwan – ReCALL, 2019
Corpora have given rise to a wide range of lexicographic resources aimed at helping novice users of academic English with their writing. This includes academic vocabulary lists, a variety of textbooks, and even a bespoke academic English dictionary. However, writers may not be familiar with these resources or may not be sufficiently aware of the…
Descriptors: Computational Linguistics, English for Academic Purposes, Lexicography, Writing Instruction
Frank, Michael C.; Braginsky, Mika; Yurovsky, Daniel; Marchman, Virginia A. – Journal of Child Language, 2017
The MacArthur-Bates Communicative Development Inventories (CDIs) are a widely used family of parent-report instruments for easy and inexpensive data-gathering about early language acquisition. CDI data have been used to explore a variety of theoretically important topics, but, with few exceptions, researchers have had to rely on data collected in…
Descriptors: Language Skills, Measures (Individuals), Language Acquisition, Databases
Todirascu, Amalia; Cargill, Marion – Research-publishing.net, 2019
We present SimpleApprenant, a platform aiming to improve French L2 learners' knowledge of Multi Word Expressions (MWEs). SimpleApprenant integrates an MWE database annotated with the Common European Framework of Reference for languages (CEFR) level and several Natural Language Processing (NLP) tools: a spelling checker, a parser, and a set of…
Descriptors: French, Phrase Structure, Second Language Learning, Second Language Instruction
Calzada Pérez, María – Indian Journal of Applied Linguistics, 2013
The present paper revolves around MaxiECPC, one of the various sub-corpora that make up ECPC (the European Comparable and Parallel Corpora), an electronic archive of speeches delivered at different parliaments (i.e. the European Parliament-EP; the Spanish Congreso de los Diputados-CD; and the British House of Commons-HC) from 1996 to 2009. In…
Descriptors: Electronic Publishing, Databases, Archives, Speeches
Ferrero, Carmen Lopez – Applied Linguistics, 2012
The aim of this article is to describe the grammatical patterns of a set of nouns frequently used in Spanish specialized discourse: the so-called "semiterms". The following nouns were selected for the study: "problema" "problem", "resultado" "result", "motivo" "motive/reason", "razon" "reason", and "consecuencia" "consequence". Apart from…
Descriptors: Grammar, Language Patterns, Nouns, Spanish
Waterfall, Heidi R.; Sandbank, Ben; Onnis, Luca; Edelman, Shimon – Journal of Child Language, 2010
This paper reports progress in developing a computer model of language acquisition in the form of (1) a generative grammar that is (2) algorithmically learnable from realistic corpus data, (3) viable in its large-scale quantitative performance and (4) psychologically real. First, we describe new algorithmic methods for unsupervised learning of…
Descriptors: Generative Grammar, Language Acquisition, Computational Linguistics, Databases
Sagae, Kenji; Davis, Eric; Lavie, Alon; MacWhinney, Brian; Wintner, Shuly – Journal of Child Language, 2010
Corpora of child language are essential for research in child language acquisition and psycholinguistics. Linguistic annotation of the corpora provides researchers with better means for exploring the development of grammatical constructions and their usage. We describe a project whose goal is to annotate the English section of the CHILDES database…
Descriptors: Psycholinguistics, Grammar, Child Language, Language Acquisition
Boguslavsky, Igor; Cardenosa, Jesus; Gallardo, Carolina – Applied Linguistics, 2009
Multilingual lexicons are needed in various applications, such as cross-lingual information retrieval, machine translation, and some others. Often, these applications suffer from the ambiguity of dictionary items, especially when an intermediate natural language is involved in the process of the dictionary construction, since this language adds…
Descriptors: Translation, Figurative Language, Multilingualism, Dictionaries
Sha, Guoquan – Computer Assisted Language Learning, 2010
Data-driven learning (DDL), or corpus-based language learning, involves the learner in an exploratory task to discover appropriate expressions or collocates regarding his writing. However, the problematic units of meaning in each learner's writing are so diverse that conventional corpora often prove futile. The search engine Google with the…
Descriptors: Written Language, Search Engines, Second Language Learning, Computational Linguistics
Buchstaller, Isabelle – Edinburgh Working Papers in Applied Linguistics, 2003
This paper discusses mimesis, the direct representation and total imitation of an event. It studies the co-occurrence of quotative verbs with mimetic enactment based on two corpora of U.S. American English, both available through the University of Pennsylvania Data Consortium. The Switchboard Corpus has 542 speakers ranging in age from 20-60 years…
Descriptors: Computational Linguistics, Databases, North American English, Oral Language

Tambovtsev, Yuri A. – Educational and Training Technology International, 1993
Discussion of the use of computers in Slavonic studies in the Ukraine focuses on linguistics. Topics addressed include the Machine Fund of Russian, a Russian language database; the Machine Fund of Non-Russian Languages that includes each republic of the former Soviet Union; natural language processing; and comparing languages. (18 references) (LRW)
Descriptors: Computational Linguistics, Databases, Foreign Countries, Language Classification

MacWhinney, Brian; Snow, Catherine – Journal of Child Language, 1985
Describes the formation of the Child Language Data Exchange System (CHILDES), a system formed to foster the sharing of computerized data on language acquisition. Details the governance of the system, the nature of the database, the shape of the coding conventions, and the types of computer programs being developed. (SED)
Descriptors: Child Language, Computational Linguistics, Data Collection, Databases
Previous Page | Next Page »
Pages: 1 | 2