Descriptor
| Data Analysis | 1 |
| Data Collection | 1 |
| Language Research | 1 |
| Measurement | 1 |
| Prediction | 1 |
| Word Frequency | 1 |
Source
| Cognitive Science | 1 |
Publication Type
| Journal Articles | 1 |
| Reports - Research | 1 |
The Challenges of Large-Scale, Web-Based Language Datasets: Word Length and Predictability Revisited
Meylan, Stephan C.; Griffiths, Thomas L. – Cognitive Science, 2021
Language research has come to rely heavily on large-scale, web-based datasets. These datasets can present significant methodological challenges, requiring researchers to make a number of decisions about how they are collected, represented, and analyzed. These decisions often concern long-standing challenges in corpus-based language research,…
Descriptors: Data Analysis, Data Collection, Word Frequency, Prediction

Peer reviewed
