ERIC Number: ED578271
Record Type: Non-Journal
Publication Date: 2017-Aug
Pages: 6
Abstractor: As Provided
ISBN: N/A
ISSN: N/A
EISSN: N/A
Available Date: N/A
Evaluating Lexical Coverage in Simple English Wikipedia Articles: A Corpus-Driven Study
Hendry, Clinton; Sheepy, Emily
Research-publishing.net, Paper presented at the EUROCALL 2017 Conference (Southampton, United Kingdom, Aug 23-26, 2017)
Simple English Wikipedia is a user-contributed online encyclopedia intended for young readers and readers whose first language is not English. We compiled a corpus of the entirety of Simple English Wikipedia as of June 20, 2017. We used lexical frequency profiling tools to investigate the vocabulary size needed to comprehend Simple English Wikipedia texts. We hypothesized that if the texts are indeed simple, learners should need to know far fewer than 8,000 words. Our findings indicate that the texts are not as simple as the creators of the authoring guidelines intended. We suggest that authors of simplified texts be encouraged to provide plain-language explanations of low-frequency technical terms either in-text or in glossary form. We will discuss implications for researching the pedagogical usefulness of the Simple English Wikipedia. [For the complete volume, see ED578177.]
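Illustrative sketch (not part of the original record): lexical frequency profiling, as described in the abstract, can be approximated by assigning each token in a text to a frequency band and accumulating coverage band by band. The abstract does not name the tools or word lists used, so everything below is an assumption for illustration: the function names, the toy `rank_of` frequency list, the band size, and the 0.98 coverage target are all hypothetical. The sketch also counts raw word forms, whereas profiling tools typically group words into families.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple alphabetic word tokens."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def coverage_by_band(tokens, rank_of, band_size=1000):
    """Return (cumulative coverage per band, off-list share).

    rank_of maps a word to its 1-based frequency rank in some reference
    list (hypothetical here). Band 1 holds ranks 1..band_size, band 2 the
    next band_size ranks, and so on.
    """
    total = len(tokens)
    if total == 0:
        return [], 0.0

    band_counts = Counter()
    off_list = 0
    for tok in tokens:
        rank = rank_of.get(tok)
        if rank is None:
            off_list += 1  # word not in the reference list
        else:
            band_counts[(rank - 1) // band_size + 1] += 1

    cumulative = []
    running = 0
    for band in range(1, max(band_counts, default=0) + 1):
        running += band_counts[band]
        cumulative.append((band, running / total))
    return cumulative, off_list / total


def bands_needed(cumulative, target=0.98):
    """Return the first band at which cumulative coverage reaches target."""
    for band, cov in cumulative:
        if cov >= target:
            return band
    return None  # target never reached with the listed words


# Toy data only: a five-word "frequency list" with bands of two words each.
toy_ranks = {"the": 1, "cat": 2, "sat": 3, "on": 4, "mat": 5}
toy_tokens = tokenize("The cat sat on the mat")
toy_cumulative, toy_off_list = coverage_by_band(toy_tokens, toy_ranks, band_size=2)
toy_band = bands_needed(toy_cumulative, target=0.8)
```

With a real reference list and 1,000-word bands, the band returned by `bands_needed` multiplied by the band size gives a rough estimate of the vocabulary size a reader would need to reach the chosen coverage level.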
Descriptors: Web Sites, Collaborative Writing, Teaching Methods, Computational Linguistics, English (Second Language), Second Language Learning, Reading Comprehension, Guidelines, Difficulty Level, Word Frequency, Second Language Instruction
Research-publishing.net. La Grange des Noyes, 25110 Voillans, France. e-mail: info@research-publishing.net; Web site: http://research-publishing.net
Publication Type: Speeches/Meeting Papers; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: N/A