Huapu Liu – ProQuest LLC, 2024
This two-part dissertation centers on a re-examination of the role of book indexes in information retrieval research on full-text digital book collections in digital libraries. Early research focused on information retrieval and book indexes (in addition to other parts of books) in the 2000s when the Google Books corpus was first released to the…
Descriptors: Information Retrieval, Indexes, Reference Materials, Semantics
Piyapong Laosrirattanachai; Piyanuch Laosrirattanachai – LEARN Journal: Language Education and Acquisition Research Network, 2024
Lexical bundles and moves are essential for vloggers to communicate clearly and purposefully within travel vlog discourse. It is crucial for L2 learners and practitioners aiming to enter the industry to master these bundles and understand the moves used in creating travel vlogs. This corpus-based study compiled a list of 239 four-word lexical…
Descriptors: Phrase Structure, Electronic Publishing, Second Language Learning, Second Language Instruction
Lijin Zhang; Xueyang Li; Zhiyong Zhang – Grantee Submission, 2023
The thriving developer community has a significant impact on the widespread use of R software. To better understand this community, we conducted a study analyzing all R packages available on CRAN. We identified the most popular topics of R packages by text mining the package descriptions. Additionally, using network centrality measures, we…
Descriptors: Computer Software, Programming Languages, Data Analysis, Visual Aids
Shaurya Rohatgi – ProQuest LLC, 2023
The exponential growth of digital libraries and the proliferation of scholarly content in electronic formats have made data mining and information retrieval essential tools for effectively managing, organizing, and disseminating knowledge. This thesis provides a comprehensive analysis of the advancements and challenges in these fields, with a…
Descriptors: Data Use, Data Analysis, Information Retrieval, Database Design
Arielle A. Gaither – ProQuest LLC, 2021
Bag-of-words is a commonly used text representation method for many text classification applications. However, bag-of-words representation fails to consider the context of the text because it only examines text documents based on the presence of individual words and explores relationships between texts with similar word choices (Bengfort, 2018).…
Descriptors: Journal Articles, Classification, Language Usage, Databases
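The limitation described in this abstract can be illustrated with a minimal sketch. It assumes a simple lowercase, whitespace-split tokenizer; the helper name `bag_of_words` is hypothetical and is not taken from the dissertation. Two sentences with opposite meanings receive identical representations because word order is discarded.

```python
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Count lowercase, whitespace-separated tokens; word order is discarded."""
    return Counter(text.lower().split())


# Hypothetical example sentences: same words, opposite meaning.
first = bag_of_words("the dog bit the man")
second = bag_of_words("the man bit the dog")

# Both sentences map to the same word counts, so the representation
# cannot distinguish who bit whom.
print(first == second)  # prints True
```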
Cai, Zhiqiang; Siebert-Evenstone, Amanda; Eagan, Brendan; Shaffer, David Williamson; Hu, Xiangen; Graesser, Arthur C. – Grantee Submission, 2019
Coding is a process of assigning meaning to a given piece of evidence. Evidence may be found in a variety of data types, including documents, research interviews, posts from social media, conversations from learning platforms, or any source of data that may provide insights for the questions under qualitative study. In this study, we focus on text…
Descriptors: Semantics, Computational Linguistics, Evidence, Coding
Anglin, Kylie L. – Journal of Research on Educational Effectiveness, 2019
Education researchers have traditionally faced severe data limitations in studying local policy variation; administrative data sets capture only a fraction of districts' policy decisions, and it can be expensive to collect more nuanced implementation data from teachers and leaders. Natural language processing and web-scraping techniques can help…
Descriptors: Natural Language Processing, Educational Policy, Web Sites, Decision Making
Anglin, Kylie L. – Grantee Submission, 2019
Education researchers have traditionally faced severe data limitations in studying local policy variation; administrative datasets capture only a fraction of districts' policy decisions, and it can be expensive to collect more nuanced implementation data from teachers and leaders. Natural language processing and web-scraping techniques can help…
Descriptors: Natural Language Processing, Educational Policy, Web Sites, Decision Making
Ezen-Can, Aysu; Boyer, Kristy Elizabeth – Journal of Educational Data Mining, 2015
Within the landscape of educational data, textual natural language is an increasingly vast source of learning-centered interactions. In natural language dialogue, student contributions hold important information about knowledge and goals. Automatically modeling the dialogue act of these student utterances is crucial for scaling natural language…
Descriptors: Classification, Dialogs (Language), Computational Linguistics, Information Retrieval
Lu, Hsin-Min – ProQuest LLC, 2010
Deep penetration of personal computers, data communication networks, and the Internet has created a massive platform for data collection, dissemination, storage, and retrieval. Large amounts of textual data are now available at a very low cost. Valuable information, such as consumer preferences, new product developments, trends, and opportunities,…
Descriptors: Classification, Internet, Information Retrieval, Information Technology
Sanan, Majed; Rammal, Mahmoud; Zreik, Khaldoun – Interactive Technology and Smart Education, 2008
Purpose: Classification of Arabic documents has recently become a real problem for juridical centers. In this case, some of the Lebanese official journal documents are already classified, and the center has to classify new documents based on them. This paper aims to study and explain the useful application of a supervised learning method to Arabic texts…
Descriptors: Semitic Languages, Classification, Information Retrieval, Periodicals
Steinke, Elisabeth – 1970
An approach to using the computer to assemble German tests is described. The purposes of the system would be: (1) an expansion of the bilingual lexical memory bank to list and store idioms of all degrees of difficulty, with frequency data and with complete and sophisticated retrieval possibility for assembly; (2) the creation of an…
Descriptors: Classification, Computational Linguistics, Computer Oriented Programs, German
Zender, Mike; Crutcher, Keith A. – Visible Language, 2007
The accelerating rate of data generation and resulting publications are taxing the ability of scientific investigators to stay current with the emerging literature. This problem, acute in science, is not uncommon in other areas. New approaches to managing this explosion of information are needed. While it is only possible to read one paper or…
Descriptors: Alzheimers Disease, Pathology, Scientific Concepts, Research Methodology