NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 4 results Save | Export
Cai, Zhiqiang; Siebert-Evenstone, Amanda; Eagan, Brendan; Shaffer, David Williamson – Grantee Submission, 2021
When text datasets are very large, manually coding line by line becomes impractical. As a result, researchers sometimes try to use machine learning algorithms to automatically code text data. One of the most popular algorithms is topic modeling. For a given text dataset, a topic model provides probability distributions of words for a set of…
Descriptors: Coding, Artificial Intelligence, Models, Probability
Cai, Zhiqiang; Siebert-Evenstone, Amanda; Eagan, Brendan; Shaffer, David Williamson; Hu, Xiangen; Graesser, Arthur C. – Grantee Submission, 2019
Coding is a process of assigning meaning to a given piece of evidence. Evidence may be found in a variety of data types, including documents, research interviews, posts from social media, conversations from learning platforms, or any source of data that may provide insights for the questions under qualitative study. In this study, we focus on text…
Descriptors: Semantics, Computational Linguistics, Evidence, Coding
Peer reviewed Peer reviewed
Bookstein, Abraham; Klein, Shmuel T. – Information Processing and Management, 1992
Presents new methods for compressing bit matrices in large information retrieval systems which exploit possible correlations between rows of words and columns of documents. Three encoding methods are tested and compared--Shannon-Fano, arithmetic, and Huffman--and an appendix discusses binomial coefficients. (20 references) (LRW)
Descriptors: Coding, Comparative Analysis, Correlation, Information Processing
McCollum, Roy – 1995
With today's growth in the use of electronic information systems for e-mail, data development and research, and the relative ease of access to such resources, protecting one's data and correspondence has become a great concern. "Pretty Good Privacy" (PGP), an encryption program developed by Phil Zimmermann, may be the software tool that…
Descriptors: Access to Information, Coding, Computer Networks, Computer Security