clean_tokens()
Tokenize a corpus into words and remove stopwords
corpora_best_match()
Returns a tibble of distances from the reference document for each corpus in a vector of corpora. The tibble is sorted by increasing distance.
corpora_compare()
Returns the distance between two corpora as a numeric vector
corpus_analysis()
Generate basic statistics for the words in the input corpus
corpus_viz()
Generate visualizations for the words in the input corpus
load_pretrained()
Download and load pretrained word2vec models (see https://github.com/maxoodf/word2vec#basic-usage)
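
As a rough illustration of the tokenize-and-remove-stopwords step described for clean_tokens(), the base R sketch below may help. It is not the package's implementation: the helper name sketch_clean_tokens(), its arguments, and the short stopword list are all assumptions made for this example, and the real function may use a different tokenizer and stopword source.

```r
# Hypothetical helper for illustration only; not the package's clean_tokens().
sketch_clean_tokens <- function(text,
                                stopwords = c("a", "an", "and", "the",
                                              "of", "is", "to", "in")) {
  # Lowercase the input, then split on runs of non-letter characters.
  tokens <- unlist(strsplit(tolower(text), "[^a-z]+"))
  # Drop empty strings left behind by leading separators.
  tokens <- tokens[nzchar(tokens)]
  # Keep only the tokens that are not in the (assumed) stopword list.
  tokens[!tokens %in% stopwords]
}

sketch_clean_tokens("The cat sat on the mat, and the dog barked.")
# [1] "cat"    "sat"    "on"     "mat"    "dog"    "barked"
```

A fixed stopword vector keeps the sketch self-contained. In practice a curated list from a text-mining package would likely be a better choice.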