All functions

clean_tokens()

Tokenize a corpus into words and remove stopwords

corpora_best_match()

Returns a tibble giving the distance from the reference document to each corpus in a vector of corpora, sorted by increasing distance

corpora_compare()

Returns the distance between two corpora as a numeric vector

corpus_analysis()

Generate basic statistics for the words in the input corpus

corpus_viz()

Generate visualizations for words from the input corpus

load_pretrained()

Download and load pretrained word2vec models (see https://github.com/maxoodf/word2vec#basic-usage)