Cleans the text in the tweets and returns as new columns in the dataframe. The cleaning process includes converting into lower case, removal of punctuation, hastags and hastag counts

clean_tweets(file_path, tokenization = TRUE, word_count = TRUE)

Arguments

file_path

character File path to csv file containing tweets data

tokenization

logical Creates new column containing cleaned tweet word tokens when True Default is True

word_count

logical Creates new column containing word count of cleaned tweets Default is True

Value

df_tweets Dataframe Dataframe containing data on cleaned tweets .

Examples