Function to create summary statistics and basic EDA plots. Given a data frame, this function outputs general exploratory analysis plots as well as basic statistics summarizing trends in the features of the input data.

arid_eda(df, response, response_type = "numeric", features = c())

Arguments

df

(data frame): the input data frame to analyze

response

(character): the column name of the response variable

response_type

(character): string indicating if response is 'categorical' or 'continuous' (default: 'numeric')

features

(list<string>): a list of explanatory variable column names (default: c())

Value

data frame: a data frame with a list of features and their coefficients

plot: a ggplot object containing the EDA