Skip to contents

Create a distribution plot of the numeric variable in general and statistical summary of the feature . In addition, the correlation values of the input variable with other features based on a threshold will also be returned

Usage

num_dist_summary(
  col_name,
  data,
  title = "",
  label_x = "",
  thresh_corr = 0.3,
  stat = TRUE
)

Arguments

col_name

A string with the name of the column

data

A dataframe , target DataSet for visualization

title

A string , Title for chart , default =''

label_x

A string, X-axis label for numeric column

thresh_corr

Number Threshold value for correlation

stat

Boolean, whether to show statistic information or not

Value

plot and dataframe to display summary statistics and correlation values @examples num_dist_summary( col_name =num1, data=df, title='num_col1',label_x= 'distribution of num1', thresh_corr = 0.3,stat = TRUE )