Tools to make EDA easier!
This package is aimed at making the EDA process more effective. Basically, we found there were tons of repetitive work when getting a glimpse of the data set. To stop wasting time in repeating procedures, our team came up with the idea to develop a toolkit that includes the following functions:
preprocess(path, method=NULL, fill_value=NULL, read_func=readr::read_csv, ...)
:column_stats(data, columns)
:numeric_plots(df)
:plot_histogram(data, columns = "all", num_bins = 30)
:Surely, EDA is not a new topic to data scientists. There are quite a few packages doing similar work on CRAN
. However, most of them only include limited functions like just providing descriptive statistics. Our proposal is more of a one-in-all toolkit for EDA. Below is a list of sister-projects.
brinton
correlationfunnel
ezEDA
You can install the released version of EDAhelperR
from this repo at the R console:
::install_github('UBC-MDS/EDAhelperR') devtools
Example usage:
library(EDAhelperR)
preprocess(readr::readr_example("mtcars.csv"))
column_stats(iris, c('Sepal.Length', 'Sepal.Width', 'Petal.Length'))
numeric_plots(df)
plot_histogram(mtcars)
Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.
EDAhelperR
was created by Rowan Sivanandam, Steven Leung, Vera Cui, Jennifer Hoang. It is licensed under the terms of the MIT license.
EDAhelperR
was created with usethis
.