Here is some basic terminology used in ML:
{fig-alt:“Supervised machine learning terminology” fig-align=“center” width=“90%”}
df = pd.read_csv("data/kc_house_data.csv") df.head(3)
3 rows × 19 columns
df.shape
(1000, 19)
classification_df = pd.read_csv("data/quiz2-grade-toy-classification.csv") classification_df.head(3)
classification_df.shape
(21, 8)
X = classification_df.drop(columns=["quiz2"]) y = classification_df["quiz2"] X.head()
y.head()
0 A+ 1 not A+ 2 not A+ 3 A+ 4 A+ Name: quiz2, dtype: object