- Load Dataset directly as csv into panda dataframe
- Train set, validation set and test set is loaded seperately.
- Duplicate dataset for undersampled (Skip this for first fit)
- Check for Missing Data
- Check for ordinal data masked us numerical data
- Plots and charts for Data
- Dummy variables
- Impute variables?
- Feature Selection
- Scaling
- F1 score
- confusion matrix
- F1 score
- Confusion matrix