An rigorous, well documented machine learning analysis pipeline for binary classification datasets assembled as parallelizable command line modules. Includes exploratory analysis, data processing, feature processing, ML modeling (11 algorithms) with hyperparameter sweeps, visualizations, and statistical analysis. A comprehensive starting point to adapt to your own dataset. Testing changes