50% of data science is copy + paste. The other 50% is figuring out what to copy + paste.
This aim of this book is to help a practitioner understand binary classification by providing a collection of scripts you can copy + paste into their project. They should be fully annotated and self explanatory. If more explanation is needed, submit an issue and I will look into it. If you use the work in something that is public facing, adding a no-cite reference to my DOI is appreciated.
Right now this is an eBook. I plan to release it to Amazon sometime after my dissertation. I find that writing this at the same time is helping me keep level.
To get your own copy of the ebook:
- Clone the repo
- Look in
~/00.book/07.examples.rmd
to find all the datasets used as examples. Download then copy them all to the~/data
folder. - Knit the
~/00.book/00.index.rmd
- Use Edge to "print to PDF".
The recipes presented are intended to be used in R / R Studio. You can use any method to install them, but if you are on Windows you may consider using a Chocolatey script.
Open an admin PowerShell prompt and run the code snippet below.
if('Unrestricted' -ne (Get-ExecutionPolicy)) { Set-ExecutionPolicy Bypass -Scope Process -Force }
iex ((New-Object System.Net.WebClient).DownloadString('https://chocolatey.org/install.ps1'))
refreshenv
choco install r.project -y
choco install r.studio -y