-
Notifications
You must be signed in to change notification settings - Fork 1
Scikit learn Models
Shophine edited this page Mar 19, 2021
·
3 revisions
Naive Bayes methods are a set of supervised learning algorithms based on applying Bayes’ theorem with the “naive” assumption of conditional independence between every pair of features given the value of the class variable.
- Separate the dataset into training and testing datasets
- Take the training dataset and separate it by the target values
- Calculate statistical values such as mean, the standard deviation for the dataset
- Summarize the data by class
- Calculate the Gaussian Probability Density Function
- Estimate the class probabilities
Accuracy: 73%
Logistic regression is the appropriate regression analysis to conduct when the dependent variable is dichotomous (binary). Like all regression analyses, logistic regression is a predictive analysis.
- Separate the dataset into training and testing datasets
- Train the classifier
- Predict the values using the test dataset with the classifier
Accuracy: 78%
Random forest is a type of supervised machine learning algorithm based on ensemble learning.
- Pick N random records from the dataset
- Build a decision tree based on N records
- Choose the number of trees you want to build and repeat the above two steps (More the trees, high the prediction accuracy)
Accuracy: 80%