Skip to content

This repository has the machine learning projects which were built using the python language

Notifications You must be signed in to change notification settings

drdataSpp/ml_python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Machine Learning Projects Using Python by Soorya Parthiban.

Project 1: Adult Income Classification

Links

Adult Income Classification Jupyter Notebook Link
Adult Income Dataset Link

Project Description

Prediction task is to determine whether a person makes over 50K a year.

How did I solve this problem?

  1. I started with exploring the dataset by performing statistical analysis.
  2. After performing statistical analysis, I found the null values and filled the null values using the respective column's mode values.
  3. After filling the null values, I performed data visualization to understand the relation between the dependent variable and outcome variable.
  4. After performing data visualization, I pre-processed the dataset where I converted all the categorical values to numerical values.
  5. Once the dataset was pre-processed, I split the 70% of data into the training set and 30% of data into testing set.
  6. A Logistic Regression model was trainied using the training data and evaluated using the testing data.
  7. To reduce the curse of dimenstionality, using Principle Component Analysis method I created another train and test set with lesser columns or features.
  8. With the above PCA train and test set, a Logistic Regression and Decision Tree model was trained and evaluated.

Conclusion

Both, Logistic Regression and Decision Tree model performed well on the test set and got an accuracy of 82%.

Project 2: Website Phishing Classification

Links

Website Phishing Classification Jupyter Notebook Link
Website Phishing Dataset Link

Project Description

Prediction task is to determine whether a website is a phishing one.

Project 3: Heart Disease Classification

Links

Heart Disease Classification Jupyter Notebook Link
Heart Disease Dataset Link

Project Description

Prediction task is to determine whether a person has got heart disease.

Project 4: Predicting Medical Charges Expenses

Links

Medical Charges Predictions Jupyter Notebook Link
Medical Charges Dataset Link

Project Description

Prediction task is to determine the medical charges.

Project 5: Classifying edible and poisonous mushrooms

Links

Mushroom Classification Jupyter Notebook Link
Mushroom Classification Dataset Link

Project Description

Prediction task is to determine whether a mushroom is an edible one.

Project 6: Bank Marketing Campaign Classification

Links

Bank Marketing Classification Jupyter Notebook Link
Bank Marketing Dataset Link

Project Description

Prediction task is to determine whether a client will subscribe (yes/no) a term deposit.

Project 7: Microorganism Classification

Links

Microorganism Classification Jupyter Notebook Link
Microorganism Classification Dataset Link

Project Description

Prediction task is to determine the class of given microorganism on the basis of their morphological features.

Project 8: Predicting Breast Cancer

Links

Breast Cancer Classification Jupyter Notebook Link
Breast Cancer Classification Dataset Link

Project Description

Prediction task is to determine whether a mushroom .

About

This repository has the machine learning projects which were built using the python language

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published