DS Salary Prediction

This repository contains a salary prediction project for data science jobs listed on Glassdoor.
First, we scraped data science job postings from Glassdoor.
Then we cleaned the data and carried out exploratory data analysis (EDA) and feature engineering from several perspectives, looking in detail at jobs that mention Python, Excel, AWS, and Spark.
For model building, we used Linear, Lasso, and Random Forest regressors with GridSearchCV.

Resources We Used

We analyzed and cleaned the dataset so that it is usable for our model.
In the EDA part, we simplified the data and examined the value counts of different variables through graphs and pivot tables.
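
As a rough illustration of the value counts and pivot-table summaries mentioned above, here is a minimal sketch using only the Python standard library. The records, the field names (`job_title`, `state`, `avg_salary`), and the helper functions are hypothetical and are not taken from the notebooks in this repository.

```python
from collections import Counter, defaultdict
from statistics import mean

# Hypothetical records; field names and values are illustrative only.
jobs = [
    {"job_title": "data scientist", "state": "CA", "avg_salary": 120.0},
    {"job_title": "data scientist", "state": "NY", "avg_salary": 110.0},
    {"job_title": "data engineer", "state": "CA", "avg_salary": 105.0},
    {"job_title": "analyst", "state": "NY", "avg_salary": 70.0},
    {"job_title": "data scientist", "state": "CA", "avg_salary": 130.0},
]


def value_counts(rows, column):
    """Count how often each value of `column` occurs, most frequent first."""
    return Counter(row[column] for row in rows).most_common()


def pivot_mean(rows, index, values):
    """Average the `values` field within each group of the `index` field."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[index]].append(row[values])
    return {key: mean(vals) for key, vals in groups.items()}


title_counts = value_counts(jobs, "job_title")
salary_by_title = pivot_mean(jobs, "job_title", "avg_salary")

print(title_counts)
print(salary_by_title)
```

With pandas, the same summaries would typically come from `Series.value_counts()` and `pandas.pivot_table`.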

We transformed the categorical variables into dummy variables.
We evaluated several different models.
The Random Forest model performed better than the other approaches on the test set.
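
To make the dummy-variable step and the test-set comparison concrete, the sketch below one-hot encodes a categorical column and compares two sets of predictions by mean absolute error. It is an assumption-laden illustration: the column name `sector`, the helper names, the choice of mean absolute error as the metric, and all numbers are made up and do not reproduce this project's data or results.

```python
def one_hot(rows, column):
    """Replace a categorical column with one 0/1 dummy column per category."""
    categories = sorted({row[column] for row in rows})
    encoded = []
    for row in rows:
        # Keep every other field unchanged.
        new_row = {key: value for key, value in row.items() if key != column}
        for category in categories:
            new_row[f"{column}_{category}"] = 1 if row[column] == category else 0
        encoded.append(new_row)
    return encoded


def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical postings: `sector` is categorical, `python` is already 0/1.
postings = [
    {"sector": "tech", "python": 1},
    {"sector": "finance", "python": 0},
]
encoded = one_hot(postings, "sector")
print(encoded)

# Made-up test-set salaries and predictions from two hypothetical candidates.
actual = [100.0, 120.0, 90.0]
candidate_a = [110.0, 115.0, 95.0]
candidate_b = [101.0, 118.0, 92.0]

mae_a = mean_absolute_error(actual, candidate_a)
mae_b = mean_absolute_error(actual, candidate_b)
print(mae_a, mae_b)  # the candidate with the lower error fits the test set better
```

In a pandas workflow, `pandas.get_dummies` performs the same kind of encoding on a DataFrame.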

Here we built a Flask API, hosted on a local server, following the tutorial referenced above.
This API takes a list of values from a job posting and predicts the salary.
To send a request to the Flask application, install all the required dependencies and run app.py. Then, in another terminal, run request.py to get the results.
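
A client script along these lines could send the list of values to the locally hosted API. This is only a sketch under stated assumptions: the URL `http://127.0.0.1:5000/predict`, the JSON key `input`, and the function names are hypothetical, and the actual request.py in the Flask folder may differ.

```python
import json
import urllib.request

# Assumed endpoint of the locally running Flask app (hypothetical).
API_URL = "http://127.0.0.1:5000/predict"


def build_request(features, url=API_URL):
    """Build a JSON POST request carrying one job's feature values."""
    # The "input" key is an assumption about what the API expects.
    body = json.dumps({"input": features}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def predict_salary(features, url=API_URL):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(build_request(features, url)) as response:
        return json.loads(response.read().decode("utf-8"))


# Illustrative feature values only; the real model expects its own column order.
example_features = [3.5, 1, 0, 1, 0]
request = build_request(example_features)
print(request.get_method(), request.full_url)
```

Calling `predict_salary(example_features)` only works while app.py is running in another terminal.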
