This is a data analysis project in which I built a model that predicts whether a car-related request on Divar is valid or not. The project was prepared for the Divar competition organized by Amirkabir University of Technology.
Because of the volume of data, I used the PySpark library for preprocessing. For the model, I used the gradient boosted tree algorithm via the CatBoost library (I found it to be more accurate than XGBoost).
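
To illustrate the general idea behind gradient boosted trees, here is a minimal sketch in plain Python. It is not the project's code and it is far simpler than what CatBoost does internally. Each round fits a one-split tree (a stump) to the residuals of the current prediction under log-loss. The feature columns, the toy rows, and all function names below are hypothetical and exist only for this example.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_stump(X, residuals):
    """Pick the (feature, threshold) split with the lowest squared error on the residuals."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            if not left or not right:
                continue  # a split must put rows on both sides
            lm = sum(left) / len(left)
            rm = sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, j, t, lm, rm)
    if best is None:
        raise ValueError("no valid split: every feature is constant")
    return best[1:]  # (feature index, threshold, left value, right value)

def stump_predict(stump, row):
    j, t, left_value, right_value = stump
    return left_value if row[j] <= t else right_value

def fit_gbt(X, y, n_rounds=50, lr=0.3):
    """Gradient boosting for binary labels; assumes both classes appear in y."""
    p0 = sum(y) / len(y)
    base = math.log(p0 / (1.0 - p0))  # initial score: log-odds of the positive class
    scores = [base] * len(X)
    stumps = []
    for _ in range(n_rounds):
        # Negative gradient of log-loss with respect to the score: y - p
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        stump = fit_stump(X, residuals)
        stumps.append(stump)
        scores = [s + lr * stump_predict(stump, row) for s, row in zip(scores, X)]
    return base, lr, stumps

def predict_proba(model, row):
    base, lr, stumps = model
    return sigmoid(base + lr * sum(stump_predict(s, row) for s in stumps))

# Toy rows (made up for illustration): [price, mileage]; label 1 = valid request.
X = [[5, 200], [300, 50], [450, 20], [2, 10], [250, 80], [1, 300], [500, 5], [3, 150]]
y = [0, 1, 1, 0, 1, 0, 1, 0]

model = fit_gbt(X, y)
print(predict_proba(model, [400, 30]))  # a row resembling the valid examples
print(predict_proba(model, [4, 100]))   # a row resembling the invalid examples
```

A library such as CatBoost builds deeper trees, handles categorical features directly, and adds regularization, so this sketch only shows the boosting loop itself.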