By harnessing the power of predictive modeling techniques, we can unlock insights into pricing dynamics that facilitate informed decision-making for hosts and guests alike, ultimately enhancing the Airbnb experience for all stakeholders.
The dataset used in this project was sourced from Kaggle and comprises 29 features and over 74,000 rows. The dataset encompassed numerical, categorical, text, and boolean features, providing a rich array of information for analysis. https://www.kaggle.com/datasets/stevezhenghp/airbnb-price-prediction/data
β’ Identifying and handling missing values
β’ Handling Outliers
β’ Label Encoding
β’ Correlation Analysis
β’ Wrapper Methods (Recursive Feature Elimination)
β’ Regularization (Lasso)
β’ Linear Regression
β’ Lasso Regression
β’ Ridge Regression
β’ Elastic Net Regression
β’ Support Vector Regression (SVR)
β’ Random Forest
β’ Gradient Boosting
β’ XGBoost (Extreme Gradient Boosting)
β’ LightGBM (Light Gradient Boosting Machine)
β’ CatBoost
β’ K-nearest Neighbors (KNN)
β’ Decision Tree Regression
β’ Bayesian Linear Regression
β’ Pandas
β’ Numpy
β’ Matplotlib
β’ Seaborn
β’ Scipy
β’ Scikit learn
β’ Jupyter Notebook
LIME and SHAP