- Project Overview
- Dataset
- Project Structure
- Requirements
- Installation
- Usage
- Analysis and Results
- Contributing
- License
- Author
CustomerSegmentationUsingKMeans is a project that demonstrates customer segmentation using the K-means clustering algorithm. The goal is to group customers of a retail store based on their purchase history and behavior.
The dataset used for this project can be found on Kaggle. It contains the following columns:
CustomerID
: Unique identifier for each customerGender
: Gender of the customerAge
: Age of the customerAnnual Income (k$)
: Annual income of the customer in thousands of dollarsSpending Score (1-100)
: Spending score assigned by the store based on customer behavior
The repository contains the following files:
CustomerSegmentationUsingKMeans.ipynb
: Jupyter Notebook with the complete analysis and K-means clustering implementationMall_Customers.csv
: Dataset used for the analysis
To run the project, you need the following libraries:
- NumPy
- Pandas
- Matplotlib
- Seaborn
- Plotly
- Scikit-learn
-
Clone the repo:
git clone https://github.com/your-username/CustomerSegmentationUsingKMeans.git cd CustomerSegmentationUsingKMeans
-
Install the required Python packages:
pip install numpy pandas matplotlib seaborn plotly scikit-learn
jupyter notebook CustomerSegmentationUsingKMeans.ipynb
The notebook contains the following steps:
- Importing Libraries: Importing necessary libraries for analysis and visualization.
- Data Exploration: Exploring the dataset to understand the distribution and relationships between different variables.
- Data Preprocessing: Preparing the data for clustering by scaling the features.
- K-means Clustering: Implementing K-means clustering to group customers into segments.
- Visualization: Visualizing the clusters to interpret the results.
Contributions are welcome! Please open an issue or submit a pull request for any improvements or bug fixes.
This project is licensed under the MIT License - see the LICENSE file for details.
For any questions or suggestions, please contact:
- Harsh Singh: [email protected]
- GitHub: harshjuly12