This project involves a comprehensive analysis of e-commerce sales and customer data to uncover various trends and insights. Using SQL for querying and Python for visualization, the project aims to explore a range of queries from basic to advanced, providing a detailed understanding of sales performance, customer behavior, and revenue distribution. The analysis utilizes libraries such as Matplotlib, Seaborn, and NumPy for visualizing results and deriving actionable insights.
- Unique Cities: Identify all unique cities where customers are located.
- Orders Count (2017): Count the number of orders placed in 2017.
- Total Sales per Category: Calculate total sales for each product category.
- Installment Payments Percentage: Determine the percentage of orders paid in installments.
- Customer Count by State: Count the number of customers from each state.
- Orders per Month (2018): Calculate the number of orders per month for the year 2018.
- Average Products per Order: Find the average number of products per order, grouped by customer city.
- Revenue by Product Category: Calculate the percentage of total revenue contributed by each product category.
- Price and Purchase Correlation: Identify the correlation between product price and the number of times a product has been purchased.
- Revenue by Seller: Calculate and rank total revenue generated by each seller.
- Moving Average of Order Values: Calculate the moving average of order values for each customer over their order history.
- Cumulative Sales per Month: Calculate cumulative sales per month for each year.
- Year-over-Year Sales Growth: Determine the year-over-year growth rate of total sales.
- Customer Retention Rate: Calculate the retention rate of customers who make another purchase within 6 months of their first purchase.
- Top Spending Customers: Identify the top 3 customers who spent the most money in each year.
- Customer Locations: Understand the geographical distribution of customers and their impact on sales.
- Order Trends: Analyze how order volumes and sales vary by time period and product category.
- Revenue Distribution: Gain insights into which categories and sellers contribute most to total revenue.
- Customer Behavior: Explore customer purchasing patterns and retention rates.
- Sales Growth: Track sales growth over time to identify trends and forecast future performance.
- Setup: Ensure you have the necessary Python packages and database connection set up.
- Run Queries: Execute the SQL queries to extract data from the database.
- Analyze Data: Use Python scripts to process and visualize the data.
- Explore Visualizations: Review the visualizations generated to gain insights into sales and customer behavior.
The dataset for this analysis is provided in a text file. Please refer to the attached text file for the dataset link and additional details on accessing the data.
This project provides a detailed approach to analyzing e-commerce data, offering valuable insights into performance trends, revenue distribution, and customer behavior. By leveraging SQL and Python, users can explore a variety of queries and visualizations to better understand their data.
Aishwarya S Patil - [email protected]