This project involves an in-depth analysis of customer data for an e-commerce platform. The dataset includes customer demographics, spending habits, and responses to marketing campaigns. The primary goal is to extract actionable insights to improve business performance, understand customer behavior, and evaluate marketing campaign effectiveness.
Dataset Source: Kaggle E-commerce Dataset
The Nxtivia E-commerce Project aims to provide a comprehensive analysis of customer data to understand spending habits, demographics, and responses to marketing campaigns. This analysis will help in identifying key trends, enhancing customer engagement, and optimizing marketing strategies to boost overall business performance.
The Nxtivia E-commerce Project aims to analyze a comprehensive dataset derived from an e-commerce platform to uncover valuable insights into customer behavior and spending patterns. By leveraging this dataset, the project seeks to identify trends, evaluate the effectiveness of marketing campaigns, and propose data-driven strategies to enhance customer engagement and business performance. This analysis is crucial for businesses aiming to optimize their operations and tailor their offerings to meet the evolving needs of their customers.
E-commerce has become a significant part of the retail industry, with businesses constantly seeking ways to understand their customers better and improve their marketing efforts. This project focuses on analyzing customer data to gain insights that can drive strategic decisions and improve overall business outcomes.
The scope of this project includes:
- Analyzing customer demographics to understand the target audience.
- Examining spending habits to identify key product categories and customer preferences.
- Evaluating responses to marketing campaigns to determine their effectiveness.
- Providing actionable recommendations to enhance business performance.
- Import and prepare the dataset from Kaggle.
- Create a database named "Ecom" and load the dataset into MySQL Workbench.
- Perform detailed analysis on various aspects:
- Customer Demographics
- Income and Household Information
- Purchase Behavior
- Campaign Responses
- Online and Offline Purchases
- Customer Complaints and Engagement
- Discounts and Deals
- Campaign Effectiveness
- Web Engagement and Purchases
- Overall Spending Patterns
The organization is an innovative e-commerce platform dedicated to providing a seamless shopping experience for its customers. The company leverages advanced data analytics to understand customer behavior, optimize marketing efforts, and enhance overall business performance. It's commitment to data-driven decision-making ensures that it stays ahead of the competition in a rapidly evolving market.
The dataset for this project was collected from Kaggle, which provides a comprehensive collection of customer data, including demographics, spending habits, and marketing responses. The data was preprocessed to ensure accuracy and consistency before being loaded into the MySQL Workbench database for analysis.
Below is a text-based flow chart representing different types of analytics:
+-----------------------+
| Descriptive |
| Analytics |
+----------+------------+
|
v
+-----------------------+
| Diagnostic |
| Analytics |
+----------+------------+
|
v
+-----------------------+
| Predictive |
| Analytics |
+----------+------------+
|
v
+-----------------------+
| Prescriptive |
| Analytics |
+-----------------------+
- Descriptive Analytics: What happened?
- Diagnostic Analytics: Why did it happen?
- Predictive Analytics: What will happen?
- Prescriptive Analytics: What should we do?
-
Average age of customers:
SELECT AVG(YEAR(CURDATE()) - Year_Birth) AS AverageAge FROM customer_data;
-
Distribution of education levels:
SELECT Education, COUNT(*) AS Count FROM customer_data GROUP BY Education;
-
Marital status distribution:
SELECT Marital_Status, COUNT(*) AS Count FROM customer_data GROUP BY Marital_Status;
-
Average household income:
SELECT AVG(Income) AS Average_Household_Income FROM customer_data;
-
Customers with children and teenagers:
SELECT Kidhome + Teenhome AS Children_Teenagers, COUNT(*) AS Count FROM customer_data GROUP BY Children_Teenagers;
-
Average days since last purchase:
SELECT AVG(Recency) AS Average_Days_Since_Last_Purchase FROM customer_data;
-
Average spending on product categories:
SELECT AVG(MntWines) AS Avg_Wine_Spending, AVG(MntFruits) AS Avg_Fruit_Spending, AVG(MntMeatProducts) AS Avg_Meat_Spending, AVG(MntFishProducts) AS Avg_Fish_Spending, AVG(MntSweetProducts) AS Avg_Sweet_Spending, AVG(MntGoldProds) AS Avg_Gold_Spending FROM customer_data;
-
Accepted campaign offers:
SELECT SUM(AcceptedCmp1) AS Accepted_Cmp1, SUM(AcceptedCmp2) AS Accepted_Cmp2, SUM(AcceptedCmp3) AS Accepted_Cmp3, SUM(AcceptedCmp4) AS Accepted_Cmp4, SUM(AcceptedCmp5) AS Accepted_Cmp5 FROM customer_data;
-
Overall response rate for the last campaign:
SELECT (SUM(Response) / COUNT(*)) * 100 AS Overall_Response_Rate FROM customer_data;
-
Purchases through different channels:
SELECT SUM(NumWebPurchases) AS Total_Web_Purchases, SUM(NumCatalogPurchases) AS Total_Catalog_Purchases, SUM(NumStorePurchases) AS Total_Store_Purchases FROM customer_data;
-
Average web visits per month:
SELECT AVG(NumWebVisitsMonth) AS Avg_Web_Visits_Per_Month FROM customer_data;
-
Customer complaints in the last 2 years:
SELECT COUNT(*) AS Complaints_Last_2_Years FROM customer_data WHERE Complain = 1;
-
Overall customer engagement rate:
SELECT (SUM(AcceptedCmp1) + SUM(AcceptedCmp2) + SUM(AcceptedCmp3) + SUM(AcceptedCmp4) + SUM(AcceptedCmp5) + SUM(Response)) / COUNT(*) * 100 AS Overall_Engagement_Rate FROM customer_data;
-
Deals made with discounts:
SELECT SUM(NumDealsPurchases) AS Total_Deals_With_Discounts FROM customer_data;
-
Average deals and purchases:
SELECT AVG(NumDealsPurchases) AS Avg_Deals_With_Discounts, AVG(NumWebPurchases + NumCatalogPurchases + NumStorePurchases) AS Avg_Total_Purchases FROM customer_data;
-
Campaign conversion rates:
SELECT (SUM(AcceptedCmp1) * 100 / COUNT(*)) AS ConvRate_Cmp1, (SUM(AcceptedCmp2) * 100 / COUNT(*)) AS ConvRate_Cmp2, (SUM(AcceptedCmp3) * 100 / COUNT(*)) AS ConvRate_Cmp3, (SUM(AcceptedCmp4) * 100 / COUNT(*)) AS ConvRate_Cmp4, (SUM(AcceptedCmp5) * 100 / COUNT(*)) AS ConvRate_Cmp5, (SUM(Response) * 100 / COUNT(*)) AS ConvRate_Last_Cmp FROM customer_data;
-
Campaign with the highest acceptance rate:
SELECT GREATEST( (SUM(AcceptedCmp1) * 100 / COUNT(*)), (SUM(AcceptedCmp2) * 100 / COUNT(*)), (SUM(AcceptedCmp3) * 100 / COUNT(*)), (SUM(AcceptedCmp4) * 100 / COUNT(*)), (SUM(AcceptedCmp5) * 100 / COUNT(*)), (SUM(Response) * 100 / COUNT(*)) ) AS Highest_Acceptance_Rate FROM customer_data;
- Correlation between web visits and web purchases:
WITH AvgValues AS ( SELECT AVG(NumWebVisitsMonth) AS AvgWebVisits, AVG(NumWebPurchases) AS AvgWebPurchases FROM customer_data ), StdDevs AS ( SELECT SQRT(SUM(POW(NumWebVisitsMonth - AvgWebVisits, 2)) / COUNT(*)) AS StdDevWebVisits, SQRT(SUM(POW(NumWebPurchases - AvgWebPurchases, 2)) / COUNT(*)) AS StdDevWebPurchases FROM customer_data, AvgValues ), Covariance AS ( SELECT SUM((NumWebVisitsMonth - AvgWebVisits) * (NumWebPurchases - AvgWebPurchases)) / (COUNT(*) - 1) AS Covar FROM customer_data, AvgValues ) SELECT Covar / (StdDevWebVisits * StdDevWebPurchases) AS Correlation_WebVisits_WebPurchases FROM Covariance, StdDevs;
- Distribution of spending across product categories:
SELECT SUM(MntWines) AS Total_Wines, SUM(MntFruits) AS Total_Fruits, SUM(MntMeatProducts) AS Total_Meat_Products, SUM(MntFishProducts) AS Total_Fish_Products, SUM(MntSweetProducts) AS Total_Sweet_Products, SUM(MntGoldProds) AS Total_Gold_Products FROM customer_data;
This project provided valuable insights into customer behavior and spending patterns. Key outcomes include:
- Identification of high-value customer segments.
- Evaluation of marketing campaign effectiveness.
- Recommendations for improving customer engagement and retention.
- Strategies for optimizing product offerings based on customer preferences.
The analysis highlights the importance of data-driven decision-making in e-commerce and provides actionable recommendations to enhance business performance.
- "Data Science for Business" by Foster Provost and Tom Fawcett
- "Marketing Analytics: Data-Driven Techniques with Microsoft Excel" by Wayne L. Winston
- Annual Customer Engagement Report
- Internal Marketing Campaign Analysis Reports