This project focuses on customer segmentation using K-Means clustering, a popular machine learning algorithm. The data used for this analysis comes from a dataset of mall customers, including features like Age, Gender, Annual Income, and Spending Score. The goal is to identify distinct groups within the customer base to tailor marketing strategies effectively.
The dataset consists of 200 entries with the following columns: CustomerID, Gender, Age, Annual Income (k$), and Spending Score (1-100). The initial exploration includes checking for null values, duplicate entries, and basic statistical summaries of the data.
Several plots were created to understand the distribution and relationships between the features:
The first segmentation is based on Age and Spending Score. The Elbow Method is used to determine the optimal number of clusters.
The second segmentation uses Annual Income and Spending Score. The Elbow Method is again applied to determine the optimal number of clusters.
The final segmentation uses all three features: Age, Annual Income, and Spending Score. A 3D scatter plot is created to visualize the clusters.
This project successfully demonstrated the use of K-Means clustering for customer segmentation. By analyzing Age, Annual Income, and Spending Score, distinct customer groups were identified. These insights can be used to develop targeted marketing strategies and improve customer satisfaction.
27 May 2024
Trusted by 48600+ Generalists. Try it now, free to use
Start making more money