Comparison of Machine Learning Methods in Customer Segment

Authors

  • Lin Cao

DOI:

https://doi.org/10.54097/c2s9m434

Keywords:

Customer Segmentation, Decision Trees, Random Forest, Privacy, KNN, SVM.

Abstract

Customer segmentation plays a key strategy in marketing and business analytics. It assigns customers to different groups based on their common characteristics, which allows the company or organization to regulate their marketing efforts and product offers to meet specific needs of each group. With the development of machine learning, lots of methods are discovered and being used widely. The main purpose of this paper is to introduce the four popular machine learning methods and compare their functions. This paper first introduces the datasets in the field of customer segmentation, then it introduces customer segmentation methods based on machine learning, including Support Vector Machines (SVM), Decision Trees (DT), Random Forest (RF), and K-Nearest Neighbors (KNN). Based on the results, Random Forest offers the best precision which runs up to 89.61%. By introducing and comparing the performance of four different methods in a specific data environment, it will enlighten researchers in the field of customer segmentation.

Downloads

Download data is not yet available.

References

Jaime R.S. Fonseca. Why Does Segmentation Matter? Identifying Market Segments Through a Mixed Methodology. ResearchGate, 2011, 25: 1 - 26.

Verdenhofs, A., & Tambov Eva, T. Evolution of Customer Segmentation in the Era of Big Data. Marketing and Management of Innovations, 2019, 1: 238 - 243.

Ebbers F, Zibuschka J, Zimmermann C, et al. User preferences for privacy features in digital assistants. Electronic Markets, 2021, 31: 411 - 426.

Carrie. E-Commerce Data. Kaggle, 2017.

Murphy, Patrick E, and Ben M Enis. Classifying Products Strategically. Sage Journals, 1986, 50 (3): 24 - 42.

Y. B. Cho, S. H. Kim. KA Methodology for Internet Customer Segmentation using Decision Trees. KIIS, 2003, 206 - 213.

Daniel, Fabien. Customer Segmentation. Kaggle, 2019.

Jiang, Lai, and Runming Yao. Modelling Personal Thermal Sensations Using C-Support Vector Classification (C-SVC) Algorithm. ScienceDirect, 2016, 99: 98 - 106.

Larose D T, Larose C D. K‐Nearest Neighbor algorithm. Discovering Knowledge in Data: An Introduction to Data Mining 2014, 1:149 - 164.

Mahapatra, Dwarika Nath. Analyzing Training Information from Random Forests for Improved Image Segmentation. IEEE, 2014, 23 (4) :1504 - 1512.

Yanamadala Ujjwala, Mohana Priya. Iris Species Recognition: An Analysis Using Python and Machine Learning Algorithm. Journal For Basic Sciences, 2022, 22: 721 - 732.

Downloads

Published

10-04-2024

How to Cite

Cao, L. (2024). Comparison of Machine Learning Methods in Customer Segment. Highlights in Science, Engineering and Technology, 92, 133-137. https://doi.org/10.54097/c2s9m434