Comparison of Machine Learning Methods in Customer Segment
DOI:
https://doi.org/10.54097/c2s9m434Keywords:
Customer Segmentation, Decision Trees, Random Forest, Privacy, KNN, SVM.Abstract
Customer segmentation plays a key strategy in marketing and business analytics. It assigns customers to different groups based on their common characteristics, which allows the company or organization to regulate their marketing efforts and product offers to meet specific needs of each group. With the development of machine learning, lots of methods are discovered and being used widely. The main purpose of this paper is to introduce the four popular machine learning methods and compare their functions. This paper first introduces the datasets in the field of customer segmentation, then it introduces customer segmentation methods based on machine learning, including Support Vector Machines (SVM), Decision Trees (DT), Random Forest (RF), and K-Nearest Neighbors (KNN). Based on the results, Random Forest offers the best precision which runs up to 89.61%. By introducing and comparing the performance of four different methods in a specific data environment, it will enlighten researchers in the field of customer segmentation.
Downloads
References
Jaime R.S. Fonseca. Why Does Segmentation Matter? Identifying Market Segments Through a Mixed Methodology. ResearchGate, 2011, 25: 1 - 26.
Verdenhofs, A., & Tambov Eva, T. Evolution of Customer Segmentation in the Era of Big Data. Marketing and Management of Innovations, 2019, 1: 238 - 243.
Ebbers F, Zibuschka J, Zimmermann C, et al. User preferences for privacy features in digital assistants. Electronic Markets, 2021, 31: 411 - 426.
Carrie. E-Commerce Data. Kaggle, 2017.
Murphy, Patrick E, and Ben M Enis. Classifying Products Strategically. Sage Journals, 1986, 50 (3): 24 - 42.
Y. B. Cho, S. H. Kim. KA Methodology for Internet Customer Segmentation using Decision Trees. KIIS, 2003, 206 - 213.
Daniel, Fabien. Customer Segmentation. Kaggle, 2019.
Jiang, Lai, and Runming Yao. Modelling Personal Thermal Sensations Using C-Support Vector Classification (C-SVC) Algorithm. ScienceDirect, 2016, 99: 98 - 106.
Larose D T, Larose C D. K‐Nearest Neighbor algorithm. Discovering Knowledge in Data: An Introduction to Data Mining 2014, 1:149 - 164.
Mahapatra, Dwarika Nath. Analyzing Training Information from Random Forests for Improved Image Segmentation. IEEE, 2014, 23 (4) :1504 - 1512.
Yanamadala Ujjwala, Mohana Priya. Iris Species Recognition: An Analysis Using Python and Machine Learning Algorithm. Journal For Basic Sciences, 2022, 22: 721 - 732.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







