Customer Segmentation Based on Machine Learning Methods

Authors

  • Zhiyue Wang

DOI:

https://doi.org/10.54097/g70xqb16

Keywords:

Customer Segmentation, Machine Learning, Logistic Regression.

Abstract

In recent years, as times change, consumer behavior continues to change rapidly, and their preferences and consumer attitudes change with age and experience. In the generalization of the mass market, it is difficult to identify the needs and desires of customers through various promotional tools. Therefore, customer segmentation can be an option for marketers to offer preferential goods or services to customers. Segmentation can help the company to quickly identify the preferences of the customers and provide them with the desired goods. However, there are significant differences between customers, making it difficult for merchants to segment customers through simple attribute filtering. Fortunately, with the development of machine learning, machine learning-based customer segmentation methods have received a lot of attention from researchers. However, different machine learning methods have different characteristics and there are some differences in commercial applications. Therefore, this paper analyses the principles and performance of the algorithms to provide reference for researchers in related fields. Firstly, this paper introduces several common machines learning methods, including Logistic Regression, Decision Tree, Random Forest and AdaBoost, and then compares the effectiveness of these algorithms through experiments. Finally, this paper looks forward to future research directions.

Downloads

Download data is not yet available.

References

Hajiha A, Radfar R, Malayeri S S. Data mining application for customer segmentation based on loyalty: An iranian food industry case study. 2011 IEEE International Conference on Industrial Engineering and Engineering Management. IEEE, 2011: 504 - 508.

Ling R, Yen D C. Customer relationship management: An analysis framework and implementation strategies. Journal of computer information systems, 2001, 41 (3): 82 - 97.

Hassan M, Tabasum M. Customer profiling and segmentation in retail banks using data mining techniques. International journal of advanced research in computer science, 2018, 9 (4): 24 - 29.

Smith W R. Product differentiation and market segmentation as alternative marketing strategies. Journal of marketing, 1956, 21 (1): 3 - 8.

Das S, Nayak J. Customer segmentation via data mining techniques: state-of-the-art review. Computational Intelligence in Data Mining: Proceedings of ICCIDM 2021, 2022: 489 - 507.

Berson A, Smith S J, Thearling K. Building Data Mining Applications for CRM McGraw-Hill. 1999.Fangfang. Research on power load forecasting based on Improved BP neural network. Harbin Institute of Technology, 2011.

Online Retail. (2015). UCI Machine Learning Repository. https://doi.org/10.24432/C5BW33.

Hasnain M, Pasha M F, Ghani I, et al. Evaluating trust prediction and confusion matrix measures for web services ranking. IEEE Access, 2020, 8: 90847 - 90861.

Buya S, Tongkumchum P, Owusu B E. Modelling of land-use change in Thailand using binary logistic regression and multinomial logistic regression. Arabian Journal of Geosciences, 2020, 13: 1 - 12.

Zuccarelli, E. Performance metrics in machine learning - part 1: Classification, Medium. Available at: https://towardsdatascience.com/performance-metrics-in-machine-learning-part-1-classification-6c6b8d8a8c92, 2021.

Hu Z, Lo C P. Modeling urban growth in Atlanta using logistic regression. Computers, environment and urban systems, 2007, 31 (6): 667 - 688.

Buya S, Tongkumchum P, Owusu B E. Modelling of land-use change in Thailand using binary logistic regression and multinomial logistic regression. Arabian Journal of Geosciences, 2020, 13: 1 - 12.

Farid D M, Zhang L, Rahman C M, et al. Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks. Expert systems with applications, 2014, 41 (4): 1937 - 1946.

Martín M, Macías J A. A supporting tool for enhancing user’s mental model elicitation and decision-making in user experience research. International Journal of Human–Computer Interaction, 2023, 39 (1): 183 - 202.

Dimitriadis S I, Liparas D, Alzheimer's Disease Neuroimaging Initiative. How random is the random forest? Random forest algorithm on the service of structural imaging biomarkers for Alzheimer's disease: from Alzheimer's disease neuroimaging initiative (ADNI) database. Neural regeneration research, 2018, 13 (6): 962.

Caruana R, Niculescu-Mizil A. An empirical comparison of supervised learning algorithms Proceedings of the 23rd international conference on Machine learning. 2006: 161 - 168.

Shahraki A, Abbasi M, Haugen Ø. Boosting algorithms for network intrusion detection: A comparative evaluation of Real AdaBoost, Gentle AdaBoost and Modest AdaBoost. Engineering Applications of Artificial Intelligence, 2020, 94: 103770.

Downloads

Published

10-04-2024

How to Cite

Wang, Z. (2024). Customer Segmentation Based on Machine Learning Methods. Highlights in Science, Engineering and Technology, 92, 126-132. https://doi.org/10.54097/g70xqb16