Analysis and Prediction of Telecom Customer Churn based on Machine Learning
DOI:
https://doi.org/10.54097/hset.v16i.2495Keywords:
Machine learning; customer churn prediction; Spearman single factor analysis; random forest feature importance score.Abstract
As the telecommunications market becomes increasingly saturated, major operators are facing an increasingly severe problem of soaring customer churn rates. How to identify high-risk churn customers is the most concerned issue for operators. Thanks to the rapid development of pattern recognition technology, existing machine learning algorithms provide key technical support for telecom customer churn prediction. However, how to choose an appropriate forecasting method combined with the characteristics of the application data is still an open question. To this end, based on the analysis and comparison of the feature correlation between telecom customer data and churn, this paper compares the differences in the prediction results of different machine algorithms, so as to choose the method that best fits the characteristics of the application data to build the final customer churn prediction model. Specifically, the Spearman correlation coefficient is used to calculate the correlation between variables in the dataset, the random forest algorithm is used to score the importance of all variables, and the prediction generated by the gradient boosting tree algorithm is introduced. Finally, the gradient boosting tree algorithm is evaluated by five performance indicators: precision rate, recall rate, precision rate, F1 score and AUC (Area under the ROC curve).
Downloads
References
Cheng Qiyun, Sun Caixin, Zhang Xiaoxing, et al. Short-Term load forecasting model and method for power system based on complementation of neural network and fuzzy logic. Transactions of China Electrotechnical Society, 2004, 19(10): 53-58.
Fangfang. Research on power load forecasting based on Improved BP neural network. Harbin Institute of Technology, 2011.
Amjady N. Short-term hourly load forecasting using time series modeling with peak load estimation capability. IEEE Transactions on Power Systems, 2001, 16(4): 798-805.
Ma Kunlong. Short term distributed load forecasting method based on big data. Changsha: Hunan University, 2014.
SHI Biao, LI Yu Xia, YU Xhua, YAN Wang. Short-term load forecasting based on modified particle swarm optimizer and fuzzy neural network model. Systems Engineering-Theory and Practice, 2010, 30(1): 158-160.
Fangfang. Research on power load forecasting based on Improved BP neural network. Harbin Institute of Technology, 2011.
Amjady N. Short-term hourly load forecasting using time series modeling with peak load estimation capability. IEEE Transactions on Power Systems, 2001, 16(4): 798-805.
Ma Kunlong. Short term distributed load forecasting method based on big data. Changsha: Hunan University, 2014.
SHI Biao, LI Yu Xia, YU Xhua, YAN Wang. Short-term load forecasting based on modified particle swarm optimizer and fuzzy neural network model. Systems Engineering-Theory and Practice, 2010, 30(1): 158-160.
Fangfang. Research on power load forecasting based on Improved BP neural network. Harbin Institute of Technology, 2011.
Amjady N. Short-term hourly load forecasting using time series modeling with peak load estimation capability. IEEE Transactions on Power Systems, 2001, 16(4): 798-805.
Ma Kunlong. Short term distributed load forecasting method based on big data. Changsha: Hunan University, 2014.
SHI Biao, LI Yu Xia, YU Xhua, YAN Wang. Short-term load forecasting based on modified particle swarm optimizer and fuzzy neural network model. Systems Engineering-Theory and Practice, 2010, 30(1): 158-160.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







