Research on Application of Stacking Technique in Telecom Churn Prediction
DOI:
https://doi.org/10.54097/hset.v31i.4811Keywords:
Telecom customs churn, Churn prediction, Data mining, Stacking technique.Abstract
In today's saturated telecommunication market, operators need to fully tap the value of data, strengthen the management of existing users, and reduce customer churns. In order to further to reduce the maintenance cost of enterprises, it is necessary to apply more efficient data mining technology to the prediction of telecommunication user churns in order to improve the operating profit of enterprises. This paper took a telecommunication customer churn data set on Kaggle as the research data, and explores a high accuracy customer churn prediction model. Logistic Regression was exploited to create customer churn prediction models, and the parameters of each model were optimized. On this basis, the Stacking technique was used to fuse the models, and the best combination method is obtained. After research, this paper mainly drew the following conclusions: First, in a single model, the model built with XG Boost algorithm has the highest prediction accuracy, which is 86.20%. Secondly, combining the accuracy rate of training set, cross-validation set and test set, it can be found that XG Boost and Random Forest have the best performance under Stacking fusion algorithm. The accuracy of the above three datasets is 97.61%, 85.84%, 86.51%, respectively, which improves the prediction accuracy of the prediction model.
Downloads
References
Verbeke, W., Martens, D., Mues, C., & Baesens, B. (2011). Building comprehensible customer churn prediction models with advanced rule induction techniques. Expert systems with applications, 38(3), 2354-2364..
Cox, L. A. (2002). Data mining and causal modeling of customer behaviors. Telecommunication Systems, 21(2), 349-381.
Farquad, M. A. H., Ravi, V., & Raju, S. B. (2014). Churn prediction using comprehensible support vector machine: An analytical CRM application. Applied Soft Computing, 19, 31-40.
Wei Hu (2014, May). Customer Churn Analysis in Mold Industry Based on Data Mining.
Huang, Y., Zhu, F., Yuan, M., Deng, K., Li, Y., Ni, B., ... & Zeng, J. (2015, May). Telco churn prediction with big data. In Proceedings of the 2015 ACM SIGMOD international conference on management of data (pp. 607-618).
Wu, Q., Ye, Y., Zhang, H., Ng, M. K., & Ho, S. S. (2014). ForesTexter: an efficient random forest algorithm for imbalanced text categorization. Knowledge-Based Systems, 67, 105-116.
Yang Xiaofeng (2016). Multi-layer Random Forest for Telco Churn Prediction.
Hung, S. Y., Yen, D. C., & Wang, H. Y. (2006). Applying data mining to telecom churn management. Expert Systems with Applications, 31(3), 515-524.
Qi, J., Zhang, Y., Zhang, Y., & Shi, S. (2006, December). TreeLogit model for customer churn prediction. In 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06) (pp. 70-75). IEEE.
Huang, B., Kechadi, M. T., & Buckley, B. (2012). Customer churn prediction in telecommunications. Expert Systems with Applications, 39(1), 1414-1425.
YIN Ting, MA Jun, QIN Xizhong, et al. Bayesian decision tree applying in forecasting customer churn. Computer Engineering and Applications, 2014, 50 (7): 125-128.
Xu Shuqiao (2019). A Research on Bagging of Xgboost Classifiers for Prediction Churn in Telecom
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







