Effectiveness Evaluation of Big Data Risk Control Models in Consumer Credit: A Comparative Study of Default Prediction Based on XGBoost Algorithm
DOI:
https://doi.org/10.54097/w7r2d931Keywords:
Big Data Risk Control; Consumer Credit; Default Prediction; Machine Learning.Abstract
With the rapid development of financial technology and the continuous expansion of the consumer credit market, the accurate identification and prediction of credit risks have become crucial for financial institutions in risk management. This paper focuses on credit risk prediction in the consumer credit market, constructing an XGBoost model based on 168,000 pieces of data from Internet finance company S, and comparing it with models such as logistic regression, SVM, and Gaussian Naive Bayes. The results show that the XGBoost model achieves an accuracy rate of 97% on both the training and validation sets, takes only 0.12 minutes, and has a true positive rate (TPR) of 86.04%, with overall performance significantly superior to logistic regression, SVM, and Gaussian Naive Bayes models. Variable contribution analysis further reveals that "new users", "Sesame Credit", "number of shutdown days", and "Ant Credit Pay Limit" are key factors reflecting overdue behaviors in consumer credit. Based on these core influencing factors, this paper designs corresponding risk control strategies to balance risk control and business efficiency. The research provides a theoretical basis and practical guidance for financial institutions to optimize risk control models, helping to reduce credit risks and promote the sustainable development of consumer credit business.
Downloads
References
[1] Bhattacharya A, Biswas S K, Mandal A. Credit risk evaluation: a comprehensive study[J]. Multimedia Tools and Applications, 2023, 82(12): 18217-18267.
[2] Garg A, Maheshwari A, Kapoor R, et al. A Comparative Analysis for Predicting Loan Default Risks using Machine Learning Algorithms[J]. Available at SSRN 5065367, 2024.
[3] Taş C. Comparison of Machine Learning and Standard Credit Risk Models' Performances in Credit Risk Scoring of Buy Now Pay Later Customers[D]. Middle East Technical University (Turkey), 2023.
[4] Niu M, Wang Y, Zhang K, et al. Comparison of different individual credit risk assessment models[C]//Second International Conference on Statistics, Applied Mathematics, and Computing Science (CSAMCS 2022). SPIE, 2023, 12597: 664-670.
[5] Uphade D B, Muley A A, Chalwadi S V. Identification of Most Preferable Machine Learning Technique for Prediction of Bank Loan Defaulters[J]. Indian Journal of Science and Technology, 2024, 17(4): 343-351.
[6] Luo Z, Hsu P, Xu N. SME default prediction framework with the effective use of external public credit data[J]. Sustainability, 2020, 12(18): 7575.
[7] Thomas L, Crook J, Edelman D. Credit scoring and its applications[M]. Society for industrial and Applied Mathematics, 2017.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.