An Application of Logistic Regression in Bank Lending Prediction: A Machine Learning Perspective
DOI:
https://doi.org/10.54097/20qcm204Keywords:
Logistic Regression; Credit Scoring; Bank Lending; Risk Assessment; Machine Learning.Abstract
Logistic regression has been widely applied in bank lending prediction due to its simplicity, efficiency, and interpretability. This paper provides a review of the theoretical foundations, practical applications, and recent advancements of logistic regression in this domain. Through case studies and literature analysis, the author focuses on the challenges faced by logistic regression in the modeling process, such as data imbalance and nonlinear relationships, as well as optimization strategies like regularization and ensemble learning. Despite the proliferation of machine learning methods, logistic regression remains the preferred choice and benchmark for credit risk modeling, thanks to its robust predictive performance and regulatory friendliness. Looking forward, the author argues that the integration of logistic regression with other machine learning techniques, coupled with domain knowledge, will further enhance its applicability and predictive power. However, striking a balance between model complexity and interpretability, while improving model robustness and extrapolation, remains an important area for future research. This paper offers new perspectives and insights for credit risk management practices in banks and other financial institutions.
Downloads
References
[1] Sperandei, S. Understanding logistic regression analysis. Biochemia medica, 2014, 24(1): 12-18. DOI: https://doi.org/10.11613/BM.2014.003
[2] Bracke, P., Datta, A., Jung, C., & Sen, S. Machine learning explainability in finance: an application to default risk analysis. Bank of England Staff Working Paper, (2019). DOI: https://doi.org/10.2139/ssrn.3435104
[3] Crook, J. N., Edelman, D. B., & Thomas, L. C. Recent developments in consumer credit risk assessment. European Journal of Operational Research, 2007, 183(3): 1447-1465. DOI: https://doi.org/10.1016/j.ejor.2006.09.100
[4] Lessmann, S., Baesens, B., Seow, H. V., & Thomas, L. C. Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. European Journal of Operational Research, 2015, 247(1), 124-136. DOI: https://doi.org/10.1016/j.ejor.2015.05.030
[5] Siddiqi, N. Credit risk scorecards: developing and implementing intelligent credit scoring. John Wiley & Sons, 2012. DOI: https://doi.org/10.1002/9781119201731
[6] Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 2002, 16, 321-357. DOI: https://doi.org/10.1613/jair.953
[7] Kalemo, K., & Ketcha, N. Loan default prediction using logistic regression. Journal of Banking and Financial Economics, 2019, 2(12), 66-81.
[8] Edelberg, W. Risk-based pricing of interest rates for consumer loans. Journal of Monetary Economics, 2006, 53(8), 2283-2298. DOI: https://doi.org/10.1016/j.jmoneco.2005.09.001
[9] Ciampi, F. Corporate governance characteristics and default prediction modeling for small enterprises. An empirical analysis of Italian firms. Journal of Business Research, 2015, 68(5), 1012-1025. DOI: https://doi.org/10.1016/j.jbusres.2014.10.003
[10] Mileris, R. Macroeconomic determinants of loan portfolio credit risk in banks. Engineering Economics, 2012, 23(5), 496-504. DOI: https://doi.org/10.5755/j01.ee.23.5.1890
[11] Zakrzewska, D., & Murlewski, J. Clustering algorithms for bank customer segmentation. In 5th International Conference on Intelligent Systems Design and Applications (ISDA'05), 2005. DOI: https://doi.org/10.1109/ISDA.2005.33
[12] Bhattacharyya, S., Jha, S., Tharakunnel, K., & Westland, J. C. Data mining for credit card fraud: A comparative study. Decision Support Systems, 2011, 50(3), 602-613. DOI: https://doi.org/10.1016/j.dss.2010.08.008
[13] Bellotti, T., & Crook, J. Support vector machines for credit scoring and discovery of significant features. Expert Systems with Applications, 2009, 36(2), 3302-3308. DOI: https://doi.org/10.1016/j.eswa.2008.01.005
[14] Yao, X., Crook, J., & Andreeva, G. Support vector regression for loss given default modelling. European Journal of Operational Research, 2015, 240(2), 528-538. DOI: https://doi.org/10.1016/j.ejor.2014.06.043
[15] Alaraj, M., Abbod, M., & Hunaiti, Z. Evaluating consumer loans using neural networks ensembles. In International Conference on Machine Learning and Computing, 2019.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.






