Comparative Study of Machine Learning Classification Models on Loan Approval Prediction

Authors

  • Yufan Xie Department of Mathematics and Statistics, Kenyon College, Gambier, United States

DOI:

https://doi.org/10.54097/q4kzq750

Keywords:

Loan Approval Prediction, Machine Learning, Binary Classification, Ensemble Learning, SMOTE.

Abstract

As people’s demands for loans with various purposes have increased drastically over the years, the process for banks to determine whether applications should be approved or not has become increasingly time-consuming and complicated. Therefore, models that can automatically provide banks with initial decisions for reference will significantly improve the efficiency of the process. Reviewing the results provided by the model may also offer an insight into potential biases in their decision-making procedures. The study will focus on determining better models based on evaluations and further manipulations of 6 machine learning methods: Logistic Regression, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Decision Tree, Random Forest, XGBoost, and Light-GBM. After comparisons among different classifiers established through different combinations of methods, such as oversampling and grid search, the study finally identifies two models that are most balanced on multiple metrics: the XGBoost model that undergoes oversampling and grid search, and a voting classifier, i.e., an ensemble of XGBoost, Random Forest, and Light-GBM models. In the comparison, it is found that, particularly when the original data set is imbalanced, the oversampling leads the models to make fewer conservative decisions on whether a loan application should be approved by decreasing the precision score while increasing the recall score.

Downloads

Download data is not yet available.

References

[1] Mamun M A, Farjana A, Mamun M. Predicting bank loan eligibility using machine learning models and comparison analysis. 7th North American International Conference on Industrial Engineering and Operations Management, 2022: 1423–1432.

[2] Sheikh M A, Goel A K, Kumar T. An approach for prediction of loan approval using machine learning algorithm. 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), 2020: 490–494.

[3] Arutjothi G, Senthamarai C. Prediction of loan status in Commercial Bank using machine learning classifier. 2017 International Conference on Intelligent Sustainable Systems (ICISS), 2017: 416–419.

[4] Tumuluru P, Burra L R, Loukya M, et al. Comparative analysis of Customer Loan Approval Prediction using machine learning algorithms. 2022 Second International Conference on Artificial Intelligence and Smart Energy (ICAIS), 2022: 349–353.

[5] Serengil S I, Imece S, Tosun U G, et al. A comparative study of machine learning approaches for non performing loan prediction. 2021 6th International Conference on Computer Science and Engineering (UBMK), 2021: 326–331.

[6] Owusu E, Quainoo R, Mensah S, et al. A deep learning approach for loan default prediction using Imbalanced Dataset. International Journal of Intelligent Information Technologies, 2023, 19(1): 1–16.

[7] Hussein Sayed E, Alabrah A, Hussein Rahouma K, et al. Machine learning and deep learning for loan prediction in banking: Exploring Ensemble Methods and data balancing. IEEE Access, 2024, 12: 193997–194019.

[8] Ta-Wei L. Loan Approval Classification Dataset. Kaggle, 2021.

[9] El Madou K, Marso S, El Kharrim M, et al. Evolutions in machine learning technology for Financial Distress Prediction: A comprehensive review and comparative analysis. Expert Systems, 2023, 41(2).

[10] Orji U E, Ugwuishiwu C H, Nguemaleu J C, et al. Machine learning models for predicting bank loan eligibility. 2022 IEEE Nigeria 4th International Conference on Disruptive Technologies for Sustainable Development (NIGERCON), 2022: 1–5.

[11] Fati S M. Machine learning-based prediction model for loan status approval. J. Hunan Univ. Nat. Sci., 2021, 48(10).

Downloads

Published

15-04-2026

Issue

Section

Articles

How to Cite

Xie, Y. (2026). Comparative Study of Machine Learning Classification Models on Loan Approval Prediction. Journal of Innovation and Development, 15(2), 168-177. https://doi.org/10.54097/q4kzq750