Enhancing Bank Loan Approval Efficiency Using Machine Learning: An Ensemble Model Approach
Downloads
Lending is a major source of income for banks, but identifying worthy borrowers who will consistently repay loans is a constant problem. From a pool of loan applicants, conventional selection procedures frequently fail to find the most qualified individuals. To make loan applications faster, we created a new system that uses machine learning to automatically find people who qualify for loans. This comprehensive analysis involves data preprocessing, effective data balancing using SMOTE, and the application of various machine learning models, including Decision Trees, Support Vector Machines, K-Nearest Neighbors, Gaussian Naive Bayes, AdaBoost, Gradient Boosting, Logistic Regression, and advanced deep learning models like recurrent neural networks, deep neural networks, and long short-term memory models. We thoroughly evaluate the models based on accuracy, recall, and F1 score. Our experimental results demonstrate that the Extra Trees model outperforms its counterparts. Furthermore, we achieve a significant 0.62% increase in accuracy over the Extra Trees model by using an ensemble voting model that combines the top three machine learning models to predict bank loan defaulters. An intuitive desktop application has been developed to enhance user engagement. Remarkably, our findings indicate that the voting-based ensemble model surpasses both current state-of-the-art methods and individual ML models, including Extra Trees, with an impressive accuracy of 87.26%. Ultimately, this innovative system promises substantial improvements and efficiency in bank loan approval processes, benefiting both financial institutions and loan applicants.
Dansana, D., Patro, S.G.K., Mishra, B.K., Prasad, V., Razak, A. and Wodajo, A.W., 2024. Analyzing the impact of loan features on bank loan prediction using R andom F orest algorithm. Engineering Reports, 6(2), p.e12707.Khairi et al., 2021
Khairi, A., Bahri, B. and Artha, B., 2021. A literature review of non-performing loan. Journal of Business and Management Review, 2(5), pp.366-373.Musdholifah et al., 2020
Musdholifah, M., Hartono, U. and Wulandari, Y., 2020. Banking crisis prediction: emerging crisis determinants in Indonesian banks. International Journal of Economics and Financial Issues, 10(2), p.124.Ma et al., 2023
Ma, Y., Yu, C., Yan, M., Sangaiah, A.K. and Wu, Y., 2023. Dark-side avoidance of mobile applications with data biases elimination in socio-cyber world. IEEE Transactions on Computational Social Systems.Koulouridi et al., 2021
Koulouridi, E., Kumar, S., Nario, L., Pepanides, T. and Vettori, M., 2020. Managing and monitoring credit risk after the COVID-19 pandemic. McKinsey & Company.
Alzubi, O.A., Alzubi, J.A., Al-Zoubi, A.M., Hassonah, M.A. and Kose, U., 2022. An efficient malware detection approach with feature weighting based on Harris Hawks optimization. Cluster Computing, pp.1-19.
Koulouridi, E., Kumar, S., Nario, L., Pepanides, T. and Vettori, M., 2020. Managing and monitoring credit risk after the COVID-19 pandemic. McKinsey & Company.
Rawate, K.R. and Tijare, P.A., 2017. Review on prediction system for bank loan credibility. International Journal of Advance Engineering and Research Development, 4(12), pp.860-867.
Bhargav, P. and Sashirekha, K., 2023. A Machine Learning Method for Predicting Loan Approval by Comparing the Random Forest and Decision Tree Algorithms. Journal of Survey in Fisheries Sciences, 10(1S), pp.1803-1813.
Dasari, Y., Rishitha, K. and Gandhi, O., 2023. Prediction of bank loan status using machine learning algorithms. International Journal of Computing and Digital Systems, 14(1), pp.1-1.
Abdullah, M., Chowdhury, M.A.F., Uddin, A. and Moudud‐Ul‐Huq, S., 2023. Forecasting nonperforming loans using machine learning. Journal of Forecasting, 42(7), pp.1664-1689.
Kavitha, M.N., Saranya, S.S., Dhinesh, E., Sabarish, L. and Gokulkrishnan, A., 2023, March. Hybrid ML classifier for loan prediction system. In 2023 international conference on sustainable computing and data communication systems (ICSCDS) (pp. 1543-1548). IEEE.
Wang, Y., Wang, M., Pan, Y. and Chen, J., 2023. Joint loan risk prediction based on deep learning‐optimized stacking model. Engineering Reports, p.e12748.
Alsaleem, M.Y. and Hasoon, S.O., 2020. Predicting bank loan risks using machine learning algorithms. AL-Rafidain Journal of Computer Sciences and Mathematics, 14(1), pp.149-158.
Wang, D., Wu, Q. and Zhang, W., 2019. Neural learning of online consumer credit risk. arXiv preprint arXiv:1906.01923.
Supriya, P., Pavani, M., Saisushma, N., Kumari, N.V. and Vikas, K., 2019. Loan prediction by using machine learning models. International Journal of Engineering and Techniques, 5(2), pp.144-147.
Sun, T. and Vasarhalyi, M.A., 2021. Predicting credit card delinquencies: An application of deep neural networks. Handbook of Financial Econometrics, Mathematics. Statistics, and Machine Learning, pp.4349-4381.
Madaan, M., Kumar, A., Keshri, C., Jain, R. and Nagrath, P., 2021. Loan default prediction using decision trees and random forest: A comparative study. In IOP Conference Series: Materials Science and Engineering (Vol. 1022, No. 1, p. 012042). IOP Publishing.
Anand, M., Velu, A. and Whig, P., 2022. Prediction of loan behaviour with machine learning models for secure banking. Journal of Computer Science and Engineering (JCSE), 3(1), pp.1-13.
Kumar, C.N., Keerthana, D., Kavitha, M. and Kalyani, M., 2022, June. Customer loan eligibility prediction using machine learning algorithms in banking sector. In 2022 7th international conference on communication and electronics systems (ICCES) (pp. 1007-1012). IEEE.
Dosalwar, S., Kinkar, K., Sannat, R. and Pise, N., 2021. Analysis of loan availability using machine learning techniques. International Journal of Advanced Research in Science, Communication and Technology (IJARSCT), 9(1), pp.15-20.
Blessie, E.C. and Rekha, R., 2019. Exploring the machine learning algorithm for prediction the loan sanctioning process. International Journal of Innovative Technology and Exploring Engineering (IJITEE), 9(1), pp.2714-2719.
Loan prediction problem dataset. Loan Prediction Problem Dataset (kaggle.com)