An Investigation of Machine Learning Techniques for Loan Default Payments Prediction

Main Article Content

Jesada Kajornrit
Wilawan Inchamnam
Waraporn Jirapanthong


In banking business, loan default payments of individual customers are counted as risks that result in the loss of the business. Thus, some assessment mechanisms are needed to assess the risks of individual customers who apply for personal loan products. This paper presents an investigation of machine learning techniques to predict loan default payments based on individual customers information backgrounds. The paper emphasis on the ensemble techniques that mostly used in banking business. Besides the ensemble prediction models, the principal component analysis is also used for further investigation. The experimental results showed that all prediction models provided acceptable prediction of non-defaulting payment class, but provided unacceptable prediction of default payment class. That is because the imbalance nature of the data and the features used are not specific enough for the prediction models to classify the minor class from the major class. This paper acts as an initial study of the credit default payment analysis.

Article Details

How to Cite
J. Kajornrit, W. . Inchamnam, and W. . Jirapanthong, “An Investigation of Machine Learning Techniques for Loan Default Payments Prediction”, JIST, vol. 13, no. 1, pp. 38–44, Jun. 2023.
Research Article: Business and Information System(Detail in Scope of Journal)


A. K. I. Hassan and a. Abraham, “Modeling Consumer Loan Default Prediction Using Ensemble Neural Networks,” In Proc. of Int. Conf. on Computing, Electrical and Electronic Engineering (ICCEEE), Khartoum, Sudan, 2013, pp. 719-724.

T. Alam, K. Shaukat, I. A. Hameed, S. Luo, M. U. Sarwar, S. Shabbir, J. Li, and M. Khushi, “An Investigation of Credit Card Default Prediction in the Imbalanced Datasets,” IEEE Access, Vol 8. October 2020, pp. 201173-201198.

M. Dumont, R. Maree, L. Wehenkel, and P. Geurts, “Fast multi-class image annotation with random subwindows and multiple output randomized trees,”, in Conf. Computer Vision Theory and Applications, Lisboa, Portugal, February 2009.

L. Breiman, “Random Forests, “Machine Language, Vol. 45, No. 1, October 2001, pp 5–32.

P. Geurts, D. Ernst., and L. Wehenkel, “Extremely randomized trees”, Machine Learning, Vol. 63, No. 1, 2006, pp. 3-42.

Y. Freund, and R. Schapire, “A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting,” Journal of Computer and System Sciences, Vol. 55, No. 1, August 1997, pp. 119-139.

J. H. Friedman. "Greedy function approximation: A gradient boosting machine," Ann. Statist, Vol. 29 No. 5, October 2001, pp. 1189 - 1232.

O. Sornil and J. Kajornrit, “Improving performance of neural network models for intrusion detection using singular value decomposition”, Recent Advances in Intelligent Systems and Signal Processing, WSES Press, 2003, pp. 338-342.

A. Soni and K. C. P. Shankar, “Bank Loan Default Prediction Using Ensemble Machine Learning Algorithm” In Proc. of 2nd Int. Conf. on Interdisciplinary Cyber Physical Systems (ICPS), Chennai, India, 2022, pp. 170-175.

S. Fan, “Design and implementation of a personal loan default prediction platform based on LightGBM model,” In Proc. on 3rd Int. Conf. International Conference on Power, Electronics and Computer Applications, Shenyang (ICPECA), China, 2023, pp. 1232-1236.

S. K. Shaheen and E. ElFakharany, “Predictive analytics for loan default in banking sector using machine learning techniques,” In Prod. Of 28th Int. Conf. on International Conference on Computer Theory and Applications (ICCTA), Alexandria, Egypt, 2018, pp. 66-71.

L. Lai, “Loan Default Prediction with Machine Learning Techniques,” In Prod. of Int. Conf. International Conference on Computer Communication and Network Security (CCNS), Xi'an, China, 2020, pp. 5-9.

S. Barua, D. Gavandi, P. Sangle, L. Shinde, and J. Ramteke, “Swindle: Predicting the Probability of Loan Defaults using CatBoost Algorithm,” In Proc. of 5th Int. Conf. on Computing Methodologies and Communication (ICCMC), Erode, India, 2021, pp. 1710-1715.

A. Al-qerem, G. Al-Naymat, and M. Alhasan, “Loan Default Prediction Model Improvement through Comprehensive Preprocessing and Features Selection,” In Proc. of Int. Arab Conference on Information Technology (ACIT), Al Ain, United Arab Emirates, 2019, pp. 235-240.

B. Patel, H. Patil, J. Hembram, and S. Jaswal, “Loan Default Forecasting using Data Mining,” In Proc. of Int. Conf. for Emerging Technology (INCET), Belgaum, India, 2020, pp. 1-4.