Business Failure Prediction by Hybrid Data Mining Approach: A Case of Thailand Agribusiness

Authors

  • Jeerawadee Pumjaroen, Applied Statistics, Faculty of Science and Technology, Rajamangala University of Technology Thanyaburi (RMUTT)

Keywords:

data mining, business failure, forecasting, early warning model, classification

Abstract

The failure of a business can significantly affect private companies, the government, and the economy as a whole. Predicting business failure has therefore long been a major research problem in business and economics. Many methods, including theoretical models, statistical models, and data mining techniques, have been applied to the problem. This research developed a business failure prediction model that uses a hybrid data mining technique to classify failed and non-failed companies one to three years before failure. Its focus is the integration of clustering and classification techniques, which may benefit further research on business failure prediction and early warning models. The study covered 3,118 agribusiness companies in Thailand that submitted financial statements from 2016 to 2020. Using the financial statement data, single classifiers, namely decision tree (DT), logistic regression (LR), and neural network (NN), were compared with a hybrid data mining technique that combines clustering and classification. The results showed that the hybrid method combining k-means and DT improved business failure prediction performance.
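The comparison between a single classifier and a clustering-plus-classification hybrid can be illustrated with a minimal sketch. The abstract does not state how the cluster output is passed to the classifier, so the version below assumes one common arrangement: the k-means cluster label is appended to the inputs of a decision tree as an extra feature. The synthetic data, the number of clusters, the tree depth, and the helper name add_cluster are all illustrative assumptions, not the study's data or settings.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for financial-ratio features (NOT the study's data).
# Class 1 plays the role of "failed" and is the minority class.
X, y = make_classification(n_samples=1000, n_features=8, n_informative=5,
                           weights=[0.8, 0.2], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

# Baseline: a single decision tree on the raw features.
single_dt = DecisionTreeClassifier(max_depth=4, random_state=0)
single_dt.fit(X_train, y_train)
acc_single = accuracy_score(y_test, single_dt.predict(X_test))

# Hybrid (assumed design): fit k-means on standardized training features
# only, so no information from the test set leaks into the clusters.
scaler = StandardScaler().fit(X_train)
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
kmeans.fit(scaler.transform(X_train))


def add_cluster(X_part):
    """Append the k-means cluster label as one extra feature column."""
    labels = kmeans.predict(scaler.transform(X_part))
    return np.column_stack([X_part, labels])


# Decision tree trained on the original features plus the cluster label.
hybrid_dt = DecisionTreeClassifier(max_depth=4, random_state=0)
hybrid_dt.fit(add_cluster(X_train), y_train)
acc_hybrid = accuracy_score(y_test, hybrid_dt.predict(add_cluster(X_test)))

print(f"single DT accuracy:    {acc_single:.3f}")
print(f"k-means + DT accuracy: {acc_hybrid:.3f}")
```

On synthetic data the two scores may be close or ordered either way, so the sketch shows only the mechanics of the comparison. With imbalanced failure data, a metric such as recall or area under the precision-recall curve would likely be more informative than accuracy.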

Published

2023-06-19

How to Cite

Pumjaroen, J. (2023). Business Failure Prediction by Hybrid Data Mining Approach: A Case of Thailand Agribusiness. Journal of Applied Statistics and Information Technology, 8(1), 35–48. Retrieved from https://ph02.tci-thaijo.org/index.php/asit-journal/article/view/248356