Predictive Analysis of COVID-19 Patients in Thailand using Multiple Countries Data

Main Article Content

Siratee Vorathamthongdee
Prabhas Chongstitvatana

Abstract

COVID-19 is a situation that has spread worldwide since 2019. This study predicts the number of patients with COVID-19 in Thailand. Using data between January 22, 2020, and December 31, 2021, we collect confirmed cases from John Hopkins open data. Using the machine learning model to predict the number of patient cases in the country helps the government manage its policies and resources. In this study, the K-Means clustering algorithm performs to group the countries that have similar patterns of confirmed cases to Thailand. Clustering results show that Japan, Malaysia, the Philippines, Bangladesh, Cuba, Iraq, Mexico, and Vietnam are all in the same cluster as Thailand. Using Long Short-Term Memory (LSTM) to predict the confirmed case of Thailand by feeding the model pairs of countries in the same cluster as Thailand, the performance of LSTM shows that using pairs of countries between Bangladesh, Japan, and Mexico with Thailand has the lowest error on MAPE, respectively, when compared to using only Thailand data.


 

Article Details

Section
Research Articles

References

O. L. Aiyegbusi et al., "Symptoms, complications and management of long COVID: a review," Journal of the Royal Society of Medicine, vol. 114, no. 9, pp. 428-442, 2021.

W. H. Organization, "Responding to community spread of COVID-19: interim guidance, 7 March 2020," World Health Organization, 2020.

WHO. (7 June 2022). Archived: WHO Timeline - COVID-19. [Online] Available: https://www.who.int/news/item/ 27-04-2020-who-timeline---covid-19

T. Panch, H. Mattie, and L. A. Celi, "The “inconvenient truth” about AI in healthcare," NPJ digital medicine, vol. 2, no. 1, pp. 1-3, 2019.

P. Warittha, Suchanan, C. (10 June 2022). Tourism at a crossroad. Bank of Thailand. [Online] Available: https://www.bot.or.th/Thai/ResearchAndPublications/articles/Pages/Article_18Aug2021.aspx

T. Mantoro, R. T. Handayanto, M. A. Ayu, and J. Asian, "Prediction of covid-19 spreading using support vector regression and susceptible infectious recovered model," in 2020 6th International Conference on Computing Engineering and Design (ICCED), 2020: IEEE, pp. 1-5.

R. L. Kumar, F. Khan, S. Din, S. S. Band, A. Mosavi, and E. Ibeke, "Recurrent neural network and reinforcement learning model for COVID-19 prediction," Frontiers in public health, vol. 9, 2021.

L. Bi, M. Fili, and G. Hu, "COVID-19 forecasting and intervention planning using gated recurrent unit and evolutionary algorithm," Neural Computing and Applications, pp. 1-19, 2022.

A. Sherstinsky, "Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network," Physica D: Nonlinear Phenomena, vol. 404, p. 132306, 2020.

R. M. Carrillo-Larco and M. Castillo-Cara, "Using country-level variables to classify countries according to the number of confirmed COVID-19 cases: An unsupervised machine learning approach," Wellcome open research, vol. 5, 2020.

D. Abdullah, S. Susilo, A. S. Ahmar, R. Rusli, and R. Hidayat, "The application of K-means clustering for province clustering in Indonesia of the risk of the COVID-19 pandemic based on COVID-19 data," Quality & Quantity, vol. 56, no. 3, pp. 1283-1291, 2022.

V. K. R. Chimmula and L. Zhang, "Time series forecasting of COVID-19 transmission in Canada using LSTM networks," Chaos, Solitons & Fractals, vol. 135, p. 109864, 2020.

S. Dutta and S. K. Bandyopadhyay, "Machine learning approach for confirmation of covid-19 cases: Positive, negative, death and release," MedRxiv, 2020.

A. Tomar and N. Gupta, "Prediction for the spread of COVID-19 in India and effectiveness of preventive measures," Science of The Total Environment, vol. 728, p. 138762, 2020.

C. Xu, "A Comparative Study: Time-Series Analysis Methods for Predicting COVID-19 Case Trend," ed, 2021.

H. Baalbaki et al., "Fighting against COVID-19: Who Failed and Who Succeeded?,", Journal of Computer and Communications, vol. 10, no. 4, pp. 32-50, 2022.

A. B. Said, A. Erradi, H. A. Aly, and A. Mohamed, "Predicting COVID-19 cases using bidirectional LSTM on multivariate time series," Environmental Science and Pollution Research, vol. 28, no. 40, pp. 56043-56052, 2021.

J. MacQueen, "Some methods for classification and analysis of multivariate observations," 1967.

D. Arthur and S. Vassilvitskii, (10 June 2022). k-means++: The Advantages of Careful Seeding, Stanford InfoLab, Technical Report 2006. [Online]. Available: http://ilpubs.stanford.edu:8090/778/

R. C. Staudemeyer and E. R. Morris, "Understanding LSTM--a tutorial into long short-term memory recurrent neural networks," arXiv preprint arXiv:1909.09586, 2019.