Prediction Models for Tourism Stock Market Trends during COVID-19 pandemic using News Sentiment Analysis with Data Mining: A Case Study
Keywords:Data mining, Machine learning, Stock market, Sentiment analysis, Text mining, tourism industry, COVID-19
This work introduces a process for predicting the trends of stocks in the tourism sector during COVID-19 pandemic using sentiment analysis of COVID19 news headlines with data mining techniques. The COVID-19 news headlines are first collected daily and analyzed via sentiment analysis to obtain their polarity based on naïve Bayes and neural network techniques. These polarity results are then used with the related stock historical data to predict the trend of the stock prices by K-nearest neighbor and decision tree classifications. In our numerical experiments, seven major stocks from the tourism and hotel business operated in Thailand are considered. Our proposed prediction models are shown to have accuracies ranging around 70%- 90%. The highest accuracy of about 90% is achieved when a neuron network is used in the sentiment analysis with the decision tree for predicting the stock trends.
Yang Y, Zhang CX, Rickly JM. A review of early COVID-19 research in tourism: Launching the Annals of Tourism Research’s Curated Collection on coronavirus and tourism. Annals of Tourism Research. 2021;91:103313.
Gössling S, Scott D, Hall CM. Pandemics, tourism and global change: a rapid assessment of COVID-19. Journal of Sustainable Tourism. 2021;29(1):1–20.
Sigala M. Tourism and COVID-19: Impacts and implications for advancing and resetting industry and research. Journal of business research. 2020;117:312–21.
Duarte Alonso A, Kok SK, Bressan A, O’Shea M, Sakellarios N, Koresis A, et al. COVID-19, aftermath, impacts, and hospitality firms: An international perspective. International Journal of Hospitality Management. 2020;91:102654. Available from: https://www. sciencedirect.com/science/article/pii/S0278431920302061.
Organisation WT. UNWTO world tourism barometer and statistical annex. 2021;7.
Cong Nguyen To B, Khac Quoc Nguyen B, Van Thien Nguyen T, Thi Minh Nguyen P. Vaccine initiation rate and volatility in the international stock market during COVID-19. Bao and Van Thien Nguyen, Tam and Thi Minh Nguyen, Phuong, Vaccine Initiation Rate and Volatility in the International Stock Market during COVID-19 (September 29, 2021). 2021.
Khalfaoui R, Nammouri H, Labidi O, Jabeur SB. Is the COVID-19 vaccine effective on the US financial market? Public Health. 2021;198:177–9.
Al-Jassar SA, Moosa IA. The effect of quantitative easing on stock prices: a structural time series approach. Applied Economics. 2019;51(17):1817–27.
Tsai MC, Cheng CH, Tsai MI, Shiu HY. Forecasting leading industry stock prices based on a hybrid time-series forecast model. PloS one. 2018;13(12):e0209922.
Ariyo AA, Adewumi AO, Ayo CK. Stock price prediction using the ARIMA model. In: 2014 UKSim-AMSS 16th international conference on computer modelling and simulation. IEEE; 2014. p. 106–12.
Jarrett JE, Kyper E. ARIMA modeling with intervention to forecast and analyze Chinese stock prices. International Journal of Engineering Business Management. 2011;3(3):53–8.
Mondal P, Shit L, Goswami S. Study of effectiveness of time series modeling (ARIMA) in forecasting stock prices. International Journal of Computer Science, Engineering and Applications. 2014;4(2):13.
Yu P, Yan X. Stock price prediction based on deep neural networks. Neural Computing and Applications. 2020;32(6):1609–28.
Lahmiri S. Wavelet low-and highfrequency components as features for predicting stock prices with backpropagation neural networks. Journal of King Saud University-Computer and Information Sciences. 2014;26(2):218–27.
Serletis A. Money and stock prices in the United States. Applied Financial Economics. 1993;3(1):51–4.
Du J, Liu Q, Chen K, Wang J. Forecasting stock prices in two ways based on LSTM neural network. In: 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). IEEE; 2019. p. 1083–6.
Strapparava C, Valitutti A, et al. Wordnet affect: an affective extension of wordnet. In: Lrec. vol. 4. Lisbon, Portugal; 2004. p. 40.
Baccianella S, Esuli A, Sebastiani F. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10); 2010. .
Li X, Xie H, Chen L, Wang J, Deng X. News impact on stock price return via sentiment analysis. Knowledge-Based Systems. 2014;69:14–23.
Mohan S, Mullapudi S, Sammeta S, Vijayvergia P, Anastasiu DC. Stock price prediction using news sentiment analysis. In: 2019 IEEE Fifth International Conference on Big Data Computing Service and Applications (BigDataService). IEEE; 2019. p. 205–8.
Jing N, Wu Z, Wang H. A hybrid model integrating deep learning with investor sentiment analysis for stock price prediction. Expert Systems with Applications. 2021;178:115019.
Liang X, Chen RC, He Y, Chen Y. Associating stock prices with web financial information time series based on support vector regression. Neurocomputing. 2013;115:142–9.
Derakhshan A, Beigy H. Sentiment analysis on stock social media for stock price movement prediction. Engineering Applications of Artificial Intelligence. 2019;85:569–78.
Wu DD, Zheng L, Olson DL. A decision support approach for online stock forum sentiment analysis. IEEE transactions on systems, man, and cybernetics: systems. 2014;44(8):1077–87.
Mehta P, Pandya S, Kotecha K. Harvesting social media sentiment analysis to enhance stock market prediction using deep learning. PeerJ Computer Science. 2021;7:e476.
Wu B, Wang L, Wang S, Zeng YR. Forecasting the US oil markets based on social media information during the COVID-19 pandemic. Energy. 2021;226:120403.
Liew VKS. Abnormal returns on tourism shares in the Chinese stock exchanges amid the COVID-19 pandemic. Available at SSRN 3863889. 2020.
He P, Sun Y, Zhang Y, Li T. COVID–19’s impact on stock prices across different sectors—An event study based on the Chinese stock market. Emerging Markets Finance and Trade. 2020;56(10):2198–212.
Agovino M, Musella G. Economic losses in tourism during the COVID-19 pandemic. The case of Sorrento. Current Issues in Tourism. 2021:1–25.
Pan WT, Huang QY, Yang ZY, Zhu FY, Pang YN, Zhuang ME. Determinants of Tourism Stocks During the COVID-19: Evidence From the Deep Learning Models. Frontiers in Public Health. 2021;9. Available from: https: //www.frontiersin.org/articles/ 10.3389/fpubh.2021.675801.
Laeeq Razzak Janjua FM, Sukjai P, Rehman A, Yu Z. Impact of COVID-19 pandemic on logistics performance, economic growth and tourism industry of Thailand: an empirical forecasting using ARIMA. Brazilian Journal of Operations & Production Management. 2021;18(2):e2021999.
Sontayasara T, Jariyapongpaiboon S, Promjun A, Seelpipat N, Saengtabtim K, Tang J, et al. Twitter sentiment analysis of Bangkok tourism during COVID-19 pandemic using support vector machine algorithm. Journal of Disaster Research. 2021;16(1):24–30.
Mishra RK, Urolagin S, Jothi J, Neogi A, Nawaz N. Deep learning-based sentiment analysis and topic modeling on tourism during Covid-19 pandemic. Frontiers in Computer Science. 2021;3(10.3389).
Obembe D, Kolade O, Obembe F, Owoseni A, Mafimisebi O. Covid-19 and the tourism industry: An early stage sentiment analysis of the impact of social media and stakeholder communication. International Journal of Information Management Data Insights. 2021;1(2):100040.
Jimenez M, Maxime C, Le Traon Y, Papadakis M. On the impact of tokenizer and parameters on n-gram based code analysis. In: 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE; 2018. p. 437–48.
Tremblay A, Tucker BV. The effects of N-gram probabilistic measures on the recognition and production of fourword sequences. The Mental Lexicon. 2011;6(2):302–24.
Pratiwi IYR, Asmara RA, Rahutomo F. Study of hoax news detection using naïve bayes classifier in Indonesian language. In: 2017 11th International Conference on Information & Communication Technology and System (ICTS). IEEE; 2017. p.73–8.
Banchs RE. Text mining with MATLAB®. Springer; 2013.
Priddy KL, Keller PE. Artificial neural networks: an introduction. vol. 68. SPIE press; 2005.
Hastie T, Tibshirani R, Friedman JH, Friedman JH. The elements of statistical learning: data mining, inference, and prediction. vol. 2. Springer; 2009.
Asghar MZ, Rahman F, Kundi FM, Ahmad S. Development of stock market trend prediction system using multiple regression. Computational and mathematical organization theory. 2019;25(3):271–301.
Quinlan JR. Learning decision tree classifiers. ACM Computing Surveys (CSUR).1996;28(1):71-2.
Banfield RE, Hall LO, Bowyer KW, Kegelmeyer WP. A comparison of decision tree ensemble creation techniques. IEEE transactions on pattern analysis and machine intelligence. 2006;29(1):173-80.
Hindrayani KM, Fahrudin TM, Aji RP, Safitri EM. Indonesian Stock Price Prediction including Covid19 Era Using Decision Tree Regression. In: 2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI). IEEE; 2020. p. 344-7.
How to Cite
Copyright (c) 2023 Science & Technology Asia
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.