Thai Language Sentiment Analysis with a Hybrid Method on WangchanBERTa-CNN-BiLSTM
Main Article Content
Abstract
Understanding emotions conveyed in text, especially in non-global languages such as Thai, sentiment analysis is particularly important in Thailand. However, this endeavor faces challenges due to variations in text length, which significantly impact sentiment analysis outcomes. Previous research has employed neural network and machine learning models in the process, yet each model specializes in different aspects, making comprehensive sentiment analysis coverage unattainable. Recent research has delved into hybrid models like CNN-BiLSTM and BiLSTM-CNN. Although they demonstrate efficacy, their performance still varies across different datasets. For instance, CNN-BiLSTM excels with short sentences by considering surrounding word context, while BiLSTM-CNN is more effective with long sentences due to its bidirectional learning capability. While showing promise, these models perform effectively, but varied text lengths in datasets often lead to sentiment misinterpretation. To address these challenges, inspired by recent advances, we propose an innovative solution: the Parallel Hybrid model. This approach integrates WangchanBERTa into both CNN-BiLSTM and BiLSTM-CNN architectures, harnessing ensemble techniques to improve overall performance and adaptability. Our experiments, conducted on datasets like Wisesight, a highly imbalanced dataset with mostly longer texts, and Thai Children's Tales, a less imbalanced dataset with mostly shorter texts, confirm the effectiveness of the Parallel Hybrid model, which outperforms other model configurations with Macro F1 scores of 0.6270 and 0.7859, respectively. This research marks a significant advancement in sentiment analysis for the Thai language.
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
I/we certify that I/we have participated sufficiently in the intellectual content, conception and design of this work or the analysis and interpretation of the data (when applicable), as well as the writing of the manuscript, to take public responsibility for it and have agreed to have my/our name listed as a contributor. I/we believe the manuscript represents valid work. Neither this manuscript nor one with substantially similar content under my/our authorship has been published or is being considered for publication elsewhere, except as described in the covering letter. I/we certify that all the data collected during the study is presented in this manuscript and no data from the study has been or will be published separately. I/we attest that, if requested by the editors, I/we will provide the data/information or will cooperate fully in obtaining and providing the data/information on which the manuscript is based, for examination by the editors or their assignees. Financial interests, direct or indirect, that exist or may be perceived to exist for individual contributors in connection with the content of this paper have been disclosed in the cover letter. Sources of outside support of the project are named in the cover letter.
I/We hereby transfer(s), assign(s), or otherwise convey(s) all copyright ownership, including any and all rights incidental thereto, exclusively to the Journal, in the event that such work is published by the Journal. The Journal shall own the work, including 1) copyright; 2) the right to grant permission to republish the article in whole or in part, with or without fee; 3) the right to produce preprints or reprints and translate into languages other than English for sale or free distribution; and 4) the right to republish the work in a collection of articles in any other mechanical or electronic format.
We give the rights to the corresponding author to make necessary changes as per the request of the journal, do the rest of the correspondence on our behalf and he/she will act as the guarantor for the manuscript on our behalf.
All persons who have made substantial contributions to the work reported in the manuscript, but who are not contributors, are named in the Acknowledgment and have given me/us their written permission to be named. If I/we do not include an Acknowledgment that means I/we have not received substantial contributions from non-contributors and no contributor has been omitted.
References
S. Zhang, Z. Wei, Y. Wang, and T. Liao, "Sentiment analysis of Chinese micro-blog text based on extended sentiment dictionary," Future Generation Computer Systems, vol. 81, pp. 395-403, 2018/04/01/ 2018, doi: https://doi.org/10.1016/j.future.2017.09.048.
Y. Zhang and Z. Rao, "n-BiLSTM: BiLSTM with n-gram Features for Text Classification," in 2020 IEEE 5th Information Technology and Mechatronics Engineering Conference (ITOEC), 12-14 June 2020 2020, pp. 1056-1059, doi: 10.1109/ITOEC49072.2020.9141692. [Online]. Available: ttps://ieeexplore.ieee.org/stampPDF/getPDF.jsp?tp=&arnumber=9141692&ref=
F. E. Cahyanti, Adiwijaya, and S. A. Faraby, "On The Feature Extraction For Sentiment Analysis of Movie Reviews Based on SVM," in 2020 8th International Conference on Information and Communication Technology (ICoICT), 24-26 June 2020 2020, pp. 1-5, doi: 10.1109/ICoICT49345.2020.9166397. [Online]. Available: https://ieeexplore.ieee.org/stampPDF/getPDF.jsp?tp=&arnumber=9166397&ref=
M. Mishra and A. Patil, "Sentiment Prediction of IMDb Movie Reviews Using CNN-LSTM Approach," in 2023 International Conference on Control, Communication and Computing (ICCC), 19-21 May 2023 2023, pp. 1-6, doi: 10.1109/ICCC57789.2023.10165155. [Online]. Available: https://ieeexplore.ieee.org/stampPDF/getPDF.jsp?tp=&arnumber=10165155&ref=
Y. Al Amrani, M. Lazaar, and K. E. El Kadiri, "Random Forest and Support Vector Machine based Hybrid Approach to Sentiment Analysis," Procedia Computer Science, vol. 127, pp. 511-520, 2018/01/01/ 2018, doi: https://doi.org/10.1016/j.procs.2018.01.150.
B. ErŞAhİN, Ö. Aktaş, D. Kilinç, and M. ErŞAhİN, "A hybrid sentiment analysis method for Turkish," TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES, vol. 27, pp. 1780-1793, 05/15 2019, doi: 10.3906/elk-1808-189.
U. Naseem, S. K. Khan, I. Razzak, and I. A. Hameed, "Hybrid Words Representation for Airlines Sentiment Analysis," in AI 2019: Advances in Artificial Intelligence, Cham, J. Liu and J. Bailey, Eds., 2019// 2019: Springer International Publishing, pp. 381-392.
R. Cai et al., "Sentiment Analysis About Investors and Consumers in Energy Market Based on BERT-BiLSTM," IEEE Access, vol. 8, pp. 171408-171415, 2020, doi: 10.1109/ACCESS.2020.3024750.
K. Pasupa and T. Seneewong Na Ayutthaya, "Hybrid Deep Learning Models for Thai Sentiment Analysis," Cognitive Computation, vol. 14, no. 1, pp. 167-193, 2022/01/01 2022, doi: 10.1007/s12559-020-09770-0.
B. Gupta, P. Prakasam, and T. Velmurugan, "Integrated BERT embeddings, BiLSTM-BiGRU and 1-D CNN model for binary sentiment classification analysis of movie reviews," Multimedia Tools and Applications, vol. 81, no. 23, pp. 33067-33086, 2022/09/01 2022, doi: 10.1007/s11042-022-13155-w.
N. Chen, Y. Sun, and Y. Yan, "Sentiment analysis and research based on two-channel parallel hybrid neural network model with attention mechanism," IET Control Theory & Applications, vol. 17, no. 17, pp. 2259-2267, 2023/11/01 2023, doi: https://doi.org/10.1049/cth2.12463