DEEP LEARNING MODELS FOR EARLY DETECTION OF DEPRESSION FROM SOCIAL MEDIA TEXT

Authors

  • Duangporn Janon Department of Statistic, Faculty of Science, King Mongkut’s Institute of Technology Ladkrabang (KMITL), Bangkok Thailand 10520
  • Pronpimol Chaiwuttisak Department of Statistic, Faculty of Science, King Mongkut’s Institute of Technology Ladkrabang (KMITL), Bangkok Thailand 10520

DOI:

https://doi.org/10.55003/IJIET.8109

Keywords:

Depression detection, Reddit, eRisk2025, TF-IDF, BERT-base, Emotion-RoBERTa

Abstract

Depression is a major global health concern. Recent studies suggest that language used on social media may reveal early signs of psychological distress before individuals seek formal clinical support. The objective of this study was to develop and evaluate transformer-based natural language processing models for detecting depressive risk signals in long-form Reddit posts using the ERISK2025 dataset. A traditional TF-IDF representation combined with Logistic regression served as the baseline and was compared with BERT-base and the emotion-aware Emotion-RoBERTa model under identical experimental conditions. Model performance was assessed using accuracy, risk-class recall, macro-F1, AUC, and confusion-matrix-based error distribution. Emotion-RoBERTa achieved the highest overall accuracy (0.76) and the strongest sensitivity to high-risk users with a risk-class recall of 0.91 and the lowest false-negative rate among the tested models. BERT-base showed moderate performance. The TF-IDF baseline was weakest at identifying high-risk cases. These findings indicate that emotion-aware language representations can improve early detection of depression risk
in social media text and may identify vulnerable users more reliably than traditional text-representation methods. The results also indicate that model evaluation in this setting should rely on clinically meaningful metrics rather than accuracy alone. Future work should consider larger datasets together with temporal patterns in user posts and ethical constraints related to real-world mental health screening.

References

Chaffey, D. (2025). Global social media statistics research summary. Smart Insights. https://www.smartinsights.com/social-media-marketing/social-media-strategy/new-global-social-media-research/.

Pew Research Center. (2025). Social media fact sheet. https://www.pewresearch.org/internet/fact-sheet/social-media/.

National Statistical Office of Thailand. (2025). Survey on the use of information and communication technology in households, 2025 (Quarter 2). Ministry of Digital Economy and Society.

https://www.nso.go.th/nsoweb/storage/survey_detail/2025/20250903131427_86572.pdf.

Tsao, S. F., Chen, H., Tisseverasinghe, T., Yang, Y., Li, L., & Butt, Z. A. (2021). What social media told us in the time of COVID-19: A scoping review. The Lancet Digital Health, 3(3), 175–194.

Handayani, P. W., Zagatti, G. A., Kefi, H., & Bressan, S. (2023). Impact of social media usage on users’ COVID-19 protective behavior: Survey study in Indonesia. JMIR Formative Research, 7(1).

Steinert, S., & Dennis, M. J. (2022). Emotions and digital well-being: On social media’s emotional affordances. Philosophy & Technology, 35(2), 36.

Chen, J. C. J., Yan, Y. Y. Y., & Leach, J. L. J. (2022). Are emotion-expressing messages more shared on social media? A meta-analytic review. Review of Communication Research, 10. Ophir, Y., Asterhan, C. S., & Schwarz, B. B. (2019). The digital footprints of adolescent depression, social rejection and victimization of bullying on Facebook. Computers in Human Behavior, 91, 62-71.

Nesi, J., Rothenberg, W. A., Bettis, A. H., Massing-Schaffer, M., Fox, K. A., Telzer, E. H., & Prinstein, M. J. (2022). Emotional responses to social media experiences among adolescents: Longitudinal associations with depressive symptoms. Journal of Clinical Child & Adolescent Psychology, 51(6), 907-922.

Yang, G., King, S. G., Lin, H. M., & Goldstein, R. Z. (2023). Emotional expression on social media support forums for substance cessation: Observational study of text-based Reddit posts. Journal of Medical Internet Research, 25. WHO. (2025a). Depressive disorder (depression). https://www.who.int/news-room/fact-sheets/detail/depression.

Fan, Y., Fan, A., Yang, Z., & Fan, D. (2025). Global burden of mental disorders in 204 countries and territories, 1990–2021: Results from the global burden of disease study 2021. BMC Psychiatry, 25(1), 486.

Yoch, M., & Sirull, R. (2021). New global burden of disease analyses show depression and anxiety among the top causes of health loss worldwide. Institute for Health Metrics and Evaluation. https://www.healthdata.org/news-events/insights-blog/acting-data/new-global-burden-disease-analyses-show-depression-and.

Proudman, D., Greenberg, P., & Nellesen, D. (2021). The growing burden of major depressive disorders (MDD): Implications for researchers and policy makers. Pharmacoeconomics, 39(6), 619-625.

Reddy, M. S. (2010). Depression: The disorder and the burden. Indian Journal of Psychological Medicine, 32(1), 1-2.

Moreno-Agostino, D., Wu, Y. T., Daskalopoulou, C., Hasan, M. T., Huisman, M., & Prina, M. (2021). Global trends in the prevalence and incidence of depression: A systematic review and meta-analysis. Journal of Affective Disorders, 281, 235-243.

WHO. (2025b). Mental health and social connection in Thailand. https://www.who.int/thailand/news/feature-stories/detail/mental-health-and-social-connection-in-thailand.

Bangkok Post. (2022). Teens face pandemic woes. https://www.bangkokpost.com/thailand/general/2314054/teens-face-pandemic-woes.

Munthuli, A., Pooprasert, P., Klangpornkun, N., Phienphanich, P., Onsuwan, C., Jaisin, K., & Tantibundhit, C. (2023). Classification and analysis of text transcription from Thai depression assessment tasks among patients with depression. PLoS ONE, 18(3).

National Health Security Office. (2023). Mental health hotline integrated into the UCS.

https://eng.nhso.go.th/view/1/DescriptionNews/Mental-health-hotline-integrated-into-the-UCS/511/EN-US.

Pruksarungruang, J., & Rhein, D. (2022). Depression literacy: An analysis of the stigmatization of depression in Thailand. SAGE Open, 12(4).

Alaramma, S. K., Habu, A. A., Ya’u, B. I., & Madaki, A. G. (2023). Sentiment analysis of sarcasm detection in social media. Gadau Journal of Pure and Allied Sciences, 2(1), 76-82.

Alqahtani, A., Alhenaki, L., & Alsheddi, A. (2023). Text-based sarcasm detection on social networks: A systematic review. International Journal of Advanced Computer Science and Applications, 14(3).

Salas-Zárate, R., Alor-Hernández, G., Salas-Zárate, M. P., Paredes-Valverde, M. A., Bustos-López, M., & Sánchez-Cervantes, J. L. (2022). Detecting depression signs on social media: A systematic literature review. Healthcare, 10(2).

Ahmed, A., Aziz, S., Toro, C. T., Alzubaidi, M., Irshaidat, S., Serhan, H. A., & Househ, M. (2022). Machine learning models to detect anxiety and depression through social media: A scoping review. Computer Methods and Programs in Biomedicine Update, 2.

Bao, E., Pérez, A., & Parapar, J. (2025). ReDSM5: A Reddit dataset for DSM-5 depression detection. In Proceedings of the ACM International Conference on Information and Knowledge Management (pp. 6323-6327).

Boettcher, N. (2021). Studies of depression and anxiety using Reddit as a data source: Scoping review. JMIR Mental Health, 8(11).

Triantafyllopoulos, I., Paraskevopoulos, G., & Potamianos, A. (2023). Depression detection in social media posts using affective and social norm features. arXiv preprint arXiv:2303.14279.

Bokolo, B. G., & Liu, Q. (2023). Deep learning-based depression detection from social media: Comparative evaluation of ML and transformer techniques. Electronics, 12(21).

Teferra, B. G., Rueda, A., Pang, H., Valenzano, R., Samavi, R., Krishnan, S., & Bhat, V. (2024). Screening for depression using natural language processing: Literature review. Interactive Journal of Medical Research, 13(1).

Mao, H., & Han, Q. (2025). Applications of transformer-based language models for depression detection: A scoping review. Journal of Integrated Social Sciences and Humanities.

Ding, Z., Wang, Z., Zhang, Y., Cao, Y., Liu, Y., Shen, X., & Dai, J. (2025). Trade-offs between machine learning and deep learning for mental illness detection on social media. Scientific Reports, 15(1), 14497.

Kerasiotis, M., Ilias, L., & Askounis, D. (2024). Depression detection in social media posts using transformer-based models and auxiliary features. Social Network Analysis and Mining, 14(1), 196.

Qasim, A., Mehak, G., Hussain, N., Gelbukh, A., & Sidorov, G. (2025). Detection of depression severity in social media text using transformer-based models. Information, 16(2), 114.

Zhang, Z., Zhu, J., Guo, Z., Zhang, Y., Li, Z., & Hu, B. (2024). Natural language processing for depression prediction on Sina Weibo. JMIR Mental Health, 11.

Kuzmin, G., Strepetov, P., Stankevich, M., Chudova, N., Shelmanov, A., & Smirnov, I. (2025). Exploring large language models for detecting mental disorders. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (pp. 34523-34547).

Kowsari, K., Jafari Meimandi, K., Heidarysafa, M., Mendu, S., Barnes, L., & Brown, D. (2019). Text classification algorithms: A survey. Information, 10(4), 150.

Wang, M. (2023). Research on text classification method based on NLP. Advances in Computer, Signals and Systems, 7(2).

Downloads

Published

2026-06-29

Issue

Section

Research Articles