Application of Data Mining Techniques for Developing an Information System for Water Demand Forecasting
Main Article Content
Abstract
This study aims to compare the performance of water demand forecasting models in Thailand using data mining techniques under the CRISP-DM framework. The models include Linear Regression, Artificial Neural Network, Random Forest, and Gradient Boosted Trees. Monthly water consumption data from 2014 to 2023 were obtained from the national open data platform and divided into training, validation, and testing sets at a ratio of 80:10:10. RapidMiner was used for model development. Performance evaluation was conducted using Mean Absolute Error (MAE) and Root Mean Square Error (RMSE). The results show that Linear Regression achieved the best performance with an MAE of 188,158.96 m³ and an RMSE of 310,809.74 m³, which were lower than those of the other models and demonstrated high stability. A web-based prototype system was also developed to visualize forecasting results through dashboards, supporting water production planning and distribution management in alignment with Sustainable Development Goal 6.
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
I/we certify that I/we have participated sufficiently in the intellectual content, conception and design of this work or the analysis and interpretation of the data (when applicable), as well as the writing of the manuscript, to take public responsibility for it and have agreed to have my/our name listed as a contributor. I/we believe the manuscript represents valid work. Neither this manuscript nor one with substantially similar content under my/our authorship has been published or is being considered for publication elsewhere, except as described in the covering letter. I/we certify that all the data collected during the study is presented in this manuscript and no data from the study has been or will be published separately. I/we attest that, if requested by the editors, I/we will provide the data/information or will cooperate fully in obtaining and providing the data/information on which the manuscript is based, for examination by the editors or their assignees. Financial interests, direct or indirect, that exist or may be perceived to exist for individual contributors in connection with the content of this paper have been disclosed in the cover letter. Sources of outside support of the project are named in the cover letter.
I/We hereby transfer(s), assign(s), or otherwise convey(s) all copyright ownership, including any and all rights incidental thereto, exclusively to the Journal, in the event that such work is published by the Journal. The Journal shall own the work, including 1) copyright; 2) the right to grant permission to republish the article in whole or in part, with or without fee; 3) the right to produce preprints or reprints and translate into languages other than English for sale or free distribution; and 4) the right to republish the work in a collection of articles in any other mechanical or electronic format.
We give the rights to the corresponding author to make necessary changes as per the request of the journal, do the rest of the correspondence on our behalf and he/she will act as the guarantor for the manuscript on our behalf.
All persons who have made substantial contributions to the work reported in the manuscript, but who are not contributors, are named in the Acknowledgment and have given me/us their written permission to be named. If I/we do not include an Acknowledgment that means I/we have not received substantial contributions from non-contributors and no contributor has been omitted.
References
The Water Project, "Water scarcity and global water crisis," 2025. [Online]. Available: https://thewaterproject.org/water-scarcity-water-crisis
World Bank, "Climate change impact assessment on urban water demand," 2025. [Online]. Available: https://www.worldbank.org/en/topic/water
UNICEF Thailand, "Water, sanitation and climate resilience in Thailand," 2024. [Online]. Available: https://www.unicef.org/thailand/wash
Pollution Control Department, "Thailand environmental situation report," 2025. [Online]. Available: https://www.pcd.go.th
Mordor Intelligence, "Smart irrigation market – growth, trends and forecast," 2025. [Online]. Available: https://www.mordorintelligence.com/industry-reports/smart-irrigation-market
สำนักงานปลัดกระทรวงมหาดไทย, "สุนทรพจน์วันน้ำโลก ประจำปี พ.ศ. 2567," 2567. [ออนไลน์]. แหล่งที่มา: https://www.moi.go.th
I. I. Aminu, "State-of-art implementation of computing intelligent models for water demand modelling: A decade review and future direction," Journal of Water and Land Development, 2025. [Online]. Available: https://www.jwld.pl/files/2025-03-JWLD-22.pdf
J. Han, J. Pei, and M. Kamber, Data Mining: Concepts and Techniques, 4th ed. San Francisco, CA, USA: Morgan Kaufmann, 2022.
B. Marr, Data Strategy: How to Profit from a World of Big Data, Analytics and Artificial Intelligence. London, UK: Kogan Page, 2021.
Y. Zhang, X. Wang, J. Chen, and H. Li, "Application of data mining techniques in water resources management: A review," Environmental Modelling & Software, vol. 143, pp. 105104, 2021.
F. Martínez-Plumed, L. Contreras-Ochando, C. Ferri, J. Hernández-Orallo, M. Kull, N. Lachiche, M. J. Ramírez-Quintana, and P. Flach, "CRISP-DM twenty years later: From data mining processes to data science trajectories," IEEE Transactions on Knowledge and Data Engineering, vol. 33, no. 8, pp. 3048–3061, 2021.
T. H. Davenport and S. Miller, "Working with AI: Real stories of human-machine collaboration," MIT Sloan Management Review, 2020.
S. Raschka, Y. Liu, V. Mirjalili, and D. Dzhulgakov, Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2. Birmingham, UK: Packt Publishing, 2020.
A. Géron, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd ed. Sebastopol, CA, USA: O'Reilly Media, 2022.
C. J. Willmott and K. Matsuura, "Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance," Climate Research, vol. 30, pp. 79–82, 2021.
A. Antunes, J. P. Santos, and R. Martins, "Short-term water demand forecasting using machine learning techniques," Water Resources Management, vol. 35, no. 4, pp. 1327–1343, 2021.
W. Hao, A. Cominola, and A. Castelletti, "Comparing predictive machine learning models for short- and long-term urban water demand forecasting in Milan, Italy," IFAC-PapersOnLine, vol. 55, no. 33, pp. 92–98, 2022.
N. Fumo and M. R. Biswas, "Water demand forecasting using machine learning algorithms," Applied Water Science, vol. 13, no. 1, pp. 12, 2023.
Z. Pu, J. Yan, L. Chen, Z. Li, W. Tian, T. Tao, and K. Xin, "A hybrid Wavelet-CNN-LSTM deep learning model for short-term urban water demand forecasting," Frontiers of Environmental Science & Engineering, vol. 17, no. 2, pp. 22, 2023.
