Application of Data Mining Techniques for Developing an Information System for Water Demand Forecasting

Main Article Content

Haruethai Asakit
Patcharamai Saosueb

Abstract

This study aims to compare the performance of water demand forecasting models in Thailand using data mining techniques under the CRISP-DM framework. The models include Linear Regression, Artificial Neural Network, Random Forest, and Gradient Boosted Trees. Monthly water consumption data from 2014 to 2023 were obtained from the national open data platform and divided into training, validation, and testing sets at a ratio of 80:10:10. RapidMiner was used for model development. Performance evaluation was conducted using Mean Absolute Error (MAE) and Root Mean Square Error (RMSE). The results show that Linear Regression achieved the best performance with an MAE of 188,158.96 m³ and an RMSE of 310,809.74 m³, which were lower than those of the other models and demonstrated high stability. A web-based prototype system was also developed to visualize forecasting results through dashboards, supporting water production planning and distribution management in alignment with Sustainable Development Goal 6.

Article Details

How to Cite
[1]
H. Asakit and P. Saosueb, “Application of Data Mining Techniques for Developing an Information System for Water Demand Forecasting”, JIST, vol. 16, no. 1, pp. 85–100, Jun. 2026.
Section
Research Article: Information Systems (Detail in Scope of Journal)

References

The Water Project, "Water scarcity and global water crisis," 2025. [Online]. Available: https://thewaterproject.org/water-scarcity-water-crisis

World Bank, "Climate change impact assessment on urban water demand," 2025. [Online]. Available: https://www.worldbank.org/en/topic/water

UNICEF Thailand, "Water, sanitation and climate resilience in Thailand," 2024. [Online]. Available: https://www.unicef.org/thailand/wash

Pollution Control Department, "Thailand environmental situation report," 2025. [Online]. Available: https://www.pcd.go.th

Mordor Intelligence, "Smart irrigation market – growth, trends and forecast," 2025. [Online]. Available: https://www.mordorintelligence.com/industry-reports/smart-irrigation-market

สำนักงานปลัดกระทรวงมหาดไทย, "สุนทรพจน์วันน้ำโลก ประจำปี พ.ศ. 2567," 2567. [ออนไลน์]. แหล่งที่มา: https://www.moi.go.th

I. I. Aminu, "State-of-art implementation of computing intelligent models for water demand modelling: A decade review and future direction," Journal of Water and Land Development, 2025. [Online]. Available: https://www.jwld.pl/files/2025-03-JWLD-22.pdf

J. Han, J. Pei, and M. Kamber, Data Mining: Concepts and Techniques, 4th ed. San Francisco, CA, USA: Morgan Kaufmann, 2022.

B. Marr, Data Strategy: How to Profit from a World of Big Data, Analytics and Artificial Intelligence. London, UK: Kogan Page, 2021.

Y. Zhang, X. Wang, J. Chen, and H. Li, "Application of data mining techniques in water resources management: A review," Environmental Modelling & Software, vol. 143, pp. 105104, 2021.

F. Martínez-Plumed, L. Contreras-Ochando, C. Ferri, J. Hernández-Orallo, M. Kull, N. Lachiche, M. J. Ramírez-Quintana, and P. Flach, "CRISP-DM twenty years later: From data mining processes to data science trajectories," IEEE Transactions on Knowledge and Data Engineering, vol. 33, no. 8, pp. 3048–3061, 2021.

T. H. Davenport and S. Miller, "Working with AI: Real stories of human-machine collaboration," MIT Sloan Management Review, 2020.

S. Raschka, Y. Liu, V. Mirjalili, and D. Dzhulgakov, Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2. Birmingham, UK: Packt Publishing, 2020.

A. Géron, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd ed. Sebastopol, CA, USA: O'Reilly Media, 2022.

C. J. Willmott and K. Matsuura, "Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance," Climate Research, vol. 30, pp. 79–82, 2021.

A. Antunes, J. P. Santos, and R. Martins, "Short-term water demand forecasting using machine learning techniques," Water Resources Management, vol. 35, no. 4, pp. 1327–1343, 2021.

W. Hao, A. Cominola, and A. Castelletti, "Comparing predictive machine learning models for short- and long-term urban water demand forecasting in Milan, Italy," IFAC-PapersOnLine, vol. 55, no. 33, pp. 92–98, 2022.

N. Fumo and M. R. Biswas, "Water demand forecasting using machine learning algorithms," Applied Water Science, vol. 13, no. 1, pp. 12, 2023.

Z. Pu, J. Yan, L. Chen, Z. Li, W. Tian, T. Tao, and K. Xin, "A hybrid Wavelet-CNN-LSTM deep learning model for short-term urban water demand forecasting," Frontiers of Environmental Science & Engineering, vol. 17, no. 2, pp. 22, 2023.