Data Analytics for Maize Price Prediction using Regression Algorithms

Authors

  • Pawita Jantankaew นิสิต หลักสูตรวิทยาศาสตรมหาบัณฑิต สาขาวิทยาการคอมพิวเตอร์ คณะวิทยาศาสตร์ มหาวิทยาลัยเกษตรศาสตร์
  • Nuanwan Soonthornphisaj รองศาสตราจารย์ ภาควิชาวิทยาการคอมพิวเตอร์ คณะวิทยาศาสตร์ มหาวิทยาลัยเกษตรศาสตร์

Keywords:

Regression algorithm, Maize price, Machine learning, Data Science

Abstract

This research aims for finding the regression model that can predict maize price sold by Thai farmers. Three regression models are explored which are multiple linear regression, Ridge regression and Lasso regression. These algorithms learn from dataset collected by office of agricultural economics from Jan 2002-May 2019. We propose two new features which are the rate of cassava price change from 1 month and the rate of maize price change from 1-4 months. We do statistical analysis to see the relationship between features. Performance of regression algorithms are measured in terms of R-squared, Root mean square error and Mean absolute error. The experimental results reveal that feature selection play an important role for multiple linear regression with the R-squared = 0.94. We found that multiple linear regression outperforms Ridge regression (R- Squared = 0.86) and Lasso regression (R- Squared = 0.86). The mean absolute error of multiple linear regression, Ridge regression and Lasso algorithm are 0.31, 0.42 and 0.43, respectively. The root mean square error of these three regression algorithms are 0.50, 0.58 and 0.58, respectively.

References

Ge Y, Wu H. Prediction of corn price fluctuation based on multiple linear regression analysis model under big data. Neural Comput & Applic. 2019; 32:16843–16855.

Ayush J, Marvaniya S, Godbole S, Munigala V. A framework for crop price forecasting in emerging economies by analyzing the quality of time-series data. arXiv. 2020:10.

Silva RF, Barreira BL, Cugnasca CE. Prediction of corn and sugar prices using machine Learning, econometrics, and ensemble models. MDPI. 2021; 9(31):1-4.

Ouyang H, Wei X, Wu Q. Agricultural commodity futures prices prediction via long- and short-term time series network. J Appl Econ. 2019; 22(1): 468 - 483.

Shahhosseini M, Hu G, Archontoulis S. Forecasting corn yield with machine learning ensembles. Front. Plant Sci. 2020;31.

Rencher AC, Christensen WF. Multivariate regression, Methods of Multivariate Analysis: 3 ed: John Wiley & Sons; 2012.

Tibshirani R. Regression shrinkage and selection via the Lasso. J Royal Stat Soc. 1996; 58(1): 267-288.

Hoerl AE, Kennard RW, Hoerl R. Practical use of ridge regression: A Challenge Met. Journal of the Royal Statistical Society. 1985; 34(2): 114-120.

Soonthornphisaj N. Machine learning. Bangkok: Kasetsart University; 2020.

Downloads

Published

2022-06-05

Issue

Section

บทความวิจัย