Production Planning for Pineapple Canned with Reinforcement Learning

ผู้แต่ง

  • Payungsak Klasantia Department of Industrial Technology Management, Faculty of Industrial Technology, Muban Chombueng Rajabhat University
  • Noppadol Amm-Dee Department of Industrial Technology Management, Faculty of Industrial Technology, Muban Chombueng Rajabhat University
  • Chidchanok Choksuchat Division of Computational Science, Faculty of Science, Prince of Songkla University

คำสำคัญ:

Production scheduling, reinforcement learning, Markov Decision Process, production planning, machine learning

บทคัดย่อ

Pineapple is a significant economic crop in Thailand, and the pineapple processing industry is crucial for farmers, manufacturers, and customers. Production planning is challenging due to the complexity of customer demands and the increasing uncertainty of fresh pineapple yields. The researcher studied and experimented with algorithms for production planning for canned pineapple using reinforcement learning. The objective was to optimize production planning by finding the best values for just-in-time scheduling through reinforcement learning, based on the Markov Decision Process (MDP) used for sequential decision-making. After analyzing the problem and recognizing that production planning has such characteristics, the researcher proceeded as follows: 1) Designed a dataset from case study data and defined the objective function. 2) Developed a reinforcement learning model using the Advanced Actor Critic (A2C) algorithm to create the production plan for the case study. 3) Tuned the model's parameters, trained the model, and tested it. 4) Evaluated the model and found that the reward or the defined objective function increased by at least 40% compared to the initial model. Additionally, the sellable products' readiness improved by 120%, and the discrepancy between expected and actual returns decreased by 19% compared to the initial machine learning model.

References

Food and Agriculture Organization of the United Nations. FAOSTAT [Internet]. 2023 [cited 2023 May 6]. Available from: https://www.fao.org/faostat/en/#data/WCAD.

Tantipipaphong K. Guidelines for promoting agricultural product processing for export. Thesis, National Defense Course, National Defense College. Thailand; 2020.

Saengchan S. Development of Marketing Strategy of Thai Canned Pineapple Exporters. Master of Business Administration Thesis, Burapha University College of Commerce. Thailand; 2017.

Chantaros P, Klaychey R, Limpianchob C. Optimization of a Sustainable Supply Chain Planning System for Cultivated Banana Production with a Mixed-integer Linear Programming Approach. TJOR 2023;11(1):21-3.

Georgiadis GP, Mario Pampin B, Cabo DA, Georgiadis MC. Optimal production scheduling of food process industries. Comput Chem Eng 2020;134:106682.

Hubbs CD, Li C, Sahinidis NV, Grossmann IE, Wassick JM. A deep reinforcement learning approach for chemical production scheduling. Comput Chem Eng 2020;141:106982.

Kumar A, Dimitrakopoulos R, Maulen M. Adaptive self-learning mechanisms for updating short-term production decisions in an industrial mining complex. J Intell Manuf 2020;31(6):1795–1811.

Guo F, Li Y, Liu A, Liu Z. A reinforcement learning method to scheduling problem of steel production process. J Phys Conf Ser 2020;1486:072035.

Woo JH, Kim B, Ju S, Cho YI. Automation of load balancing for Gantt planning using reinforcement learning. Eng Appl Artif Intell. 2021;101:104226.

Kemmer L, Kleist H, Rochebouet D, Tziortziotis N, Read J. Reinforcement learning for supply chain optimization. EWRL 2018;14(10):1-9.

Schwung D, Schwung A, Ding SX. Actor-critic Reinforcement Learing for Energy Optimization in Hybrid Production Environment. IJC 2019; 18(4):360-71.

Geevers K. Deep Reinforcement Learning in Inventory Management. Master Thesis, Industrial Engineering and Management, University of Twente. Netherlands; 2020.

Bellman R. A Markovian decision process. J Math Mech. 1957;6(5):679-84.

Oroojlooyjadid A, Nazari M, Snyder L, Takáč M. A Deep Q-Network for the Beer Game: A Deep Reinforcement Learning algorithm to Solve Inventory Optimization Problems [Internet]. 2020 [cited 2024 Nov 01]. Available from: https://arxiv.org/abs/1708.05924.

Kuhnle A, Kaiser JP, Theiß F, Stricker N, Lanza G. Designing an adaptive production control system using reinforcement learning. J Intell Manuf. 2021 ;32(3):855-76.

Martínez JY, Coto Palacio J, Nowé A. Multi-agent reinforcement learning tool for job shop scheduling problems. Commun Comput Inf Sci 2020;1173:3-12.

Lang S, Behrendt F, Lanzerath N, Reggelin T, Müller M. Integration of deep reinforcement learning and discrete-event simulation for real-time scheduling of a flexible job shop production. Proc 2020 Winter Simul Conf (WSC) 2020;3057-68.

Chang J, Yu D, Hu Y, He W, Yu H. Deep reinforcement learning for dynamic flexible job shop scheduling with random job arrival. Processes 2022;10(4):760.

Romero-Hdz J, Saha BN, Tsutsumi S, Fincato R. Incorporating domain knowledge into reinforcement learning to expedite welding sequence optimization. Eng Appl Artif Intell 2020;91:103612.

Modrak V, Sudhakarapandian R, Balamurugan A, Soltysova Z. A review on reinforcement learning in production scheduling: an inferential perspective. Algorithms 2024;17(8):343.

Downloads

เผยแพร่แล้ว

2025-03-04

How to Cite

Klasantia, P., Amm-Dee, N. ., & Choksuchat, C. . (2025). Production Planning for Pineapple Canned with Reinforcement Learning. วารสารวิทยาศาสตร์และเทคโนโลยี หัวเฉียวเฉลิมพระเกียรติ, 11(1), 40–52. สืบค้น จาก https://ph02.tci-thaijo.org/index.php/scihcu/article/view/255855