Optimizing Supply Chain Management with Reinforcement Learning: A Data-Driven Approach

Authors

  • Adi Purwanto Universitas Dian Nuswantoro
  • Siti Maesaroh Universitas Dian Nuswantoro
  • Agung Sulistyo Universitas Dian Nuswantoro

DOI:

https://doi.org/10.63876/ijtm.v1i3.113

Keywords:

Supply Chain Management, Reinforcement Learning, Data-Driven Optimization, Decision-Making, Logistics

Abstract

Effective supply chain management (SCM) is crucial for improving efficiency, reducing costs, and enhancing responsiveness in dynamic market conditions. Traditional SCM optimization methods often rely on static models that struggle to adapt to uncertainty and real-time changes. In this study, we propose a data-driven approach using reinforcement learning (RL) to optimize decision-making in SCM. By leveraging historical and real-time data, our RL model dynamically learns optimal inventory policies, demand forecasting strategies, and logistics planning to minimize costs and maximize service levels. We evaluate the performance of our approach through simulations and real-world case studies, demonstrating significant improvements over conventional optimization techniques. The results highlight the potential of RL in transforming SCM by enabling adaptive, intelligent decision-making in complex and uncertain environments.

Downloads

Download data is not yet available.

References

V. Sharma, R. D. Raut, M. Hajiaghaei-Keshteli, B. E. Narkhede, R. Gokhale, and P. Priyadarshinee, “Mediating effect of industry 4.0 technologies on the supply chain management practices and supply chain performance,” J. Environ. Manage., vol. 322, p. 115945, Nov. 2022, doi: https://doi.org/10.1016/j.jenvman.2022.115945.

C. L. B. Silveira, A. Tabares, L. T. Faria, and J. F. Franco, “Mathematical optimization versus Metaheuristic techniques: A performance comparison for reconfiguration of distribution systems,” Electr. Power Syst. Res., vol. 196, p. 107272, Jul. 2021, doi: https://doi.org/10.1016/j.epsr.2021.107272.

M. Pournader, H. Ghaderi, A. Hassanzadegan, and B. Fahimnia, “Artificial intelligence applications in supply chain management,” Int. J. Prod. Econ., vol. 241, p. 108250, Nov. 2021, doi: https://doi.org/10.1016/j.ijpe.2021.108250.

H. Lin, J. Lin, and F. Wang, “An innovative machine learning model for supply chain management,” J. Innov. Knowl., vol. 7, no. 4, p. 100276, Oct. 2022, doi: https://doi.org/10.1016/j.jik.2022.100276.

J. Karthika, M. Rajkumar, and S. Narendiran, “Performance enhancement of hybrid SCM/WDM system using ANN-trained Raman amplifier,” Mater. Today Proc., vol. 37, pp. 2529–2534, 2021, doi: https://doi.org/10.1016/j.matpr.2020.08.489.

P. Charoenkwan, S. Kanthawong, C. Nantasenamat, M. M. Hasan, and W. Shoombuatong, “iDPPIV-SCM: A Sequence-Based Predictor for Identifying and Analyzing Dipeptidyl Peptidase IV (DPP-IV) Inhibitory Peptides Using a Scoring Card Method,” J. Proteome Res., vol. 19, no. 10, pp. 4125–4136, Oct. 2020, doi: https://doi.org/10.1021/acs.jproteome.0c00590.

N. Nasurudeen Ahamed and P. Karthikeyan, “A Reinforcement Learning Integrated in Heuristic search method for self-driving vehicle using blockchain in supply chain management,” Int. J. Intell. Networks, vol. 1, pp. 92–101, 2020, doi: https://doi.org/10.1016/j.ijin.2020.09.001.

H. D. Perez, S. Amaran, E. Erisen, J. M. Wassick, and I. E. Grossmann, “Optimization of extended business processes in digital supply chains using mathematical programming,” Comput. Chem. Eng., vol. 152, p. 107323, Sep. 2021, doi: https://doi.org/10.1016/j.compchemeng.2021.107323.

L. Schewe, M. Schmidt, and D. Weninger, “A decomposition heuristic for mixed-integer supply chain problems,” Oper. Res. Lett., vol. 48, no. 3, pp. 225–232, May 2020, doi: https://doi.org/10.1016/j.orl.2020.02.006.

M. Tayyab and B. Sarkar, “An interactive fuzzy programming approach for a sustainable supplier selection under textile supply chain management,” Comput. Ind. Eng., vol. 155, p. 107164, May 2021, doi: https://doi.org/10.1016/j.cie.2021.107164.

L. Terrada, M. El Khaïli, and H. Ouajji, “Multi-Agents System Implementation for Supply Chain Management Making-Decision,” Procedia Comput. Sci., vol. 177, pp. 624–630, 2020, doi: https://doi.org/10.1016/j.procs.2020.10.089.

L. Li, Y. Wan, S. Chen, W. Tian, W. Long, and J. Song, “Prediction of optimal ranges of mix ratio of self-compacting mortars (SCMs) based on response surface method (RSM),” Constr. Build. Mater., vol. 319, p. 126043, Feb. 2022, doi: https://doi.org/10.1016/j.conbuildmat.2021.126043.

A. Karamchandani, S. K. Srivastava, and R. K. Srivastava, “Perception-based model for analyzing the impact of enterprise blockchain adoption on SCM in the Indian service industry,” Int. J. Inf. Manage., vol. 52, p. 102019, Jun. 2020, doi: https://doi.org/10.1016/j.ijinfomgt.2019.10.004.

A. Sheikh-Zadeh, M. D. Rossetti, and M. A. Scott, “Performance-based inventory classification methods for large-Scale multi-echelon replenishment systems,” Omega, vol. 101, p. 102276, Jun. 2021, doi: https://doi.org/10.1016/j.omega.2020.102276.

M. Massmann, M. Meyer, M. Frank, S. von Enzberg, A. Kühn, and R. Dumitrescu, “Method for data inventory and classification,” Procedia CIRP, vol. 93, pp. 234–239, 2020, doi: https://doi.org/10.1016/j.procir.2020.04.033.

K. P. Tran, H. Du Nguyen, and S. Thomassey, “Anomaly detection using Long Short Term Memory Networks and its applications in Supply Chain Management,” IFAC-PapersOnLine, vol. 52, no. 13, pp. 2408–2412, 2019, doi: https://doi.org/10.1016/j.ifacol.2019.11.567.

G. Du, Y. Zou, X. Zhang, T. Liu, J. Wu, and D. He, “Deep reinforcement learning based energy management for a hybrid electric vehicle,” Energy, vol. 201, p. 117591, Jun. 2020, doi: https://doi.org/10.1016/j.energy.2020.117591.

A. Notsu, K. Yasuda, S. Ubukata, and K. Honda, “Online state space generation by a growing self-organizing map and differential learning for reinforcement learning,” Appl. Soft Comput., vol. 97, p. 106723, Dec. 2020, doi: https://doi.org/10.1016/j.asoc.2020.106723.

Y. J. Yoo and J. T. Rhee, “An application of SCM-based logistics planning in the trade between South and North Korea,” Comput. Ind. Eng., vol. 43, no. 1–2, pp. 159–168, Jul. 2002, doi: https://doi.org/10.1016/S0360-8352(02)00073-6.

Y. Tang et al., “A Deep Q-Network based optimized modulation scheme for Dual-Active-Bridge converter to reduce the RMS current,” Energy Reports, vol. 6, pp. 1192–1198, Dec. 2020, doi: https://doi.org/10.1016/j.egyr.2020.11.055.

C.-Y. Tang, C.-H. Liu, W.-K. Chen, and S. D. You, “Implementing action mask in proximal policy optimization (PPO) algorithm,” ICT Express, vol. 6, no. 3, pp. 200–203, Sep. 2020, doi: https://doi.org/10.1016/j.icte.2020.05.003.

Downloads

Published

2022-12-26

How to Cite

Purwanto, A., Maesaroh, S., & Sulistyo, A. (2022). Optimizing Supply Chain Management with Reinforcement Learning: A Data-Driven Approach. International Journal of Technology and Modeling, 1(3), 93–105. https://doi.org/10.63876/ijtm.v1i3.113

Issue

Section

Articles