IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i3p393-d1326652.html
   My bibliography  Save this article

State-Space Compression for Efficient Policy Learning in Crude Oil Scheduling

Author

Listed:
  • Nan Ma

    (School of Information Science and Engineering, China University of Petroleum, Beijing 102249, China)

  • Hongqi Li

    (School of Information Science and Engineering, China University of Petroleum, Beijing 102249, China)

  • Hualin Liu

    (Petrochina Planning and Engineering Institute, Beijing 100083, China
    Key Laboratory of Oil & Gas Business Chain Optimization, CNPC, Beijing 100083, China)

Abstract

The imperative for swift and intelligent decision making in production scheduling has intensified in recent years. Deep reinforcement learning, akin to human cognitive processes, has heralded advancements in complex decision making and has found applicability in the production scheduling domain. Yet, its deployment in industrial settings is marred by large state spaces, protracted training times, and challenging convergence, necessitating a more efficacious approach. Addressing these concerns, this paper introduces an innovative, accelerated deep reinforcement learning framework—VSCS (Variational Autoencoder for State Compression in Soft Actor–Critic). The framework adeptly employs a variational autoencoder (VAE) to condense the expansive high-dimensional state space into a tractable low-dimensional feature space, subsequently leveraging these features to refine policy learning and augment the policy network’s performance and training efficacy. Furthermore, a novel methodology to ascertain the optimal dimensionality of these low-dimensional features is presented, integrating feature reconstruction similarity with visual analysis to facilitate informed dimensionality selection. This approach, rigorously validated within the realm of crude oil scheduling, demonstrates significant improvements over traditional methods. Notably, the convergence rate of the proposed VSCS method shows a remarkable increase of 77.5 % , coupled with an 89.3 % enhancement in the reward and punishment values. Furthermore, this method substantiates the robustness and appropriateness of the chosen feature dimensions.

Suggested Citation

  • Nan Ma & Hongqi Li & Hualin Liu, 2024. "State-Space Compression for Efficient Policy Learning in Crude Oil Scheduling," Mathematics, MDPI, vol. 12(3), pages 1-16, January.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:3:p:393-:d:1326652
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/3/393/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/3/393/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Andreas Kuhnle & Jan-Philipp Kaiser & Felix Theiß & Nicole Stricker & Gisela Lanza, 2021. "Designing an adaptive production control system using reinforcement learning," Journal of Intelligent Manufacturing, Springer, vol. 32(3), pages 855-876, March.
    2. Che, Gelegen & Zhang, Yanyan & Tang, Lixin & Zhao, Shengnan, 2023. "A deep reinforcement learning based multi-objective optimization for the scheduling of oxygen production system in integrated iron and steel plants," Applied Energy, Elsevier, vol. 345(C).
    3. Ana Esteso & David Peidro & Josefa Mula & Manuel Díaz-Madroñero, 2023. "Reinforcement learning applied to production planning and control," International Journal of Production Research, Taylor & Francis Journals, vol. 61(16), pages 5772-5789, August.
    4. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sebastian Mayer & Tobias Classen & Christian Endisch, 2021. "Modular production control using deep reinforcement learning: proximal policy optimization," Journal of Intelligent Manufacturing, Springer, vol. 32(8), pages 2335-2351, December.
    2. Konstantinos S. Boulas & Georgios D. Dounias & Chrissoleon T. Papadopoulos, 2023. "A hybrid evolutionary algorithm approach for estimating the throughput of short reliable approximately balanced production lines," Journal of Intelligent Manufacturing, Springer, vol. 34(2), pages 823-852, February.
    3. Nweye, Kingsley & Sankaranarayanan, Siva & Nagy, Zoltan, 2023. "MERLIN: Multi-agent offline and transfer learning for occupant-centric operation of grid-interactive communities," Applied Energy, Elsevier, vol. 346(C).
    4. Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
    5. Wang, Xianjia & Yang, Zhipeng & Liu, Yanli & Chen, Guici, 2023. "A reinforcement learning-based strategy updating model for the cooperative evolution," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 618(C).
    6. Avishkar Bhoopchand & Bethanie Brownfield & Adrian Collister & Agustin Dal Lago & Ashley Edwards & Richard Everett & Alexandre Fréchette & Yanko Gitahy Oliveira & Edward Hughes & Kory W. Mathewson & P, 2023. "Learning few-shot imitation as cultural transmission," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    7. Raeid Saqur, 2024. "What Teaches Robots to Walk, Teaches Them to Trade too -- Regime Adaptive Execution using Informed Data and LLMs," Papers 2406.15508, arXiv.org.
    8. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    9. Dmitrii Dobriborsci & Roman Zashchitin & Mikhail Kakanov & Wolfgang Aumer & Pavel Osinenko, 2024. "Predictive reinforcement learning: map-less navigation method for mobile robot," Journal of Intelligent Manufacturing, Springer, vol. 35(8), pages 4217-4232, December.
    10. Qian Zhang & Tianhao Li & Dengfeng Li & Wei Lu, 2024. "A goal-oriented reinforcement learning for optimal drug dosage control," Annals of Operations Research, Springer, vol. 338(2), pages 1403-1423, July.
    11. Ziqing Wang & Wenzhu Liao, 2024. "Smart scheduling of dynamic job shop based on discrete event simulation and deep reinforcement learning," Journal of Intelligent Manufacturing, Springer, vol. 35(6), pages 2593-2610, August.
    12. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    13. Hien Nguyen Ngoc & Ganix Lasa & Ion Iriarte, 2022. "Human-centred design in industry 4.0: case study review and opportunities for future research," Journal of Intelligent Manufacturing, Springer, vol. 33(1), pages 35-76, January.
    14. Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).
    15. Boian Lazov, 2023. "A Deep Reinforcement Learning Trader without Offline Training," Papers 2303.00356, arXiv.org.
    16. Alberto Loffredo & Marvin Carl May & Andrea Matta & Gisela Lanza, 2024. "Reinforcement learning for sustainability enhancement of production lines," Journal of Intelligent Manufacturing, Springer, vol. 35(8), pages 3775-3791, December.
    17. Qingyan Li & Tao Lin & Qianyi Yu & Hui Du & Jun Li & Xiyue Fu, 2023. "Review of Deep Reinforcement Learning and Its Application in Modern Renewable Power System Control," Energies, MDPI, vol. 16(10), pages 1-23, May.
    18. Michael Curry & Alexander Trott & Soham Phade & Yu Bai & Stephan Zheng, 2022. "Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning," Papers 2201.01163, arXiv.org, revised Feb 2022.
    19. Rodrick Wallace, 2022. "How AI founders on adversarial landscapes of fog and friction," The Journal of Defense Modeling and Simulation, , vol. 19(3), pages 519-538, July.
    20. Jinming Xu & Yuan Lin, 2024. "Energy Management for Hybrid Electric Vehicles Using Safe Hybrid-Action Reinforcement Learning," Mathematics, MDPI, vol. 12(5), pages 1-20, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:3:p:393-:d:1326652. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.