IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i11p1765-d1664719.html
   My bibliography  Save this article

Convex Optimization of Markov Decision Processes Based on Z Transform: A Theoretical Framework for Two-Space Decomposition and Linear Programming Reconstruction

Author

Listed:
  • Shiqing Qiu

    (School of Mathematical Sciences, Chengdu University of Technology, Chengdu 610059, China
    These authors contributed equally to this work.)

  • Haoyu Wang

    (School of Mathematical Sciences, Chengdu University of Technology, Chengdu 610059, China
    These authors contributed equally to this work.)

  • Yuxin Zhang

    (School of Business, Henan University, Zhengzhou 450001, China)

  • Zong Ke

    (Department of Statistics and Data Science, Faculty of Science, National University of Singapore, 21 Lower Kent Ridge Road, Singapore 119077, Singapore)

  • Zichao Li

    (Department of Management Science and Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada)

Abstract

This study establishes a novel mathematical framework for stochastic maintenance optimization in production systems by integrating Markov decision processes (MDPs) with convex programming theory. We develop a Z-transformation-based dual-space decomposition method to reconstruct MDPs into a solvable linear programming form, resolving the inherent instability of traditional models caused by uncertain initial conditions and non-stationary state transitions. The proposed approach introduces three mathematical innovations: (i) a spectral clustering mechanism that reduces state-space dimensionality while preserving Markovian properties, (ii) a Lagrangian dual formulation with adaptive penalty functions to handle operational constraints, and (iii) a warm start algorithm accelerating convergence in high-dimensional convex optimization. Theoretical analysis proves that the derived policy achieves stability in probabilistic transitions through martingale convergence arguments, demonstrating structural invariance to initial distributions. Experimental validations on production processes reveal that our model reduces long-term maintenance costs by 36.17% compared to Monte Carlo simulations (1500 vs. 2350 average cost) and improves computational efficiency by 14.29% over Q-learning methods. Sensitivity analyses confirm robustness across Weibull-distributed failure regimes (shape parameter β ∈ [1.2, 4.8]) and varying resource constraints.

Suggested Citation

  • Shiqing Qiu & Haoyu Wang & Yuxin Zhang & Zong Ke & Zichao Li, 2025. "Convex Optimization of Markov Decision Processes Based on Z Transform: A Theoretical Framework for Two-Space Decomposition and Linear Programming Reconstruction," Mathematics, MDPI, vol. 13(11), pages 1-27, May.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:11:p:1765-:d:1664719
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/11/1765/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/11/1765/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Huibing Hao & Chunping Li & Antonio Forcina, 2022. "Reliability Modeling and Evaluation for Complex Systems Subject to New Dependent Competing Failure Process," Mathematical Problems in Engineering, Hindawi, vol. 2022, pages 1-17, August.
    2. András Zempléni & Miklós Véber & Belmiro Duarte & Pedro Saraiva, 2004. "Control charts: a cost‐optimization approach for processes with random shifts," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 20(3), pages 185-200, July.
    3. Castro, I.T. & Pérez-Ocón, R., 2006. "Reward optimization of a repairable system," Reliability Engineering and System Safety, Elsevier, vol. 91(3), pages 311-319.
    4. Guo, Chunhui & Liang, Zhenglin, 2022. "A predictive Markov decision process for optimizing inspection and maintenance strategies of partially observable multi-state systems," Reliability Engineering and System Safety, Elsevier, vol. 226(C).
    5. Deep, Akash & Zhou, Shiyu & Veeramani, Dharmaraj & Chen, Yong, 2023. "Partially observable Markov decision process-based optimal maintenance planning with time-dependent observations," European Journal of Operational Research, Elsevier, vol. 311(2), pages 533-544.
    6. Subrata Golui & Chandan Pal, 2022. "Risk-sensitive discounted cost criterion for continuous-time Markov decision processes on a general state space," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 95(2), pages 219-247, April.
    7. Huanyong Zhang & Ningshu Li & Jinghan Lin, 2024. "Modeling the Decision and Coordination Mechanism of Power Battery Closed-Loop Supply Chain Using Markov Decision Processes," Sustainability, MDPI, vol. 16(11), pages 1-19, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Finkelstein, Maxim & Cha, Ji Hwan & Langston, Amy, 2023. "Improving classical optimal age-replacement policies for degrading items," Reliability Engineering and System Safety, Elsevier, vol. 236(C).
    2. Balázs Dobi & András Zempléni, 2022. "Markovchart: an R package for cost-optimal patient monitoring and treatment using control charts," Computational Statistics, Springer, vol. 37(4), pages 1653-1693, September.
    3. Yu, Miaomiao & Tang, Yinghui & Liu, Liping & Cheng, Jiang, 2013. "A phase-type geometric process repair model with spare device procurement and repairman’s multiple vacations," European Journal of Operational Research, Elsevier, vol. 225(2), pages 310-323.
    4. Sarada, Y. & Shenbagam, R., 2021. "Optimization of a repairable deteriorating system subject to random threshold failure using preventive repair and stochastic lead time," Reliability Engineering and System Safety, Elsevier, vol. 205(C).
    5. Karabağ, Oktay & Bulut, Önder & Toy, Ayhan Özgür & Fadıloğlu, Mehmet Murat, 2024. "An efficient procedure for optimal maintenance intervention in partially observable multi-component systems," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
    6. Leung, Kit Nam Francis & Zhang, Yuan Lin & Lai, Kin Keung, 2011. "Analysis for a two-dissimilar-component cold standby repairable system with repair priority," Reliability Engineering and System Safety, Elsevier, vol. 96(11), pages 1542-1551.
    7. Chen, Jinyuan & Li, Zehui, 2008. "An extended extreme shock maintenance model for a deteriorating system," Reliability Engineering and System Safety, Elsevier, vol. 93(8), pages 1123-1129.
    8. Gámiz, M.L. & Navas-Gómez, F. & Raya-Miranda, R. & Segovia-García, M.C., 2023. "Dynamic reliability and sensitivity analysis based on HMM models with Markovian signal process," Reliability Engineering and System Safety, Elsevier, vol. 239(C).
    9. Miaomiao Yu & Yinghui Tang, 2017. "Optimal replacement policy based on maximum repair time for a random shock and wear model," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 80-94, April.
    10. Subrata Golui & Chandan Pal & R. Manikandan & Abhay Sobhanan, 2024. "Optimal control of a dynamic production-inventory system with various cost criteria," Annals of Operations Research, Springer, vol. 337(1), pages 75-103, June.
    11. Arnold, Richard & Chukova, Stefanka & Hayakawa, Yu & Marshall, Sarah, 2020. "Geometric-Like Processes: An Overview and Some Reliability Applications," Reliability Engineering and System Safety, Elsevier, vol. 201(C).
    12. Zhang, Yuan Lin & Wang, Guan Jun, 2009. "A geometric process repair model for a repairable cold standby system with priority in use and repair," Reliability Engineering and System Safety, Elsevier, vol. 94(11), pages 1782-1787.
    13. Xu, Jianyu & Liu, Bin & Zhao, Xiujie & Wang, Xiao-Lin, 2024. "Online reinforcement learning for condition-based group maintenance using factored Markov decision processes," European Journal of Operational Research, Elsevier, vol. 315(1), pages 176-190.
    14. Javid, Y., 2025. "Efficient risk-based inspection framework: Balancing safety and budgetary constraints," Reliability Engineering and System Safety, Elsevier, vol. 253(C).
    15. Cha, Ji Hwan & Finkelstein, Maxim, 2024. "Preventive maintenance for the constrained multi-attempt minimal repair," Reliability Engineering and System Safety, Elsevier, vol. 243(C).
    16. Sun, Qin & Li, Hongxu & Zhong, Yuanfu & Ren, Kezhou & Zhang, Yingchao, 2024. "Deep reinforcement learning-based resilience enhancement strategy of unmanned weapon system-of-systems under inevitable interferences," Reliability Engineering and System Safety, Elsevier, vol. 242(C).
    17. Ning, Ru & Wang, Xiaoyue & Zhao, Xian & Li, Ziyue, 2024. "Joint optimization of preventive maintenance and triggering mechanism for k-out-of-n: F systems with protective devices based on periodic inspection," Reliability Engineering and System Safety, Elsevier, vol. 251(C).
    18. Arts, Joachim & Boute, Robert N. & Loeys, Stijn & van Staden, Heletjé E., 2025. "Fifty years of maintenance optimization: Reflections and perspectives," European Journal of Operational Research, Elsevier, vol. 322(3), pages 725-739.
    19. Finkelstein, Maxim & Cha, Ji Hwan & Bedford, Tim, 2023. "Optimal preventive maintenance strategy for populations of systems that generate outputs," Reliability Engineering and System Safety, Elsevier, vol. 237(C).
    20. Guan Jun Wang & Yuan Lin Zhang, 2016. "Optimal replacement policy for a two-dissimilar-component cold standby system with different repair actions," International Journal of Systems Science, Taylor & Francis Journals, vol. 47(5), pages 1021-1031, April.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:11:p:1765-:d:1664719. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.