Convex Optimization of Markov Decision Processes Based on Z Transform: A Theoretical Framework for Two-Space Decomposition and Linear Programming Reconstruction

My bibliography Save this article

Convex Optimization of Markov Decision Processes Based on Z Transform: A Theoretical Framework for Two-Space Decomposition and Linear Programming Reconstruction

Author

Listed:

Shiqing Qiu
(School of Mathematical Sciences, Chengdu University of Technology, Chengdu 610059, China
These authors contributed equally to this work.)
Haoyu Wang
(School of Mathematical Sciences, Chengdu University of Technology, Chengdu 610059, China
These authors contributed equally to this work.)
Yuxin Zhang
(School of Business, Henan University, Zhengzhou 450001, China)
Zong Ke
(Department of Statistics and Data Science, Faculty of Science, National University of Singapore, 21 Lower Kent Ridge Road, Singapore 119077, Singapore)
Zichao Li
(Department of Management Science and Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada)

Registered:

Abstract

This study establishes a novel mathematical framework for stochastic maintenance optimization in production systems by integrating Markov decision processes (MDPs) with convex programming theory. We develop a Z-transformation-based dual-space decomposition method to reconstruct MDPs into a solvable linear programming form, resolving the inherent instability of traditional models caused by uncertain initial conditions and non-stationary state transitions. The proposed approach introduces three mathematical innovations: (i) a spectral clustering mechanism that reduces state-space dimensionality while preserving Markovian properties, (ii) a Lagrangian dual formulation with adaptive penalty functions to handle operational constraints, and (iii) a warm start algorithm accelerating convergence in high-dimensional convex optimization. Theoretical analysis proves that the derived policy achieves stability in probabilistic transitions through martingale convergence arguments, demonstrating structural invariance to initial distributions. Experimental validations on production processes reveal that our model reduces long-term maintenance costs by 36.17% compared to Monte Carlo simulations (1500 vs. 2350 average cost) and improves computational efficiency by 14.29% over Q-learning methods. Sensitivity analyses confirm robustness across Weibull-distributed failure regimes (shape parameter β ∈ [1.2, 4.8]) and varying resource constraints.

Suggested Citation

Shiqing Qiu & Haoyu Wang & Yuxin Zhang & Zong Ke & Zichao Li, 2025. "Convex Optimization of Markov Decision Processes Based on Z Transform: A Theoretical Framework for Two-Space Decomposition and Linear Programming Reconstruction," Mathematics, MDPI, vol. 13(11), pages 1-27, May.

Handle: RePEc:gam:jmathe:v:13:y:2025:i:11:p:1765-:d:1664719

Download full text from publisher

References listed on IDEAS

Castro, I.T. & PÃ©rez-OcÃ³n, R., 2006. "Reward optimization of a repairable system," Reliability Engineering and System Safety, Elsevier, vol. 91(3), pages 311-319.
Subrata Golui & Chandan Pal, 2022. "Risk-sensitive discounted cost criterion for continuous-time Markov decision processes on a general state space," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 95(2), pages 219-247, April.
Huibing Hao & Chunping Li & Antonio Forcina, 2022. "Reliability Modeling and Evaluation for Complex Systems Subject to New Dependent Competing Failure Process," Mathematical Problems in Engineering, Hindawi, vol. 2022, pages 1-17, August.
Guo, Chunhui & Liang, Zhenglin, 2022. "A predictive Markov decision process for optimizing inspection and maintenance strategies of partially observable multi-state systems," Reliability Engineering and System Safety, Elsevier, vol. 226(C).
Deep, Akash & Zhou, Shiyu & Veeramani, Dharmaraj & Chen, Yong, 2023. "Partially observable Markov decision process-based optimal maintenance planning with time-dependent observations," European Journal of Operational Research, Elsevier, vol. 311(2), pages 533-544.
Huanyong Zhang & Ningshu Li & Jinghan Lin, 2024. "Modeling the Decision and Coordination Mechanism of Power Battery Closed-Loop Supply Chain Using Markov Decision Processes," Sustainability, MDPI, vol. 16(11), pages 1-19, May.
András Zempléni & Miklós Véber & Belmiro Duarte & Pedro Saraiva, 2004. "Control charts: a cost‐optimization approach for processes with random shifts," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 20(3), pages 185-200, July.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Finkelstein, Maxim & Cha, Ji Hwan & Langston, Amy, 2023. "Improving classical optimal age-replacement policies for degrading items," Reliability Engineering and System Safety, Elsevier, vol. 236(C).
Javid, Y., 2025. "Efficient risk-based inspection framework: Balancing safety and budgetary constraints," Reliability Engineering and System Safety, Elsevier, vol. 253(C).
Cha, Ji Hwan & Finkelstein, Maxim, 2024. "Preventive maintenance for the constrained multi-attempt minimal repair," Reliability Engineering and System Safety, Elsevier, vol. 243(C).
Sun, Qin & Li, Hongxu & Zhong, Yuanfu & Ren, Kezhou & Zhang, Yingchao, 2024. "Deep reinforcement learning-based resilience enhancement strategy of unmanned weapon system-of-systems under inevitable interferences," Reliability Engineering and System Safety, Elsevier, vol. 242(C).
Balázs Dobi & András Zempléni, 2022. "Markovchart: an R package for cost-optimal patient monitoring and treatment using control charts," Computational Statistics, Springer, vol. 37(4), pages 1653-1693, September.
Ning, Ru & Wang, Xiaoyue & Zhao, Xian & Li, Ziyue, 2024. "Joint optimization of preventive maintenance and triggering mechanism for k-out-of-n: F systems with protective devices based on periodic inspection," Reliability Engineering and System Safety, Elsevier, vol. 251(C).
Arts, Joachim & Boute, Robert N. & Loeys, Stijn & van Staden, Heletjé E., 2025. "Fifty years of maintenance optimization: Reflections and perspectives," European Journal of Operational Research, Elsevier, vol. 322(3), pages 725-739.
Finkelstein, Maxim & Cha, Ji Hwan & Bedford, Tim, 2023. "Optimal preventive maintenance strategy for populations of systems that generate outputs," Reliability Engineering and System Safety, Elsevier, vol. 237(C).
Guan Jun Wang & Yuan Lin Zhang, 2016. "Optimal replacement policy for a two-dissimilar-component cold standby system with different repair actions," International Journal of Systems Science, Taylor & Francis Journals, vol. 47(5), pages 1021-1031, April.
Yang, Li & Zhou, Shihan & Ma, Xiaobing & Chen, Yi & Jia, Heping & Dai, Wei, 2024. "Group machinery intelligent maintenance: Adaptive health prediction and global dynamic maintenance decision-making," Reliability Engineering and System Safety, Elsevier, vol. 252(C).
Yu, Miaomiao & Tang, Yinghui & Liu, Liping & Cheng, Jiang, 2013. "A phase-type geometric process repair model with spare device procurement and repairman’s multiple vacations," European Journal of Operational Research, Elsevier, vol. 225(2), pages 310-323.
Sarada, Y. & Shenbagam, R., 2021. "Optimization of a repairable deteriorating system subject to random threshold failure using preventive repair and stochastic lead time," Reliability Engineering and System Safety, Elsevier, vol. 205(C).
Levitin, Gregory & Xing, Liudong & Dai, Yuanshun, 2015. "Optimal loading of system with random repair time," European Journal of Operational Research, Elsevier, vol. 247(1), pages 137-143.
Levitin, Gregory & Xing, Liudong & Dai, Yuanshun, 2015. "Optimal backup frequency in system with random repair time," Reliability Engineering and System Safety, Elsevier, vol. 144(C), pages 12-22.
KarabaÄŸ, Oktay & Bulut, Ã–nder & Toy, Ayhan Ã–zgÃ¼r & FadÄ±loÄŸlu, Mehmet Murat, 2024. "An efficient procedure for optimal maintenance intervention in partially observable multi-component systems," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
Leung, Kit Nam Francis & Zhang, Yuan Lin & Lai, Kin Keung, 2011. "Analysis for a two-dissimilar-component cold standby repairable system with repair priority," Reliability Engineering and System Safety, Elsevier, vol. 96(11), pages 1542-1551.
Chen, Jinyuan & Li, Zehui, 2008. "An extended extreme shock maintenance model for a deteriorating system," Reliability Engineering and System Safety, Elsevier, vol. 93(8), pages 1123-1129.
Wang, Siqi & Zhao, Xian & Wu, Congshan & Wang, Xiaoyue, 2023. "Joint optimization of multi-stage component reassignment and preventive maintenance for balanced systems considering imperfect maintenance," Reliability Engineering and System Safety, Elsevier, vol. 237(C).
GÃ¡miz, M.L. & Navas-GÃ³mez, F. & Raya-Miranda, R. & Segovia-GarcÃa, M.C., 2023. "Dynamic reliability and sensitivity analysis based on HMM models with Markovian signal process," Reliability Engineering and System Safety, Elsevier, vol. 239(C).
Miaomiao Yu & Yinghui Tang, 2017. "Optimal replacement policy based on maximum repair time for a random shock and wear model," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 80-94, April.

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:11:p:1765-:d:1664719. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Convex Optimization of Markov Decision Processes Based on Z Transform: A Theoretical Framework for Two-Space Decomposition and Linear Programming Reconstruction

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data