A mean–variance optimization problem for discounted Markov decision processes

My bibliography Save this article

A mean–variance optimization problem for discounted Markov decision processes

Author

Listed:

Guo, Xianping
Ye, Liuer
Yin, George

Registered:

Abstract

In this paper, we consider a mean–variance optimization problem for Markov decision processes (MDPs) over the set of (deterministic stationary) policies. Different from the usual formulation in MDPs, we aim to obtain the mean–variance optimal policy that minimizes the variance over a set of all policies with a given expected reward. For continuous-time MDPs with the discounted criterion and finite-state and action spaces, we prove that the mean–variance optimization problem can be transformed to an equivalent discounted optimization problem using the conditional expectation and Markov properties. Then, we show that a mean–variance optimal policy and the efficient frontier can be obtained by policy iteration methods with a finite number of iterations. We also address related issues such as a mutual fund theorem and illustrate our results with an example.

Suggested Citation

Guo, Xianping & Ye, Liuer & Yin, George, 2012. "A mean–variance optimization problem for discounted Markov decision processes," European Journal of Operational Research, Elsevier, vol. 220(2), pages 423-429.

Handle: RePEc:eee:ejores:v:220:y:2012:i:2:p:423-429
DOI: 10.1016/j.ejor.2012.01.051

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Fu, Chenpeng & Lari-Lavassani, Ali & Li, Xun, 2010. "Dynamic mean-variance portfolio selection with borrowing constraint," European Journal of Operational Research, Elsevier, vol. 200(1), pages 312-319, January.
Richard Chen & Eugene Feinberg, 2007. "Non-randomized policies for constrained Markov decision processes," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 66(1), pages 165-179, August.
Jianming Xia & Jia‐An Yan, 2006. "Markowitz'S Portfolio Optimization In An Incomplete Market," Mathematical Finance, Wiley Blackwell, vol. 16(1), pages 203-216, January.
Christos H. Papadimitriou & John N. Tsitsiklis, 1987. "The Complexity of Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 12(3), pages 441-450, August.
Daniel S. Bernstein & Robert Givan & Neil Immerman & Shlomo Zilberstein, 2002. "The Complexity of Decentralized Control of Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 27(4), pages 819-840, November.
Lanlan Zhang & Xianping Guo, 2008. "Constrained continuous-time Markov decision processes with average criteria," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 67(2), pages 323-340, April.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Lord Mensah, 2016. "Asset Allocation Brewed Accross African Stock Markets," Proceedings of Economics and Finance Conferences 3205757, International Institute of Social and Economic Sciences.
Li Xia, 2020. "Risk‐Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance," Production and Operations Management, Production and Operations Management Society, vol. 29(12), pages 2808-2827, December.
Mannor, Shie & Tsitsiklis, John N., 2013. "Algorithmic aspects of mean–variance optimization in Markov decision processes," European Journal of Operational Research, Elsevier, vol. 231(3), pages 645-653.
Ma, Shuai & Ma, Xiaoteng & Xia, Li, 2023. "A unified algorithm framework for mean-variance optimization in discounted Markov decision processes," European Journal of Operational Research, Elsevier, vol. 311(3), pages 1057-1067.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Yao, Haixiang & Li, Zhongfei & Chen, Shumin, 2014. "Continuous-time mean–variance portfolio selection with only risky assets," Economic Modelling, Elsevier, vol. 36(C), pages 244-251.
Louis Anthony Cox, 2020. "Answerable and Unanswerable Questions in Risk Analysis with Open‐World Novelty," Risk Analysis, John Wiley & Sons, vol. 40(S1), pages 2144-2177, November.
Yuanyuan Zhang & Xiang Li & Sini Guo, 2018. "Portfolio selection problems with Markowitz’s mean–variance framework: a review of literature," Fuzzy Optimization and Decision Making, Springer, vol. 17(2), pages 125-158, June.
Zeng, Yan & Li, Zhongfei & Lai, Yongzeng, 2013. "Time-consistent investment and reinsurance strategies for mean–variance insurers with jumps," Insurance: Mathematics and Economics, Elsevier, vol. 52(3), pages 498-507.
Zhang, Miao & Chen, Ping, 2016. "Mean–variance asset–liability management under constant elasticity of variance process," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 11-18.
Liu, Yong-Jun & Zhang, Wei-Guo, 2015. "A multi-period fuzzy portfolio optimization model with minimum transaction lots," European Journal of Operational Research, Elsevier, vol. 242(3), pages 933-941.
Yanling Chang & Alan Erera & Chelsea White, 2015. "Value of information for a leader–follower partially observed Markov game," Annals of Operations Research, Springer, vol. 235(1), pages 129-153, December.
Alev{s} v{C}ern'y & Jan Kallsen, 2007. "On the Structure of General Mean-Variance Hedging Strategies," Papers 0708.1715, arXiv.org, revised Jul 2017.
Zhang, Miao & Chen, Ping & Yao, Haixiang, 2017. "Mean-variance portfolio selection with only risky assets under regime switching," Economic Modelling, Elsevier, vol. 62(C), pages 35-42.
Buckley, Winston & Long, Hongwei & Marshall, Mario, 2016. "Numerical approximations of optimal portfolios in mispriced asymmetric Lévy markets," European Journal of Operational Research, Elsevier, vol. 252(2), pages 676-686.
Daniel S. Bernstein & Robert Givan & Neil Immerman & Shlomo Zilberstein, 2002. "The Complexity of Decentralized Control of Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 27(4), pages 819-840, November.
Armando F. Mendoza-Pérez & Héctor Jasso-Fuentes & Omar A. De-la-Cruz Courtois, 2016. "Constrained Markov decision processes in Borel spaces: from discounted to average optimality," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 84(3), pages 489-525, December.
F. Cong & C. W. Oosterlee, 2017. "On Robust Multi-Period Pre-Commitment And Time-Consistent Mean-Variance Portfolio Optimization," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 20(07), pages 1-26, November.
Kraft, Holger & Steffensen, Mogens, 2012. "A dynamic programming approach to constrained portfolios," CFS Working Paper Series 2012/07, Center for Financial Studies (CFS).
Hao Zhang, 2010. "Partially Observable Markov Decision Processes: A Geometric Technique and Analysis," Operations Research, INFORMS, vol. 58(1), pages 214-228, February.
Martin Mundhenk, 2000. "The Complexity of Optimal Small Policies," Mathematics of Operations Research, INFORMS, vol. 25(1), pages 118-129, February.
Yu Wu & Bo Zeng & Siming Huang, 2019. "A Dynamic Strategy for Home Pick-Up Service with Uncertain Customer Requests and Its Implementation," Sustainability, MDPI, vol. 11(7), pages 1-21, April.
Bismut, Elizabeth & Straub, Daniel, 2021. "Optimal adaptive inspection and maintenance planning for deteriorating structural systems," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
Bren, Austin & Saghafian, Soroush, 2018. "Data-Driven Percentile Optimization for Multi-Class Queueing Systems with Model Ambiguity: Theory and Application," Working Paper Series rwp18-008, Harvard University, John F. Kennedy School of Government.
Buckley, Winston S. & Brown, Garfield O. & Marshall, Mario, 2012. "A mispricing model of stocks under asymmetric information," European Journal of Operational Research, Elsevier, vol. 221(3), pages 584-592.

More about this item

Keywords

Mean–variance criterion; Finite continuous-time MDPs; Discounted reward; Policy iteration algorithm; Efficient frontier;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:220:y:2012:i:2:p:423-429. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A mean–variance optimization problem for discounted Markov decision processes

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data