A Compromise Programming Approach To Multiobjective Markov Decision Processes

My bibliography Save this article

A Compromise Programming Approach To Multiobjective Markov Decision Processes

Author

Listed:

WLODZIMIERZ OGRYCZAK
(ICCE, Warsaw University of Technology, Poland)
PATRICE PERNY
(LIP6, University Pierre and Marie Curie, Paris, France)
PAUL WENG
(LIP6, University Pierre and Marie Curie, Paris, France)

Registered:

Wlodzimierz Ogryczak

Abstract

A Markov decision process (MDP) is a general model for solving planning problems under uncertainty. It has been extended to multiobjective MDP to address multicriteria or multiagent problems in which the value of a decision must be evaluated according to several viewpoints, sometimes conflicting. Although most of the studies concentrate on the determination of the set of Pareto-optimal policies, we focus here on a more specialized problem that concerns the direct determination of policies achieving well-balanced tradeoffs. To this end, we introduce a reference point method based on the optimization of a weighted ordered weighted average (WOWA) of individual disachievements. We show that the resulting notion of optimal policy does not satisfy the Bellman principle and depends on the initial state. To overcome these difficulties, we propose a solution method based on a linear programming (LP) reformulation of the problem. Finally, we illustrate the feasibility of the proposed method on two types of planning problems under uncertainty arising in navigation of an autonomous agent and in inventory management.

Suggested Citation

Wlodzimierz Ogryczak & Patrice Perny & Paul Weng, 2013. "A Compromise Programming Approach To Multiobjective Markov Decision Processes," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 12(05), pages 1021-1053.

Handle: RePEc:wsi:ijitdm:v:12:y:2013:i:05:n:s0219622013400075
DOI: 10.1142/S0219622013400075

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

JosÉ Figueira & Salvatore Greco & Matthias Ehrogott, 2005. "Multiple Criteria Decision Analysis: State of the Art Surveys," International Series in Operations Research and Management Science, Springer, number 978-0-387-23081-8, June.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Fancello, Giovanna & Tsoukiàs, Alexis, 2021. "Learning urban capabilities from behaviours. A focus on visitors values for urban planning," Socio-Economic Planning Sciences, Elsevier, vol. 76(C).
Bana e Costa, Carlos A. & Oliveira, Carlos S. & Vieira, Victor, 2008. "Prioritization of bridges and tunnels in earthquake risk mitigation using multicriteria decision analysis: Application to Lisbon," Omega, Elsevier, vol. 36(3), pages 442-450, June.
Denys Yemshanov & Frank H. Koch & Yakov Ben‐Haim & Marla Downing & Frank Sapio & Marty Siltanen, 2013. "A New Multicriteria Risk Mapping Approach Based on a Multiattribute Frontier Concept," Risk Analysis, John Wiley & Sons, vol. 33(9), pages 1694-1709, September.
Corrente, Salvatore & Figueira, José Rui & Greco, Salvatore, 2014. "The SMAA-PROMETHEE method," European Journal of Operational Research, Elsevier, vol. 239(2), pages 514-522.
Comino, E. & Ferretti, V., 2016. "Indicators-based spatial SWOT analysis: supporting the strategic planning and management of complex territorial systems," LSE Research Online Documents on Economics 64142, London School of Economics and Political Science, LSE Library.
Kaveh Madani & Laura Read & Laleh Shalikarian, 2014. "Voting Under Uncertainty: A Stochastic Framework for Analyzing Group Decision Making Problems," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 28(7), pages 1839-1856, May.
Kadziński, MiŁosz & Greco, Salvatore & SŁowiński, Roman, 2012. "Extreme ranking analysis in robust ordinal regression," Omega, Elsevier, vol. 40(4), pages 488-501.
Haurant, P. & Oberti, P. & Muselli, M., 2011. "Multicriteria selection aiding related to photovoltaic plants on farming fields on Corsica island: A real case study using the ELECTRE outranking framework," Energy Policy, Elsevier, vol. 39(2), pages 676-688, February.
Growiec, Jakub, 2018. "Factor-specific technology choice," Journal of Mathematical Economics, Elsevier, vol. 77(C), pages 1-14.
- Jakub Growiec, 2017. "Factor-Specific Technology Choice," EcoMod2017 10240, EcoMod.
- Jakub Growiec, 2017. "Factor-specific technology choice," NBP Working Papers 265, Narodowy Bank Polski.
JosÃ© M. MerigÃ³ & Anna M. Gil-Lafuente & Daniel Palacios-MarquÃ©s, 2014. "A new method for fuzzy decision making under risk and uncertainty," International Journal of Business Continuity and Risk Management, Inderscience Enterprises Ltd, vol. 5(1), pages 29-42.
Franceschini, Fiorenzo & Maisano, Domenico, 2015. "Checking the consistency of the solution in ordinal semi-democratic decision-making problems," Omega, Elsevier, vol. 57(PB), pages 188-195.
Bouyssou, Denis & Marchant, Thierry, 2007. "An axiomatic approach to noncompensatory sorting methods in MCDM, II: More than two categories," European Journal of Operational Research, Elsevier, vol. 178(1), pages 246-276, April.
- Denis Bouyssou & Thierry Marchant, 2007. "An axiomatic approach to noncompensatory sorting methods in MCDM, II: More than two categories," Post-Print hal-02361918, HAL.
Grabisch, Michel & Kojadinovic, Ivan & Meyer, Patrick, 2008. "A review of methods for capacity identification in Choquet integral based multi-attribute utility theory: Applications of the Kappalab R package," European Journal of Operational Research, Elsevier, vol. 186(2), pages 766-785, April.
- Michel Grabisch & Ivan Kojadinovic & Patrick Meyer, 2008. "A review of methods for capacity identification in Choquet integral based multi-attribute utility theory: Applications of the Kappalab R package," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-00187175, HAL.
- Michel Grabisch & Ivan Kojadinovic & Patrick Meyer, 2008. "A review of methods for capacity identification in Choquet integral based multi-attribute utility theory: Applications of the Kappalab R package," Post-Print halshs-00187175, HAL.
Pablo Aragonés‐Beltrán & Mª. Carmen González‐Cruz & Astrid León‐Camargo & Rosario Viñoles‐Cebolla, 2023. "Assessment of regional development needs according to criteria based on the Sustainable Development Goals in the Meta Region (Colombia)," Sustainable Development, John Wiley & Sons, Ltd., vol. 31(2), pages 1101-1121, April.
Boris Yatsalo & Sergey Gritsyuk & Terry Sullivan & Benjamin Trump & Igor Linkov, 2016. "Multi-criteria risk management with the use of DecernsMCDA: methods and case studies," Environment Systems and Decisions, Springer, vol. 36(3), pages 266-276, September.
Juliana Martins Ruzante & Valerie J. Davidson & Julie Caswell & Aamir Fazil & John A. L. Cranfield & Spencer J. Henson & Sven M. Anders & Claudia Schmidt & Jeffrey M. Farber, 2010. "A Multifactorial Risk Prioritization Framework for Foodborne Pathogens," Risk Analysis, John Wiley & Sons, vol. 30(5), pages 724-742, May.
- Spencer J. Henson & Julie Caswell & John A. L. Cranfield & Aamir Frazil & Valerie J. Davidson & Sven M. Anders & Claudia Schmidt, 2007. "A Multi-Factorial Risk Prioritization Framework for Food-Borne Pathogens," Working Papers 2007-8, University of Massachusetts Amherst, Department of Resource Economics.
- Henson, Spencer J. & Caswell, Julie A. & Cranfield, John A.L. & Fazil, Aamir & Davidson, Valerie J. & Anders, Sven M. & Schmidt, Claudia, 2007. "A Multi-Factorial Risk Prioritization Framework for Food-borne Pathogens," Working Paper Series 7385, University of Massachusetts, Amherst, Department of Resource Economics.
Becchio, Cristina & Bottero, Marta Carla & Corgnati, Stefano Paolo & Dell’Anna, Federico, 2018. "Decision making for sustainable urban energy planning: an integrated evaluation framework of alternative solutions for a NZED (Net Zero-Energy District) in Turin," Land Use Policy, Elsevier, vol. 78(C), pages 803-817.
Tunjo Perić & Zoran Babić & Josip Matejaš, 2018. "Comparative analysis of application efficiency of two iterative multi objective linear programming methods (MP method and STEM method)," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 26(3), pages 565-583, September.
Morgenroth, Edgar & FitzGerald, John & FitzGerald, John, 2006. "Summary and Conclusions," Book Chapters, in: Morgenroth, Edgar (ed.),Ex-Ante Evaluation of the Investment Priorities for the National Development Plan 2007-2013, chapter 24, pages 317-333, Economic and Social Research Institute (ESRI).
- Baker, Terence J. & FitzGerald, John & Honohan, Patrick & FitzGerald, John & Honohan, Patrick, 1996. "Summary and Conclusions," Book Chapters, in: Baker, Terence J. (ed.),Economic Implications for Ireland of EMU, chapter 12, pages 339-352, Economic and Social Research Institute (ESRI).
Fernandez, Eduardo & Navarro, Jorge & Bernal, Sergio, 2010. "Handling multicriteria preferences in cluster analysis," European Journal of Operational Research, Elsevier, vol. 202(3), pages 819-827, May.

More about this item

Keywords

; ; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijitdm:v:12:y:2013:i:05:n:s0219622013400075. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijitdm/ijitdm.shtml .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A Compromise Programming Approach To Multiobjective Markov Decision Processes

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data