Online Multi-task Learning with Hard Constraints

My bibliography Save this paper

Online Multi-task Learning with Hard Constraints

Author

Listed:

Gabor Lugosi
(ICREA - Institució Catalana de Recerca i Estudis Avançats)
Omiros Papaspiliopoulos
(ICREA - Institució Catalana de Recerca i Estudis Avançats)
Gilles Stoltz
(DMA - Département de Mathématiques et Applications - ENS Paris - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris sciences et lettres - CNRS - Centre National de la Recherche Scientifique, GREGH - Groupement de Recherche et d'Etudes en Gestion à HEC - HEC Paris - Ecole des Hautes Etudes Commerciales - CNRS - Centre National de la Recherche Scientifique)

Registered:

Gilles Stoltz

Abstract

We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which is modeled by imposing that the M-tuple of actions taken by the decision maker needs to satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing to an on-line shortest path problem. We briefly discuss ``tracking'' and ``bandit'' versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.

Suggested Citation

Gabor Lugosi & Omiros Papaspiliopoulos & Gilles Stoltz, 2009. "Online Multi-task Learning with Hard Constraints," Working Papers hal-00362643, HAL.

Handle: RePEc:hal:wpaper:hal-00362643
Note: View the original document on HAL open archive server: https://hal.science/hal-00362643v2

Download full text from publisher

References listed on IDEAS

Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
- Friederike Mengel, 2007. "Learning Across Games," Working Papers. Serie AD 2007-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Marco LiCalzi & Roland Mühlenbernd, 2022. "Feature-weighted categorized play across symmetric games," Experimental Economics, Springer;Economic Science Association, vol. 25(3), pages 1052-1078, June.
Marchiori, Davide & Di Guida, Sibilla & Polonio, Luca, 2021. "Plasticity of strategic sophistication in interactive decision-making," Journal of Economic Theory, Elsevier, vol. 196(C).
Florian Gauer & Christoph Kuzmics, 2020. "Cognitive Empathy In Conflict Situations," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 61(4), pages 1659-1678, November.
- Florian Gauer & Christoph Kuzmics, 2016. "Cognitive Empathy in Conflict Situations," Graz Economics Papers 2016-02, University of Graz, Department of Economics.
- Gauer, Florian & Kuzmics, Christoph, 2016. "Cognitive empathy in conflict situations," Center for Mathematical Economics Working Papers 551, Center for Mathematical Economics, Bielefeld University.
Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," PSE Working Papers halshs-03735680, HAL.
- Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.
Wei James Chen & Joseph Tao-yi Wang, 2020. "A modified Monty Hall problem," Theory and Decision, Springer, vol. 89(2), pages 151-156, September.
Lensberg, Terje & Schenk-Hoppé, Klaus Reiner, 2021. "Cold play: Learning across bimatrix games," Journal of Economic Behavior & Organization, Elsevier, vol. 185(C), pages 419-441.
- Lensberg, Terje & Schenk-Hoppé, Klaus R., 2020. "Cold play: Learning across bimatrix games," MPRA Paper 99095, University Library of Munich, Germany.
Jehiel, Philippe & Singh, Juni, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Games and Economic Behavior, Elsevier, vol. 130(C), pages 1-24.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE Working Papers halshs-02183444, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Post-Print halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE-Ecole d'économie de Paris (Postprint) halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Working Papers halshs-02183444, HAL.
Edward W. Piotrowski & Jan Sladkowski & Anna Szczypinska, "undated". "Reinforcement Learning in Market Games," Departmental Working Papers 30, University of Bialtystok, Department of Theoretical Physics.
- Edward W. Piotrowski & Jan Sladkowski & Anna Szczypinska, 2007. "Reinforcement learning in market games," Papers 0710.0114, arXiv.org.
Daskalova, Vessela & Vriend, Nicolaas J., 2020. "Categorization and coordination," European Economic Review, Elsevier, vol. 129(C).
- Vessela Daskalova & Nicolaas J. Vriend, 2014. "Categorization and Coordination," Working Papers 719, Queen Mary University of London, School of Economics and Finance.
- Vessela Daskalova & Nicolaas J. Vriend, 2014. "Categorization and Coordination," Cambridge Working Papers in Economics 1460, Faculty of Economics, University of Cambridge.
Sawa, Ryoji & Zusai, Dai, 2019. "Evolutionary dynamics in multitasking environments," Journal of Economic Behavior & Organization, Elsevier, vol. 166(C), pages 288-308.
Mohlin, Erik, 2014. "Optimal categorization," Journal of Economic Theory, Elsevier, vol. 152(C), pages 356-381.
- Mohlin, Erik, 2009. "Optimal Categorization," SSE/EFI Working Paper Series in Economics and Finance 721, Stockholm School of Economics, revised 30 May 2014.
Mengel, Friederike & Sciubba, Emanuela, 2010. "Extrapolation in Games of Coordination and Dominance Solvable Games," Sustainable Development Papers 98475, Fondazione Eni Enrico Mattei (FEEM).
- Mengel, F. & Sciubba, E., 2010. "Extrapolation in games of coordination and dominance solvable games," Research Memorandum 034, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
- Friederike Mengel & Emanuela Sciubba, 2010. "Extrapolation in Games of Coordination and Dominance Solvable Games," Working Papers 2010.148, Fondazione Eni Enrico Mattei.
Benndorf, Volker & Martínez-Martínez, Ismael & Normann, Hans-Theo, 2016. "Equilibrium selection with coupled populations in hawk–dove games: Theory and experiment in continuous time," Journal of Economic Theory, Elsevier, vol. 165(C), pages 472-486.
- Benndorf, Volker & Martinez-Martinez, Ismael & Normann, Hans-Theo, 2016. "Equilibrium selection with coupled populations in hawk-dove games: Theory and experiment in continuous time," DICE Discussion Papers 222, Heinrich Heine University Düsseldorf, Düsseldorf Institute for Competition Economics (DICE).
Christoph March, 2011. "Adaptive social learning," PSE Working Papers halshs-00572528, HAL.
- Christoph March, 2016. "Adaptive Social Learning," CESifo Working Paper Series 5783, CESifo.
- Christoph March, 2011. "Adaptive social learning," Working Papers halshs-00572528, HAL.
Mohlin, Erik, 2012. "Evolution of theories of mind," Games and Economic Behavior, Elsevier, vol. 75(1), pages 299-318.
- Mohlin, Erik, 2010. "Evolution of Theories of Mind," SSE/EFI Working Paper Series in Economics and Finance 0728, Stockholm School of Economics, revised 20 Mar 2012.
, & ,, 2008. "Contagion through learning," Theoretical Economics, Econometric Society, vol. 3(4), December.
- Jakub Steiner, 2007. "Contagion through Learning," Edinburgh School of Economics Discussion Paper Series 151, Edinburgh School of Economics, University of Edinburgh.
Khan, Abhimanyu, 2021. "Evolutionary Stability of Behavioural Rules," MPRA Paper 112920, University Library of Munich, Germany, revised 01 May 2022.
- Khan, Abhimanyu, 2021. "Evolutionary Stability of Behavioural Rules," MPRA Paper 111309, University Library of Munich, Germany.
Marco LiCalzi & Roland Mühlenbernd, 2019. "Categorization and Cooperation across Games," Games, MDPI, vol. 10(1), pages 1-21, January.
- Marco LiCalzi & Roland Muhlenbernd, 2018. "Categorization and cooperation across games," Working Papers 14, Department of Management, Università Ca' Foscari Venezia.
Yasar, Alperen, 2023. "Power struggles and gender discrimination in the workplace," SocArXiv t4g83, Center for Open Science.
Arina Nikandrova, 2013. "Repeated Play of Families of Games by Resource-Constrained Players," Games, MDPI, vol. 4(3), pages 1-8, July.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:wpaper:hal-00362643. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Online Multi-task Learning with Hard Constraints

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data