IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2312.15595.html
   My bibliography  Save this paper

Zero-Inflated Bandits

Author

Listed:
  • Haoyu Wei
  • Runzhe Wan
  • Lei Shi
  • Rui Song

Abstract

Many real-world bandit applications are characterized by sparse rewards, which can significantly hinder learning efficiency. Leveraging problem-specific structures for careful distribution modeling is recognized as essential for improving estimation efficiency in statistics. However, this approach remains under-explored in the context of bandits. To address this gap, we initiate the study of zero-inflated bandits, where the reward is modeled using a classic semi-parametric distribution known as the zero-inflated distribution. We develop algorithms based on the Upper Confidence Bound and Thompson Sampling frameworks for this specific structure. The superior empirical performance of these methods is demonstrated through extensive numerical studies.

Suggested Citation

  • Haoyu Wei & Runzhe Wan & Lei Shi & Rui Song, 2023. "Zero-Inflated Bandits," Papers 2312.15595, arXiv.org, revised Jan 2025.
  • Handle: RePEc:arx:papers:2312.15595
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2312.15595
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Weron, Rafal, 1996. "Correction to: "On the Chambers–Mallows–Stuck Method for Simulating Skewed Stable Random Variables"," MPRA Paper 20761, University Library of Munich, Germany, revised 2010.
    2. Weron, Rafal, 1996. "On the Chambers-Mallows-Stuck method for simulating skewed stable random variables," Statistics & Probability Letters, Elsevier, vol. 28(2), pages 165-171, June.
    3. Huiming Zhang & Haoyu Wei, 2022. "Sharper Sub-Weibull Concentrations," Mathematics, MDPI, vol. 10(13), pages 1-29, June.
    4. Bogucki, Robert, 2015. "Suprema of canonical Weibull processes," Statistics & Probability Letters, Elsevier, vol. 107(C), pages 253-263.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Furrer, Hansjorg & Michna, Zbigniew & Weron, Aleksander, 1997. "Stable Lévy motion approximation in collective risk theory," Insurance: Mathematics and Economics, Elsevier, vol. 20(2), pages 97-114, September.
    2. J.-F. Chamayou, 2001. "Pseudo random numbers for the Landau and Vavilov distributions," Computational Statistics, Springer, vol. 16(1), pages 131-152, March.
    3. Weron, Rafał, 2004. "Computationally intensive Value at Risk calculations," Papers 2004,32, Humboldt University of Berlin, Center for Applied Statistics and Economics (CASE).
    4. Tsionas, Mike, 2012. "Simple techniques for likelihood analysis of univariate and multivariate stable distributions: with extensions to multivariate stochastic volatility and dynamic factor models," MPRA Paper 40966, University Library of Munich, Germany, revised 20 Aug 2012.
    5. John C. Frain, 2007. "Small sample power of tests of normality when the alternative is an alpha-stable distribution," Trinity Economics Papers tep0207, Trinity College Dublin, Department of Economics.
    6. Harry Pavlopoulos & George Chronis, 2023. "On highly skewed fractional log‐stable noise sequences and their application," Journal of Time Series Analysis, Wiley Blackwell, vol. 44(4), pages 337-358, July.
    7. Taufer, Emanuele, 2015. "On the empirical process of strongly dependent stable random variables: asymptotic properties, simulation and applications," Statistics & Probability Letters, Elsevier, vol. 106(C), pages 262-271.
    8. Goddard, John & Onali, Enrico, 2012. "Self-affinity in financial asset returns," International Review of Financial Analysis, Elsevier, vol. 24(C), pages 1-11.
    9. Guarcello, C., 2021. "Lévy noise effects on Josephson junctions," Chaos, Solitons & Fractals, Elsevier, vol. 153(P2).
    10. Mbakob Yonkeu, R. & David, Afungchui, 2022. "Coherence and stochastic resonance in the fractional-birhythmic self-sustained system subjected to fractional time-delay feedback and Lévy noise," Chaos, Solitons & Fractals, Elsevier, vol. 165(P1).
    11. Kotchoni, Rachidi, 2012. "Applications of the characteristic function-based continuum GMM in finance," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3599-3622.
    12. Danish A. Ahmed & Sergei V. Petrovskii & Paulo F. C. Tilles, 2018. "The “Lévy or Diffusion” Controversy: How Important Is the Movement Pattern in the Context of Trapping?," Mathematics, MDPI, vol. 6(5), pages 1-27, May.
    13. Marc S. Paolella, 2016. "Stable-GARCH Models for Financial Returns: Fast Estimation and Tests for Stability," Econometrics, MDPI, vol. 4(2), pages 1-28, May.
    14. Wang, Xiaolong & Feng, Jing & Liu, Qi & Li, Yongge & Xu, Yong, 2022. "Neural network-based parameter estimation of stochastic differential equations driven by Lévy noise," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 606(C).
    15. Stéphane Goutte & David Guerreiro & Bilel Sanhaji & Sophie Saglio & Julien Chevallier, 2019. "International Financial Markets," Post-Print halshs-02183053, HAL.
    16. Svetlana Boyarchenko & Sergei Levendorskiu{i}, 2022. "Efficient evaluation of expectations of functions of a stable L\'evy process and its extremum," Papers 2209.12349, arXiv.org.
    17. Fernández-Martínez, M. & Sánchez-Granero, M.A. & Trinidad Segovia, J.E., 2013. "Measuring the self-similarity exponent in Lévy stable processes of financial time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(21), pages 5330-5345.
    18. repec:hum:wpaper:sfb649dp2005-008 is not listed on IDEAS
    19. Svetlana Boyarchenko & Sergei Levendorskii, 2023. "Simulation of a L\'evy process, its extremum, and hitting time of the extremum via characteristic functions," Papers 2312.03929, arXiv.org.
    20. Borak, Szymon & Härdle, Wolfgang Karl & Weron, Rafał, 2005. "Stable distributions," SFB 649 Discussion Papers 2005-008, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    21. Borak, Szymon & Misiorek, Adam & Weron, Rafał, 2010. "Models for heavy-tailed asset returns," SFB 649 Discussion Papers 2010-049, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2312.15595. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.