IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2312.15595.html

Zero-Inflated Bandits

Author

Listed:
  • Haoyu Wei
  • Runzhe Wan
  • Lei Shi
  • Rui Song

Abstract

Many real-world bandit applications are characterized by sparse rewards, which can significantly hinder learning efficiency. Leveraging problem-specific structures for careful distribution modeling is recognized as essential for improving estimation efficiency in statistics. However, this approach remains under-explored in the context of bandits. To address this gap, we initiate the study of zero-inflated bandits, where the reward is modeled using a classic semi-parametric distribution known as the zero-inflated distribution. We develop algorithms based on the Upper Confidence Bound and Thompson Sampling frameworks for this specific structure. The superior empirical performance of these methods is demonstrated through extensive numerical studies.

Suggested Citation

  • Haoyu Wei & Runzhe Wan & Lei Shi & Rui Song, 2023. "Zero-Inflated Bandits," Papers 2312.15595, arXiv.org, revised Jan 2025.
  • Handle: RePEc:arx:papers:2312.15595
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2312.15595
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Weron, Rafal, 1996. "Correction to: "On the Chambers–Mallows–Stuck Method for Simulating Skewed Stable Random Variables"," MPRA Paper 20761, University Library of Munich, Germany, revised 2010.
    2. Weron, Rafal, 1996. "On the Chambers-Mallows-Stuck method for simulating skewed stable random variables," Statistics & Probability Letters, Elsevier, vol. 28(2), pages 165-171, June.
    3. Huiming Zhang & Haoyu Wei, 2022. "Sharper Sub-Weibull Concentrations," Mathematics, MDPI, vol. 10(13), pages 1-29, June.
    4. Bogucki, Robert, 2015. "Suprema of canonical Weibull processes," Statistics & Probability Letters, Elsevier, vol. 107(C), pages 253-263.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Furrer, Hansjorg & Michna, Zbigniew & Weron, Aleksander, 1997. "Stable Lévy motion approximation in collective risk theory," Insurance: Mathematics and Economics, Elsevier, vol. 20(2), pages 97-114, September.
    2. Dassios, Angelos & Qu, Yan & Zhao, Hongbiao, 2018. "Exact simulation for a class of tempered stable," LSE Research Online Documents on Economics 86981, London School of Economics and Political Science, LSE Library.
    3. J.-F. Chamayou, 2001. "Pseudo random numbers for the Landau and Vavilov distributions," Computational Statistics, Springer, vol. 16(1), pages 131-152, March.
    4. Weron, Rafał, 2004. "Computationally intensive Value at Risk calculations," Papers 2004,32, Humboldt University of Berlin, Center for Applied Statistics and Economics (CASE).
    5. Luc Devroye & Lancelot James, 2014. "On simulation and properties of the stable law," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 23(3), pages 307-343, August.
    6. Tsionas, Mike, 2012. "Simple techniques for likelihood analysis of univariate and multivariate stable distributions: with extensions to multivariate stochastic volatility and dynamic factor models," MPRA Paper 40966, University Library of Munich, Germany, revised 20 Aug 2012.
    7. John C. Frain, 2007. "Small sample power of tests of normality when the alternative is an alpha-stable distribution," Trinity Economics Papers tep0207, Trinity College Dublin, Department of Economics.
    8. Harry Pavlopoulos & George Chronis, 2023. "On highly skewed fractional log‐stable noise sequences and their application," Journal of Time Series Analysis, Wiley Blackwell, vol. 44(4), pages 337-358, July.
    9. Yuyu Chen & Taizhong Hu & Seva Shneer & Zhenfeng Zou, 2025. "Stochastic dominance for linear combinations of infinite-mean risks," Papers 2505.01739, arXiv.org.
    10. Chronis, George A., 2016. "Modelling the extreme variability of the US Consumer Price Index inflation with a stable non-symmetric distribution," Economic Modelling, Elsevier, vol. 59(C), pages 271-277.
    11. Taufer, Emanuele, 2015. "On the empirical process of strongly dependent stable random variables: asymptotic properties, simulation and applications," Statistics & Probability Letters, Elsevier, vol. 106(C), pages 262-271.
    12. Goddard, John & Onali, Enrico, 2012. "Self-affinity in financial asset returns," International Review of Financial Analysis, Elsevier, vol. 24(C), pages 1-11.
    13. Guarcello, C., 2021. "Lévy noise effects on Josephson junctions," Chaos, Solitons & Fractals, Elsevier, vol. 153(P2).
    14. Mbakob Yonkeu, R. & David, Afungchui, 2022. "Coherence and stochastic resonance in the fractional-birhythmic self-sustained system subjected to fractional time-delay feedback and Lévy noise," Chaos, Solitons & Fractals, Elsevier, vol. 165(P1).
    15. Guo, Yongfeng & Wang, Linjie & Wei, Fang & Tan, Jianguo, 2019. "Dynamical behavior of simplified FitzHugh-Nagumo neural system driven by Lévy noise and Gaussian white noise," Chaos, Solitons & Fractals, Elsevier, vol. 127(C), pages 118-126.
    16. Danish A. Ahmed & Sergei V. Petrovskii & Paulo F. C. Tilles, 2018. "The “Lévy or Diffusion” Controversy: How Important Is the Movement Pattern in the Context of Trapping?," Mathematics, MDPI, vol. 6(5), pages 1-27, May.
    17. Szczurek, Andrzej & Maciejewska, Monika & Wyłomańska, Agnieszka & Sikora, Grzegorz & Balcerek, Michał & Teuerle, Marek, 2016. "Discrimination of particulate matter emission sources using stochastic methods," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 463(C), pages 452-466.
    18. Kerger, Phillip & Kobayashi, Kei, 2020. "Parameter estimation for one-sided heavy-tailed distributions," Statistics & Probability Letters, Elsevier, vol. 164(C).
    19. Marc S. Paolella, 2016. "Stable-GARCH Models for Financial Returns: Fast Estimation and Tests for Stability," Econometrics, MDPI, vol. 4(2), pages 1-28, May.
    20. Jurić, Višnja, 2025. "Two – Dimensional Modelling of Financial Data," Proceedings of the ENTRENOVA - ENTerprise REsearch InNOVAtion Conference (2024), Hybrid Conference, Dubrovnik, Croatia, in: Proceedings of the ENTRENOVA - ENTerprise REsearch InNOVAtion Conference, Hybrid Conference, Dubrovnik, Croatia, 5-7 September, 2024, pages 73-83, IRENET - Society for Advancing Innovation and Research in Economy, Zagreb.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2312.15595. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.