
High‐dimensional prediction for count response via sparse exponential weights

Author

Listed:
  • The Tien Mai

Abstract

Count data is prevalent in fields such as ecology, medicine, and genomics. In high‐dimensional settings, where the number of features exceeds the sample size, feature selection becomes essential. While frequentist methods like the Lasso have advanced in handling high‐dimensional count data, Bayesian approaches remain underexplored, with no theoretical results on prediction performance. This article introduces a novel probabilistic machine learning framework for high‐dimensional count data prediction. We propose a pseudo‐Bayesian method that integrates a scaled Student prior to promote sparsity and uses an exponential weight aggregation procedure. A key contribution is a novel risk measure tailored to count data prediction, with theoretical guarantees for prediction risk obtained via PAC‐Bayesian bounds. Our results include nonasymptotic oracle inequalities, demonstrating rate‐optimal prediction error without prior knowledge of sparsity. We implement this approach efficiently using the Langevin Monte Carlo method. Simulations and a real data application highlight the strong performance of our method compared to the Lasso in various settings.
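To make the ingredients named in the abstract more concrete, the following is a minimal Python sketch of an unadjusted Langevin Monte Carlo sampler targeting an exponentially weighted pseudo-posterior. It is not the author's implementation, and several choices are assumptions made purely for illustration: the loss is taken to be the averaged Poisson negative log-likelihood with a log link (the article's own risk measure is not given in this record), the scaled Student prior is taken in the form pi(beta) proportional to the product over j of (tau^2 + beta_j^2)^(-2), and the function names and the values of `lam`, `tau`, `step`, `n_iter`, and `burn_in` are hypothetical.

```python
import numpy as np


def grad_log_pseudo_posterior(beta, X, y, lam, tau):
    """Gradient of log pseudo-posterior: -lam * loss(beta) + log prior(beta)."""
    # Assumed loss: averaged Poisson negative log-likelihood with log link.
    # The linear predictor is clipped only as a numerical guard against overflow.
    eta = np.clip(X @ beta, -20.0, 20.0)
    mu = np.exp(eta)
    grad_loss = X.T @ (mu - y) / len(y)
    # Assumed prior: log pi(beta) = -2 * sum(log(tau^2 + beta_j^2)) + const.
    grad_log_prior = -4.0 * beta / (tau**2 + beta**2)
    return -lam * grad_loss + grad_log_prior


def langevin_posterior_mean(X, y, lam=None, tau=0.1, step=1e-4,
                            n_iter=2000, burn_in=500, seed=0):
    """Approximate the pseudo-posterior mean with unadjusted Langevin steps."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    if lam is None:
        lam = float(n)  # hypothetical default for the inverse temperature
    beta = np.zeros(p)
    running_sum = np.zeros(p)
    for t in range(n_iter):
        noise = rng.standard_normal(p)
        # Langevin update: gradient drift plus Gaussian noise of variance 2 * step.
        beta = (beta
                + step * grad_log_pseudo_posterior(beta, X, y, lam, tau)
                + np.sqrt(2.0 * step) * noise)
        if t >= burn_in:
            running_sum += beta
    # Average of the post-burn-in iterates serves as the aggregated estimator.
    return running_sum / (n_iter - burn_in)
```

Calling `langevin_posterior_mean(X, y)` with an `n x p` design matrix `X` and a length-`n` vector of counts `y` returns a length-`p` coefficient vector, and `np.exp(X_new @ beta_hat)` would then give predicted mean counts under the assumed log link. An unadjusted sampler is used here for brevity; it introduces a discretization bias that depends on `step`.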

Suggested Citation

  • The Tien Mai, 2025. "High‐dimensional prediction for count response via sparse exponential weights," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 79(4), November.
  • Handle: RePEc:bla:stanee:v:79:y:2025:i:4:n:e70013
    DOI: 10.1111/stan.70013

    Download full text from publisher

    File URL: https://doi.org/10.1111/stan.70013
    Download Restriction: no


