IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2408.15454.html

BayesSRW: Bayesian Sampling and Re-weighting approach for variance reduction

Author

Listed:
  • Carol Liu

Abstract

In this paper, we address the challenge of sampling in scenarios where limited resources prevent exhaustive measurement across all subjects. We consider a setting where samples are drawn from multiple groups, each following a distribution with unknown mean and variance parameters. We introduce a novel sampling strategy, motivated simply by Cauchy-Schwarz inequality, which minimizes the variance of the population mean estimator by allocating samples proportionally to both the group size and the standard deviation. This approach improves the efficiency of sampling by focusing resources on groups with greater variability, thereby enhancing the precision of the overall estimate. Additionally, we extend our method to a two-stage sampling procedure in a Bayes approach, named BayesSRW, where a preliminary stage is used to estimate the variance, which then informs the optimal allocation of the remaining sampling budget. Through simulation examples, we demonstrate the effectiveness of our approach in reducing estimation uncertainty and providing more reliable insights in applications ranging from user experience surveys to high-dimensional peptide array studies.

Suggested Citation

  • Carol Liu, 2024. "BayesSRW: Bayesian Sampling and Re-weighting approach for variance reduction," Papers 2408.15454, arXiv.org.
  • Handle: RePEc:arx:papers:2408.15454
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2408.15454
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Richard, Jean-Francois & Zhang, Wei, 2007. "Efficient high-dimensional importance sampling," Journal of Econometrics, Elsevier, vol. 141(2), pages 1385-1411, December.
    2. Jean-Francois Richard, 2007. "Efficient High-Dimensional Importance Sampling," Working Paper 321, Department of Economics, University of Pittsburgh, revised Jan 2007.
    3. Jae Kwang Kim & Mingue Park, 2010. "Calibration Estimation in Survey Sampling," International Statistical Review, International Statistical Institute, vol. 78(1), pages 21-39, April.
    4. John D. Storey, 2002. "A direct approach to false discovery rates," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(3), pages 479-498, August.
    5. Efron, Bradley, 2007. "Correlation and Large-Scale Simultaneous Significance Testing," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 93-103, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bauwens, L. & Galli, F., 2009. "Efficient importance sampling for ML estimation of SCD models," Computational Statistics & Data Analysis, Elsevier, vol. 53(6), pages 1974-1992, April.
    2. Wen Shi & Xi Chen & Jennifer Shang, 2019. "An Efficient Morris Method-Based Framework for Simulation Factor Screening," INFORMS Journal on Computing, INFORMS, vol. 31(4), pages 745-770, October.
    3. Yu, Jun, 2012. "A semiparametric stochastic volatility model," Journal of Econometrics, Elsevier, vol. 167(2), pages 473-482.
    4. Jianqing Fan & Xu Han, 2017. "Estimation of the false discovery proportion with unknown dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1143-1164, September.
    5. Florian Heiss, 2016. "Discrete Choice Methods with Simulation," Econometric Reviews, Taylor & Francis Journals, vol. 35(4), pages 688-692, April.
    6. Mengheng Li & Siem Jan (S.J.) Koopman, 2018. "Unobserved Components with Stochastic Volatility in U.S. Inflation: Estimation and Signal Extraction," Tinbergen Institute Discussion Papers 18-027/III, Tinbergen Institute.
    7. Siem Jan Koopman & André Lucas & Marcel Scharth, 2016. "Predicting Time-Varying Parameters with Parameter-Driven and Observation-Driven Models," The Review of Economics and Statistics, MIT Press, vol. 98(1), pages 97-110, March.
    8. Roman Liesenfeld & Guilherme Valle Moura & Jean‐François Richard, 2010. "Determinants and Dynamics of Current Account Reversals: An Empirical Analysis," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 72(4), pages 486-517, August.
    9. Falk Bräuning & Siem Jan Koopman, 2016. "The dynamic factor network model with an application to global credit risk," Working Papers 16-13, Federal Reserve Bank of Boston.
    10. Baştürk, N. & Borowska, A. & Grassi, S. & Hoogerheide, L. & van Dijk, H.K., 2019. "Forecast density combinations of dynamic models and data driven portfolio strategies," Journal of Econometrics, Elsevier, vol. 210(1), pages 170-186.
    11. Blazsek, Szabolcs & Escribano, Alvaro, 2010. "Knowledge spillovers in US patents: A dynamic patent intensity model with secret common innovation factors," Journal of Econometrics, Elsevier, vol. 159(1), pages 14-32, November.
    12. Tommaso Proietti & Alessandra Luati, 2013. "Maximum likelihood estimation of time series models: the Kalman filter and beyond," Chapters, in: Nigar Hashimzade & Michael A. Thornton (ed.), Handbook of Research Methods and Applications in Empirical Macroeconomics, chapter 15, pages 334-362, Edward Elgar Publishing.
    13. Tsyplakov, Alexander, 2010. "The links between inflation and inflation uncertainty at the longer horizon," MPRA Paper 26908, University Library of Munich, Germany.
    14. Wang, Nianling & Yin, Jiyuan & Li, Yong, 2024. "Economic policy uncertainty and stock market volatility in China: Evidence from SV-MIDAS-t model," International Review of Financial Analysis, Elsevier, vol. 92(C).
    15. Vêlayoudom Marimoutou & Manel Soury, 2015. "Energy Markets and CO2 Emissions: Analysis by Stochastic Copula Autoregressive Model," AMSE Working Papers 1520, Aix-Marseille School of Economics, France.
    16. Bretó, Carles, 2014. "On idiosyncratic stochasticity of financial leverage effects," Statistics & Probability Letters, Elsevier, vol. 91(C), pages 20-26.
    17. Siem Jan Koopman & Rutger Lit & Thuy Minh Nguyen, 2012. "Fast Efficient Importance Sampling by State Space Methods," Tinbergen Institute Discussion Papers 12-008/4, Tinbergen Institute, revised 16 Oct 2014.
    18. Creal, Drew D. & Wu, Jing Cynthia, 2015. "Estimation of affine term structure models with spanned or unspanned stochastic volatility," Journal of Econometrics, Elsevier, vol. 185(1), pages 60-81.
    19. André A. Monteiro, 2008. "Parameter Driven Multi-state Duration Models: Simulated vs. Approximate Maximum Likelihood Estimation," Tinbergen Institute Discussion Papers 08-021/2, Tinbergen Institute.
    20. Ozturk, Serda Selin & Demirer, Riza & Gupta, Rangan, 2022. "Climate uncertainty and carbon emissions prices: The relative roles of transition and physical climate risks," Economics Letters, Elsevier, vol. 217(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2408.15454. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.