IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2507.00763.html

Comparing Misspecified Models with Big Data: A Variational Bayesian Perspective

Author

Listed:
  • Yong Li
  • Sushanta K. Mallick
  • Tao Zeng
  • Junxing Zhang

Abstract

Optimal data detection in massive multiple-input multiple-output (MIMO) systems often requires prohibitively high computational complexity. A variety of detection algorithms have been proposed in the literature, offering different trade-offs between complexity and detection performance. In recent years, Variational Bayes (VB) has emerged as a widely used method for addressing statistical inference in the context of massive data. This study focuses on misspecified models and examines the risk functions associated with predictive distributions derived from variational posterior distributions. These risk functions, defined as the expectation of the Kullback-Leibler (KL) divergence between the true data-generating density and the variational predictive distributions, provide a framework for assessing predictive performance. We propose two novel information criteria for predictive model comparison based on these risk functions. Under certain regularity conditions, we demonstrate that the proposed information criteria are asymptotically unbiased estimators of their respective risk functions. Through comprehensive numerical simulations and empirical applications in economics and finance, we demonstrate the effectiveness of these information criteria in comparing misspecified models in the context of massive data.
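
To make the abstract's risk function concrete, here is a minimal sketch on a toy example that is not taken from the paper. It assumes the true density is N(0, 2^2) while the working model is y_i ~ N(mu, 1) with a flat prior, so the variance is misspecified. In this conjugate case a Gaussian variational posterior q(mu) = N(ybar, 1/n) coincides with the exact posterior, and the resulting predictive distribution is N(ybar, 1 + 1/n). The risk is then the expectation, over repeated datasets, of the KL divergence from the true density to that predictive. All function names and parameter values below are illustrative assumptions.

```python
import math
import random


def kl_normal(mean_p, sd_p, mean_q, sd_q):
    """Closed-form KL( N(mean_p, sd_p^2) || N(mean_q, sd_q^2) )."""
    return (math.log(sd_q / sd_p)
            + (sd_p ** 2 + (mean_p - mean_q) ** 2) / (2.0 * sd_q ** 2)
            - 0.5)


def predictive_risk(n, true_mean=0.0, true_sd=2.0, n_datasets=20000, seed=0):
    """Monte Carlo estimate of E_y[ KL(true density || predictive) ].

    Assumed toy setup (hypothetical, not the paper's): working model
    y_i ~ N(mu, 1) with a flat prior, Gaussian q(mu) = N(ybar, 1/n),
    hence predictive N(ybar, 1 + 1/n).
    """
    rng = random.Random(seed)
    pred_sd = math.sqrt(1.0 + 1.0 / n)
    total = 0.0
    for _ in range(n_datasets):
        # The sample mean of n true draws is N(true_mean, true_sd^2 / n).
        ybar = rng.gauss(true_mean, true_sd / math.sqrt(n))
        total += kl_normal(true_mean, true_sd, ybar, pred_sd)
    return total / n_datasets


def exact_risk(n, true_sd=2.0):
    """Analytic value of the same risk for this toy setup."""
    pred_var = 1.0 + 1.0 / n
    return 0.5 * math.log(pred_var / true_sd ** 2) + 0.5 * true_sd ** 2 - 0.5


if __name__ == "__main__":
    n = 50
    print("Monte Carlo risk:", predictive_risk(n))
    print("Analytic risk:   ", exact_risk(n))
```

In this toy case the risk stays well above zero even as n grows, because the working model fixes the wrong variance. An information criterion of the kind the abstract describes would aim to estimate such a risk from the observed data alone, without knowing the true density.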

Suggested Citation

  • Yong Li & Sushanta K. Mallick & Tao Zeng & Junxing Zhang, 2025. "Comparing Misspecified Models with Big Data: A Variational Bayesian Perspective," Papers 2507.00763, arXiv.org.
  • Handle: RePEc:arx:papers:2507.00763

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.00763
    File Function: Latest version
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yan Qian & Zijun Wang, 2021. "A model selection approach to jointly testing for structural breaks and cointegration with application to the Eurocurrency interest rates market," Empirical Economics, Springer, vol. 61(2), pages 799-825, August.
    2. Kim, Jae-Young, 2012. "Model selection in the presence of nonstationarity," Journal of Econometrics, Elsevier, vol. 169(2), pages 247-257.
    3. Aaron Schiff & Peter Phillips, 2000. "Forecasting New Zealand's real GDP," New Zealand Economic Papers, Taylor & Francis Journals, vol. 34(2), pages 159-181.
    4. Phillips, Peter C.B., 2005. "Automated Discovery In Econometrics," Econometric Theory, Cambridge University Press, vol. 21(1), pages 3-20, February.
    5. Todd E. Clark & Michael W. McCracken, 2010. "Averaging forecasts from VARs with uncertain instabilities," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 5-29, January.
    6. Mur, Jesús & Angulo, Ana, 2009. "Model selection strategies in a spatial setting: Some additional results," Regional Science and Urban Economics, Elsevier, vol. 39(2), pages 200-213, March.
    7. Werner Ploberger & Peter C.B. Phillips, 1998. "Rissanen's Theorem and Econometric Time Series," Cowles Foundation Discussion Papers 1197, Cowles Foundation for Research in Economics, Yale University.
    8. Gael M. Martin & David T. Frazier & Christian P. Robert, 2020. "Computing Bayes: Bayesian Computation from 1763 to the 21st Century," Monash Econometrics and Business Statistics Working Papers 14/20, Monash University, Department of Econometrics and Business Statistics.
    9. Ho, Paul, 2023. "Global robust Bayesian analysis in large models," Journal of Econometrics, Elsevier, vol. 235(2), pages 608-642.
    10. Gael M. Martin & David T. Frazier & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2023. "Bayesian Forecasting in the 21st Century: A Modern Review," Monash Econometrics and Business Statistics Working Papers 1/23, Monash University, Department of Econometrics and Business Statistics.
    11. Kleibergen, Frank & Paap, Richard, 2002. "Priors, posteriors and bayes factors for a Bayesian analysis of cointegration," Journal of Econometrics, Elsevier, vol. 111(2), pages 223-249, December.
    12. Loaiza-Maya, Rubén & Nibbering, Didier & Zhu, Dan, 2024. "Hybrid unadjusted Langevin methods for high-dimensional latent variable models," Journal of Econometrics, Elsevier, vol. 241(2).
    13. Tao Zeng & Yong Li & Jun Yu, 2014. "Deviance Information Criterion for Comparing VAR Models," Advances in Econometrics, in: Essays in Honor of Peter C. B. Phillips, volume 33, pages 615-637, Emerald Group Publishing Limited.
    14. Chao, John C. & Phillips, Peter C. B., 1999. "Model selection in partially nonstationary vector autoregressive processes with reduced rank structure," Journal of Econometrics, Elsevier, vol. 91(2), pages 227-271, August.
    15. Yuan Fang & Dimitris Karlis & Sanjeena Subedi, 2022. "Infinite Mixtures of Multivariate Normal-Inverse Gaussian Distributions for Clustering of Skewed Data," Journal of Classification, Springer;The Classification Society, vol. 39(3), pages 510-552, November.
    16. Ivanov Ventzislav & Kilian Lutz, 2005. "A Practitioner's Guide to Lag Order Selection For VAR Impulse Response Analysis," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 9(1), pages 1-36, March.
    17. Pesaran, Hashem & Timmermann, Allan, 2005. "Real-Time Econometrics," Econometric Theory, Cambridge University Press, vol. 21(1), pages 212-231, February.
    18. Kleibergen, F.R. & Paap, R., 1996. "Priors, Posterior Odds and Lagrange Multiplier Statistics in Bayesian Analyses of Cointegration," Econometric Institute Research Papers EI 9668-/A, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    19. Shen‐Ming Lee & Truong‐Nhat Le & Phuoc‐Loc Tran & Chin‐Shang Li, 2022. "Investigating the association of a sensitive attribute with a random variable using the Christofides generalised randomised response design and Bayesian methods," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1471-1502, November.
    20. Ye Yang & Osman Doğan & Süleyman Taşpınar, 2023. "Observed-data DIC for spatial panel data models," Empirical Economics, Springer, vol. 64(3), pages 1281-1314, March.
