Printed from https://ideas.repec.org/p/swe/wpaper/2007-13.html

Bayesian Covariance Matrix Estimation using a Mixture of Decomposable Graphical Models

Authors

  • Helen Armstrong (School of Mathematics, University of New South Wales)

  • Christopher K. Carter (School of Economics, University of New South Wales)

  • Kevin K. F. Wong (Graduate University for Advanced Studies, Tokyo, Japan)

  • Robert Kohn (School of Economics, University of New South Wales)

Abstract

Estimating a covariance matrix efficiently and discovering its structure are important statistical problems with applications in many fields. This article takes a Bayesian approach to estimating the covariance matrix of Gaussian data. We use ideas from Gaussian graphical models and model selection to construct a prior for the covariance matrix that is a mixture over all decomposable graphs, where a graph means the configuration of nonzero off-diagonal elements in the inverse of the covariance matrix. Under our prior, the probability of each graph size is specified by the user and graphs of equal size are assigned equal probability. Most previous approaches assume that all graphs are equally probable. We give empirical results showing that the prior that assigns equal probability over graph sizes outperforms the prior that assigns equal probability over all graphs, both in identifying the correct decomposable graph and in estimating the covariance matrix more efficiently. The advantage is greatest when the number of observations is small relative to the dimension of the covariance matrix. The article also shows empirically that using the mixture over decomposable graphs prior to estimate a general covariance matrix changes statistical efficiency only minimally compared with the Bayesian estimator of Wong et al. (2003), even when the graph of the covariance matrix is nondecomposable. However, our approach has some important advantages over that of Wong et al. (2003). Our method requires the number of decomposable graphs for each graph size. We show how to estimate these numbers using simulation and that the simulation results agree with analytic results when such results are known.
We also show how to estimate the posterior distribution of the covariance matrix using Markov chain Monte Carlo with the elements of the covariance matrix integrated out, and give empirical results showing that the sampler is computationally efficient and converges rapidly. Finally, we note that both the prior and the simulation method used to evaluate the prior apply generally to any decomposable graphical model.
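
The size-based prior described in the abstract can be illustrated on a very small problem. The following is a minimal Python sketch, not the authors' implementation: it counts decomposable (chordal) graphs of each size by brute-force enumeration, whereas the paper estimates these counts by simulation for realistic dimensions. The function names (`is_decomposable`, `decomposable_counts`, `per_graph_prior`) and the uniform choice of size probabilities are assumptions made for illustration only; here "size" is taken to mean the number of edges.

```python
from itertools import combinations


def is_decomposable(p, edges):
    """Return True if the undirected graph on nodes 0..p-1 is chordal.

    Uses the fact that a graph is chordal exactly when simplicial vertices
    (vertices whose remaining neighbours form a clique) can be removed one
    at a time until no vertices are left.
    """
    adj = {v: set() for v in range(p)}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    remaining = set(range(p))
    while remaining:
        for v in sorted(remaining):
            nbrs = adj[v] & remaining
            if all(b in adj[a] for a, b in combinations(nbrs, 2)):
                remaining.remove(v)  # v is simplicial: eliminate it
                break
        else:
            return False  # no simplicial vertex left, so not chordal
    return True


def decomposable_counts(p):
    """Count decomposable graphs on p labelled nodes, grouped by edge count."""
    all_edges = list(combinations(range(p), 2))
    counts = {}
    for k in range(len(all_edges) + 1):
        counts[k] = sum(
            1 for edges in combinations(all_edges, k) if is_decomposable(p, edges)
        )
    return counts


def per_graph_prior(counts, size_prior=None):
    """Prior probability of a single graph of each size.

    size_prior maps edge count -> probability of that size (user-specified
    in the paper; uniform over sizes here as an illustrative assumption).
    Graphs of equal size share that probability equally.
    """
    if size_prior is None:
        size_prior = {k: 1.0 / len(counts) for k in counts}
    return {k: size_prior[k] / n for k, n in counts.items() if n > 0}


if __name__ == "__main__":
    counts = decomposable_counts(4)
    print("decomposable graphs by number of edges:", counts)
    print("prior probability of one graph of each size:", per_graph_prior(counts))
```

Exhaustive enumeration is only feasible for a handful of variables, since the number of candidate graphs is 2^(p(p-1)/2); that growth is why the counts have to be estimated rather than enumerated once the dimension increases.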

Suggested Citation

  • Helen Armstrong & Christopher K. Carter & Kevin K. F. Wong & Robert Kohn, 2007. "Bayesian Covariance Matrix Estimation using a Mixture of Decomposable Graphical Models," Discussion Papers 2007-13, School of Economics, The University of New South Wales.
  • Handle: RePEc:swe:wpaper:2007-13

    Download full text from publisher

    File URL: http://wwwdocs.fce.unsw.edu.au/economics/Research/WorkingPapers/2007_13.pdf
    Download Restriction: no

    References listed on IDEAS

    1. S. P. Brooks & P. Giudici & G. O. Roberts, 2003. "Efficient construction of reversible jump Markov chain Monte Carlo proposal distributions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(1), pages 3-39.
    2. Smith M. & Kohn R., 2002. "Parsimonious Covariance Matrix Estimation for Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1141-1153, December.
    3. Alberto Roverato, 2002. "Hyper Inverse Wishart Distribution for Non-decomposable Graphs and its Application to Bayesian Inference for Gaussian Graphical Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 29(3), pages 391-411.
    4. Frederick Wong, 2003. "Efficient estimation of covariance selection models," Biometrika, Biometrika Trust, vol. 90(4), pages 809-830, December.
    5. Mathias Drton, 2004. "Model selection for Gaussian concentration graphs," Biometrika, Biometrika Trust, vol. 91(3), pages 591-602, September.

    More about this item

    Keywords

    Covariance selection; Graphical models; Reduced conditional sampling; Variable selection;
