On efficient calculations for Bayesian variable selection
We describe an efficient, exact Bayesian algorithm applicable to both variable selection and model averaging problems. A fully Bayesian approach provides a more complete characterization of the posterior ensemble of possible sub-models, but presents a computational challenge as the number of candidate variables increases. Several approximation techniques have been developed for problems that contain a large number of candidate variables, including BMA, IBMA, MCMC and Gibbs Sampling approaches. Here we focus instead on improving the time complexity of exact inference. We present a recursive algorithm (Exact Bayesian Inference in Regression, or EBIR) that uses components of one sub-model to rapidly generate another, and prove that its time complexity is O(m^2), where m is the number of candidate variables. Testing against simulated data shows that EBIR significantly reduces compute time without sacrificing accuracy, while comparisons to the results obtained by MCMC approaches on the Crime and Punishment data set show that model averaging yields improved predictive performance over two model selection approaches. In addition, we show that finite mixtures of centroid solutions provide a means to better characterize the shape of multimodal posterior spaces than any individual model. Finally, we describe how the BIC approximations employed in the BMA and IBMA algorithms can be replaced by an EBIR calculation of equal time complexity, and illustrate the departure of the BIC approximation from the exact Bayesian inference of EBIR.
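To make the computational challenge concrete, the following is a minimal sketch of brute-force model averaging with BIC-approximated posterior weights, the kind of approximation the abstract contrasts with exact inference. It is not the EBIR algorithm, whose details are not given here. The function names (`solve`, `rss`, `bic_model_weights`, `inclusion_probabilities`), the simulated data, and the assumption of equal prior probability for every sub-model are illustrative choices, not taken from the paper.

```python
import itertools
import math
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def rss(X, y, subset):
    """Residual sum of squares of an OLS fit on an intercept plus the chosen columns."""
    rows = [[1.0] + [row[j] for j in subset] for row in X]
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    beta = solve(xtx, xty)
    return sum((yi - sum(b * v for b, v in zip(beta, r))) ** 2 for r, yi in zip(rows, y))


def bic_model_weights(X, y):
    """Approximate posterior weight of every sub-model via exp(-BIC/2).

    Assumes equal prior probability for each sub-model (an illustrative choice).
    """
    n, m = len(y), len(X[0])
    # All 2^m subsets of the m candidate variables.
    subsets = [s for k in range(m + 1) for s in itertools.combinations(range(m), k)]
    # Gaussian-regression BIC; the parameter count is the slopes plus the intercept.
    bics = [n * math.log(rss(X, y, s) / n) + (len(s) + 1) * math.log(n) for s in subsets]
    best = min(bics)  # subtract the minimum before exponentiating to avoid underflow
    raw = [math.exp(-0.5 * (b - best)) for b in bics]
    total = sum(raw)
    return {s: w / total for s, w in zip(subsets, raw)}


def inclusion_probabilities(weights, m):
    """Sum the weights of all sub-models that contain each variable."""
    return [sum(w for s, w in weights.items() if j in s) for j in range(m)]


# Simulated data (hypothetical): only variables 0 and 2 affect the response.
random.seed(0)
n, m = 60, 4
X = [[random.gauss(0, 1) for _ in range(m)] for _ in range(n)]
y = [2.0 * row[0] - 1.5 * row[2] + random.gauss(0, 1) for row in X]

weights = bic_model_weights(X, y)
incl = inclusion_probabilities(weights, m)
```

Each of the 2^m sub-models is refitted from scratch in this sketch, so the total cost doubles with every added candidate variable. Reusing components of one sub-model to generate the next, as the abstract describes for EBIR, targets the per-model cost rather than the number of sub-models.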
RePEc handle: RePEc:eee:csdana:v:56:y:2012:i:6:p:1319-1332