IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v143y2008i2p291-316.html
   My bibliography  Save this article

Bayesian identification, selection and estimation of semiparametric functions in high-dimensional additive models

Author

Listed:
  • Panagiotelis, Anastasios
  • Smith, Michael

Abstract

In this paper we propose an approach to both estimate and select unknown smooth functions in an additive model with potentially many functions. Each function is written as a linear combination of basis terms, with coefficients regularized by a proper linearly constrained Gaussian prior. Given any potentially rank deficient prior precision matrix, we show how to derive linear constraints so that the corresponding effect is identified in the additive model. This allows for the use of a wide range of bases and precision matrices in priors for regularization. By introducing indicator variables, each constrained Gaussian prior is augmented with a point mass at zero, thus allowing for function selection. Posterior inference is calculated using Markov chain Monte Carlo and the smoothness in the functions is both the result of shrinkage through the constrained Gaussian prior and model averaging. We show how using non-degenerate priors on the shrinkage parameters enables the application of substantially more computationally efficient sampling schemes than would otherwise be the case. We show the favourable performance of our approach when compared to two contemporary alternative Bayesian methods. To highlight the potential of our approach in high-dimensional settings we apply it to estimate two large seemingly unrelated regression models for intra-day electricity load. Both models feature a variety of different univariate and bivariate functions which require different levels of smoothing, and where component selection is meaningful. Priors for the error disturbance covariances are selected carefully and the empirical results provide a substantive contribution to the electricity load modelling literature in their own right.

Suggested Citation

  • Panagiotelis, Anastasios & Smith, Michael, 2008. "Bayesian identification, selection and estimation of semiparametric functions in high-dimensional additive models," Journal of Econometrics, Elsevier, vol. 143(2), pages 291-316, April.
  • Handle: RePEc:eee:econom:v:143:y:2008:i:2:p:291-316
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304-4076(07)00214-X
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. David J. Nott & Robert Kohn, 2005. "Adaptive sampling for Bayesian variable selection," Biometrika, Biometrika Trust, vol. 92(4), pages 747-763, December.
    2. Koop, Gary & Poirier, Dale J., 2004. "Bayesian variants of some classical semiparametric regression techniques," Journal of Econometrics, Elsevier, vol. 123(2), pages 259-282, December.
    3. Alexandre Pintore & Paul Speckman & Chris C. Holmes, 2006. "Spatially adaptive smoothing splines," Biometrika, Biometrika Trust, vol. 93(1), pages 113-125, March.
    4. D. G. T. Denison & B. K. Mallick & A. F. M. Smith, 1998. "Automatic Bayesian curve fitting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 333-350.
    5. Ramanathan, Ramu & Engle, Robert & Granger, Clive W. J. & Vahid-Araghi, Farshid & Brace, Casey, 1997. "Shorte-run forecasts of electricity loads and peaks," International Journal of Forecasting, Elsevier, vol. 13(2), pages 161-174, June.
    6. Smith, Michael, 2000. "Modeling and Short-term Forecasting of New South Wales Electricity System Load," Journal of Business & Economic Statistics, American Statistical Association, vol. 18(4), pages 465-478, October.
    7. Smith, M. & Yau, P. & Shively, T. & Kohn, R., 1998. "Estimating Long-Term Trends in Tropospheric Ozone Levels," Monash Econometrics and Business Statistics Working Papers 2/98, Monash University, Department of Econometrics and Business Statistics.
    8. Fernandez, Carmen & Ley, Eduardo & Steel, Mark F. J., 2001. "Benchmark priors for Bayesian model averaging," Journal of Econometrics, Elsevier, vol. 100(2), pages 381-427, February.
    9. Geweke, John & Keane, Michael, 2007. "Smoothly mixing regressions," Journal of Econometrics, Elsevier, vol. 138(1), pages 252-290, May.
    10. Smith, Michael & Kohn, Robert, 2000. "Nonparametric seemingly unrelated regression," Journal of Econometrics, Elsevier, vol. 98(2), pages 257-281, October.
    11. Smith, Michael & Kohn, Robert, 1996. "Nonparametric regression using Bayesian variable selection," Journal of Econometrics, Elsevier, vol. 75(2), pages 317-343, December.
    12. Sangjoon Kim & Neil Shephard & Siddhartha Chib, 1998. "Stochastic Volatility: Likelihood Inference and Comparison with ARCH Models," Review of Economic Studies, Oxford University Press, vol. 65(3), pages 361-393.
    13. Dale J. Poirier & Gary Koop & Justin Tobias, 2005. "Semiparametric Bayesian inference in multiple equation models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 20(6), pages 723-747.
    14. Smith M. & Kohn R., 2002. "Parsimonious Covariance Matrix Estimation for Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1141-1153, December.
    15. Patrick J. Wolfe & Simon J. Godsill & Wee‐Jing Ng, 2004. "Bayesian variable selection and regularization for time–frequency surface estimation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(3), pages 575-589, August.
    16. Engle, Robert, 2002. "Dynamic Conditional Correlation: A Simple Class of Multivariate Generalized Autoregressive Conditional Heteroskedasticity Models," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(3), pages 339-350, July.
    17. Michael Smith & Chi‐Ming Wong & Robert Kohn, 1998. "Additive nonparametric regression with autocorrelated errors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 311-331.
    18. Chib, Siddhartha & Jeliazkov, Ivan, 2006. "Inference in Semiparametric Dynamic Models for Binary Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 685-700, June.
    19. Cottet R. & Smith M., 2003. "Bayesian Modeling and Forecasting of Intraday Electricity Load," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 839-849, January.
    20. Wong, Chi-ming & Kohn, Robert, 1996. "A Bayesian approach to additive semiparametric regression," Journal of Econometrics, Elsevier, vol. 74(2), pages 209-235, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shively, Thomas S. & Walker, Stephen G. & Damien, Paul, 2011. "Nonparametric function estimation subject to monotonicity, convexity and other shape constraints," Journal of Econometrics, Elsevier, vol. 161(2), pages 166-181, April.
    2. Duchwan Ryu & Erning Li & Bani K. Mallick, 2011. "Bayesian Nonparametric Regression Analysis of Data with Random Effects Covariates from Longitudinal Measurements," Biometrics, The International Biometric Society, vol. 67(2), pages 454-466, June.
    3. Aijun Yang & Xuejun Jiang & Lianjie Shu & Jinguan Lin, 2017. "Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis," Computational Statistics, Springer, vol. 32(1), pages 127-143, March.
    4. Shao, Zhen & Gao, Fei & Zhang, Qiang & Yang, Shan-Lin, 2015. "Multivariate statistical and similarity measure based semiparametric modeling of the probability distribution: A novel approach to the case study of mid-long term electricity consumption forecasting i," Applied Energy, Elsevier, vol. 156(C), pages 502-518.
    5. Min Wang & Xiaoqian Sun & Tao Lu, 2015. "Bayesian structured variable selection in linear regression models," Computational Statistics, Springer, vol. 30(1), pages 205-229, March.
    6. Stefan Lang & Nikolaus Umlauf & Peter Wechselberger & Kenneth Harttgen & Thomas Kneib, 2012. "Multilevel structured additive regression," Working Papers 2012-07, Faculty of Economics and Statistics, Universität Innsbruck.
    7. Panagiotelis, Anastasios & Smith, Michael, 2010. "Bayesian skew selection for multivariate models," Computational Statistics & Data Analysis, Elsevier, vol. 54(7), pages 1824-1839, July.
    8. Chen, Xue-Dong & Tang, Nian-Sheng, 2010. "Bayesian analysis of semiparametric reproductive dispersion mixed-effects models," Computational Statistics & Data Analysis, Elsevier, vol. 54(9), pages 2145-2158, September.
    9. Peter J. Danaher & Michael S. Smith, 2011. "Modeling Multivariate Distributions Using Copulas: Applications in Marketing," Marketing Science, INFORMS, vol. 30(1), pages 4-21, 01-02.
    10. Shao, Zhen & Chao, Fu & Yang, Shan-Lin & Zhou, Kai-Le, 2017. "A review of the decomposition methodology for extracting and identifying the fluctuation characteristics in electricity demand forecasting," Renewable and Sustainable Energy Reviews, Elsevier, vol. 75(C), pages 123-136.
    11. Fabian Scheipl & Thomas Kneib & Ludwig Fahrmeir, 2013. "Penalized likelihood and Bayesian function selection in regression models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 349-385, October.
    12. Bin Jiang & Anastasios Panagiotelis & George Athanasopoulos & Rob Hyndman & Farshid Vahid, 2016. "Bayesian Rank Selection in Multivariate Regression," Monash Econometrics and Business Statistics Working Papers 6/16, Monash University, Department of Econometrics and Business Statistics.
    13. Aijun Yang & Ju Xiang & Lianjie Shu & Hongqiang Yang, 2018. "Sparse Bayesian Variable Selection with Correlation Prior for Forecasting Macroeconomic Variable using Highly Correlated Predictors," Computational Economics, Springer;Society for Computational Economics, vol. 51(2), pages 323-338, February.
    14. Yang Aijun & Xiang Ju & Yang Hongqiang & Lin Jinguan, 2018. "Sparse Bayesian Variable Selection in Probit Model for Forecasting U.S. Recessions Using a Large Set of Predictors," Computational Economics, Springer;Society for Computational Economics, vol. 51(4), pages 1123-1138, April.
    15. Bernardi, Mauro & Costola, Michele, 2019. "High-dimensional sparse financial networks through a regularised regression model," SAFE Working Paper Series 244, Leibniz Institute for Financial Research SAFE.
    16. Zhao, Kaifeng & Lian, Heng, 2016. "The Expectation–Maximization approach for Bayesian quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 96(C), pages 1-11.
    17. Xin-Yuan Song & Zhao-Hua Lu & Jing-Heng Cai & Edward Ip, 2013. "A Bayesian Modeling Approach for Generalized Semiparametric Structural Equation Models," Psychometrika, Springer;The Psychometric Society, vol. 78(4), pages 624-647, October.
    18. Felix Heinzl & Ludwig Fahrmeir & Thomas Kneib, 2012. "Additive mixed models with Dirichlet process mixture and P-spline priors," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 96(1), pages 47-68, January.
    19. Mestekemper, Thomas & Kauermann, Göran & Smith, Michael S., 2013. "A comparison of periodic autoregressive and dynamic factor models in intraday energy demand forecasting," International Journal of Forecasting, Elsevier, vol. 29(1), pages 1-12.
    20. Smith, Michael S. & Kauermann, Göran, 2011. "Bicycle commuting in Melbourne during the 2000s energy crisis: A semiparametric analysis of intraday volumes," Transportation Research Part B: Methodological, Elsevier, vol. 45(10), pages 1846-1862.
    21. Aijun Yang & Yunxian Li & Niansheng Tang & Jinguan Lin, 2015. "Bayesian variable selection in multinomial probit model for classifying high-dimensional data," Computational Statistics, Springer, vol. 30(2), pages 399-418, June.
    22. Zhao, Kaifeng & Lian, Heng, 2014. "Variational inferences for partially linear additive models with variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 80(C), pages 223-239.
    23. Yuanying Zhao & Dengke Xu, 2023. "A Bayesian Variable Selection Method for Spatial Autoregressive Quantile Models," Mathematics, MDPI, vol. 11(4), pages 1-19, February.
    24. Aijun Yang & Yuzhu Tian & Yunxian Li & Jinguan Lin, 2020. "Sparse Bayesian variable selection in kernel probit model for analyzing high-dimensional data," Computational Statistics, Springer, vol. 35(1), pages 245-258, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Panagiotelis, Anastasios & Smith, Michael, 2008. "Bayesian density forecasting of intraday electricity prices using multivariate skew t distributions," International Journal of Forecasting, Elsevier, vol. 24(4), pages 710-727.
    2. Villani, Mattias & Kohn, Robert & Giordani, Paolo, 2007. "Nonparametric Regression Density Estimation Using Smoothly Varying Normal Mixtures," Working Paper Series 211, Sveriges Riksbank (Central Bank of Sweden).
    3. Villani, Mattias & Kohn, Robert & Giordani, Paolo, 2009. "Regression density estimation using smooth adaptive Gaussian mixtures," Journal of Econometrics, Elsevier, vol. 153(2), pages 155-173, December.
    4. Gael M. Martin & David T. Frazier & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2023. "Bayesian Forecasting in the 21st Century: A Modern Review," Monash Econometrics and Business Statistics Working Papers 1/23, Monash University, Department of Econometrics and Business Statistics.
    5. Gael M. Martin & David T. Frazier & Worapree Maneesoonthorn & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2022. "Bayesian Forecasting in Economics and Finance: A Modern Review," Papers 2212.03471, arXiv.org, revised Jul 2023.
    6. Bin Jiang & Anastasios Panagiotelis & George Athanasopoulos & Rob Hyndman & Farshid Vahid, 2016. "Bayesian Rank Selection in Multivariate Regression," Monash Econometrics and Business Statistics Working Papers 6/16, Monash University, Department of Econometrics and Business Statistics.
    7. Vaz, Lucélia Viviane & Filho, Getulio Borges da Silveira, 2017. "Functional Autoregressive Models: An Application to Brazilian Hourly Electricity Load," Brazilian Review of Econometrics, Sociedade Brasileira de Econometria - SBE, vol. 37(2), November.
    8. Dimitris Korobilis, 2008. "Forecasting in vector autoregressions with many predictors," Advances in Econometrics, in: Bayesian Econometrics, pages 403-431, Emerald Group Publishing Limited.
    9. Jaume Rosselló Nadal & Mohcine Bakhat, 2009. "A new approach to estimating tourism-induced electricity consumption," CRE Working Papers (Documents de treball del CRE) 2009/6, Centre de Recerca Econòmica (UIB ·"Sa Nostra").
    10. Mestekemper, Thomas & Kauermann, Göran & Smith, Michael S., 2013. "A comparison of periodic autoregressive and dynamic factor models in intraday energy demand forecasting," International Journal of Forecasting, Elsevier, vol. 29(1), pages 1-12.
    11. Cancelo, José Ramón & Espasa, Antoni & Grafe, Rosmarie, 2007. "Forecasting from one day to one week ahead for the Spanish system operator," DES - Working Papers. Statistics and Econometrics. WS ws078418, Universidad Carlos III de Madrid. Departamento de Estadística.
    12. Pena, Daniel & Redondas, Dolores, 2006. "Bayesian curve estimation by model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 50(3), pages 688-709, February.
    13. Dordonnat, V. & Koopman, S.J. & Ooms, M. & Dessertaine, A. & Collet, J., 2008. "An hourly periodic state space model for modelling French national electricity load," International Journal of Forecasting, Elsevier, vol. 24(4), pages 566-587.
    14. Ouysse, Rachida & Kohn, Robert, 2010. "Bayesian variable selection and model averaging in the arbitrage pricing theory model," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3249-3268, December.
    15. Rong Chen & John L. Harris & Jun M. Liu & Lon-Mu Liu, 2006. "A semi-parametric time series approach in modeling hourly electricity loads," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 25(8), pages 537-559.
    16. Li Ma, 2015. "Scalable Bayesian Model Averaging Through Local Information Propagation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 795-809, June.
    17. Kim, Myung Suk, 2013. "Modeling special-day effects for forecasting intraday electricity demand," European Journal of Operational Research, Elsevier, vol. 230(1), pages 170-180.
    18. Li, Feng & Kang, Yanfei, 2018. "Improving forecasting performance using covariate-dependent copula models," International Journal of Forecasting, Elsevier, vol. 34(3), pages 456-476.
    19. Zhang, Xibin & King, Maxwell L. & Shang, Han Lin, 2014. "A sampling algorithm for bandwidth estimation in a nonparametric regression model with a flexible error density," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 218-234.
    20. Smith, Michael & Kohn, Robert, 2000. "Nonparametric seemingly unrelated regression," Journal of Econometrics, Elsevier, vol. 98(2), pages 257-281, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:143:y:2008:i:2:p:291-316. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.