IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v61y2020i5d10.1007_s00362-018-1027-8.html
   My bibliography  Save this article

Sparse directed acyclic graphs incorporating the covariates

Author

Listed:
  • Xiao Guo

    (Northwest University)

  • Hai Zhang

    (Northwest University
    Macau University of Science and Technology)

Abstract

Directed acyclic graphs (DAGs) have been widely used to model the causal relationships among variables using multivariate data. However, covariates are often available together with these data which may influence the underlying causal network. Motivated by such kind of data, in this paper, we incorporate the covariates directly into the DAGs to model the dependency relationships among nodal variables. Specifically, the causal strengths are assumed to be a linear function of the covariates, which enhances the interpretability and flexibility of the model. We fit the model in the $$l_1$$ l 1 penalized maximum likelihood framework and employ a coordinate descent based algorithm to solve the resulting optimization problem. The consistency of the estimator are also established under the regime where the order of nodal variables are known. Finally, we evaluate the performance of the proposed method through a series of simulations and a lung cancer data example.

Suggested Citation

  • Xiao Guo & Hai Zhang, 2020. "Sparse directed acyclic graphs incorporating the covariates," Statistical Papers, Springer, vol. 61(5), pages 2119-2148, October.
  • Handle: RePEc:spr:stpapr:v:61:y:2020:i:5:d:10.1007_s00362-018-1027-8
    DOI: 10.1007/s00362-018-1027-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-018-1027-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-018-1027-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yang Ni & Francesco C. Stingo & Veerabhadran Baladandayuthapani, 2017. "Sparse Multi-Dimensional Graphical Models: A Unified Bayesian Framework," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 779-793, April.
    2. Min Jin Ha & Wei Sun & Jichun Xie, 2016. "PenPC : A two-step approach to estimate the skeletons of high-dimensional directed acyclic graphs," Biometrics, The International Biometric Society, vol. 72(1), pages 146-155, March.
    3. Mengjie Chen & Zhao Ren & Hongyu Zhao & Harrison Zhou, 2016. "Asymptotically Normal and Efficient Estimation of Covariate-Adjusted Gaussian Graphical Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(513), pages 394-406, March.
    4. Lam, Clifford & Fan, Jianqing, 2009. "Sparsistency and rates of convergence in large covariance matrix estimation," LSE Research Online Documents on Economics 31540, London School of Economics and Political Science, LSE Library.
    5. J. Peters & P. Bühlmann, 2014. "Identifiability of Gaussian structural equation models with equal error variances," Biometrika, Biometrika Trust, vol. 101(1), pages 219-228.
    6. Ali Shojaie & Alexandra Jauhiainen & Michael Kallitsis & George Michailidis, 2014. "Inferring Regulatory Networks by Combining Perturbation Screens and Steady State Gene Expression Profiles," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-16, February.
    7. Chenlei Leng & Cheng Yong Tang, 2012. "Sparse Matrix Graphical Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1187-1200, September.
    8. Jie Cheng & Elizaveta Levina & Pei Wang & Ji Zhu, 2014. "A sparse ising model with covariates," Biometrics, The International Biometric Society, vol. 70(4), pages 943-953, December.
    9. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    10. Peng, Jie & Wang, Pei & Zhou, Nengfeng & Zhu, Ji, 2009. "Partial Correlation Estimation by Joint Sparse Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 735-746.
    11. T. Tony Cai & Hongzhe Li & Weidong Liu & Jichun Xie, 2013. "Covariate-adjusted precision matrix estimation with an application in genetical genomics," Biometrika, Biometrika Trust, vol. 100(1), pages 139-156.
    12. Ming Yuan & Yi Lin, 2007. "Model selection and estimation in the Gaussian graphical model," Biometrika, Biometrika Trust, vol. 94(1), pages 19-35.
    13. Sung Won Han & Gong Chen & Myun-Seok Cheon & Hua Zhong, 2016. "Estimation of Directed Acyclic Graphs Through Two-Stage Adaptive Lasso for Gene Network Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1004-1019, July.
    14. Fei Fu & Qing Zhou, 2013. "Learning Sparse Causal Gaussian Networks With Experimental Intervention: Regularization and Coordinate Descent," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(501), pages 288-300, March.
    15. Ali Shojaie & George Michailidis, 2010. "Penalized likelihood methods for estimation of sparse high-dimensional directed acyclic graphs," Biometrika, Biometrika Trust, vol. 97(3), pages 519-538.
    16. Cai, Tony & Liu, Weidong & Luo, Xi, 2011. "A Constrained â„“1 Minimization Approach to Sparse Precision Matrix Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 594-607.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mingao Yuan & Fan Yang & Zuofeng Shang, 2022. "Hypothesis testing in sparse weighted stochastic block model," Statistical Papers, Springer, vol. 63(4), pages 1051-1073, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huihang Liu & Xinyu Zhang, 2023. "Frequentist model averaging for undirected Gaussian graphical models," Biometrics, The International Biometric Society, vol. 79(3), pages 2050-2062, September.
    2. Xiao Guo & Hai Zhang & Yao Wang & Yong Liang, 2019. "Structure learning of sparse directed acyclic graphs incorporating the scale-free property," Computational Statistics, Springer, vol. 34(2), pages 713-742, June.
    3. Dong Liu & Changwei Zhao & Yong He & Lei Liu & Ying Guo & Xinsheng Zhang, 2023. "Simultaneous cluster structure learning and estimation of heterogeneous graphs for matrix‐variate fMRI data," Biometrics, The International Biometric Society, vol. 79(3), pages 2246-2259, September.
    4. Lafit, Ginette & Nogales Martín, Francisco Javier & Zamar, Rubén, 2015. "Ranking Edges and Model Selection in High-Dimensional Graphs," DES - Working Papers. Statistics and Econometrics. WS ws1511, Universidad Carlos III de Madrid. Departamento de Estadística.
    5. Bailey, Natalia & Pesaran, M. Hashem & Smith, L. Vanessa, 2019. "A multiple testing approach to the regularisation of large sample correlation matrices," Journal of Econometrics, Elsevier, vol. 208(2), pages 507-534.
    6. Tan, Kean Ming & Witten, Daniela & Shojaie, Ali, 2015. "The cluster graphical lasso for improved estimation of Gaussian graphical models," Computational Statistics & Data Analysis, Elsevier, vol. 85(C), pages 23-36.
    7. Jie Cheng & Elizaveta Levina & Pei Wang & Ji Zhu, 2014. "A sparse ising model with covariates," Biometrics, The International Biometric Society, vol. 70(4), pages 943-953, December.
    8. Pan, Yuqing & Mai, Qing, 2020. "Efficient computation for differential network analysis with applications to quadratic discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    9. Fan, Xinyan & Zhang, Qingzhao & Ma, Shuangge & Fang, Kuangnan, 2021. "Conditional score matching for high-dimensional partial graphical models," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    10. Liu, Weidong & Luo, Xi, 2015. "Fast and adaptive sparse precision matrix estimation in high dimensions," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 153-162.
    11. Wang, Ke & Franks, Alexander & Oh, Sang-Yun, 2023. "Learning Gaussian graphical models with latent confounders," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    12. Lafit, Ginette & Nogales Martín, Francisco Javier, 2017. "Robust and sparse estimation of high-dimensional precision matrices via bivariate outlier detection," DES - Working Papers. Statistics and Econometrics. WS 24534, Universidad Carlos III de Madrid. Departamento de Estadística.
    13. Shanghong Xie & Xiang Li & Peter McColgan & Rachael I. Scahill & Donglin Zeng & Yuanjia Wang, 2020. "Identifying disease‐associated biomarker network features through conditional graphical model," Biometrics, The International Biometric Society, vol. 76(3), pages 995-1006, September.
    14. Luo, Shan & Chen, Zehua, 2014. "Edge detection in sparse Gaussian graphical models," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 138-152.
    15. Pei Wang & Shunjie Chen & Sijia Yang, 2022. "Recent Advances on Penalized Regression Models for Biological Data," Mathematics, MDPI, vol. 10(19), pages 1-24, October.
    16. Khai X. Chiong & Hyungsik Roger Moon, 2017. "Estimation of Graphical Models using the $L_{1,2}$ Norm," Papers 1709.10038, arXiv.org, revised Oct 2017.
    17. Guanghui Cheng & Zhengjun Zhang & Baoxue Zhang, 2017. "Test for bandedness of high-dimensional precision matrices," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(4), pages 884-902, October.
    18. Wang, Luheng & Chen, Zhao & Wang, Christina Dan & Li, Runze, 2020. "Ultrahigh dimensional precision matrix estimation via refitted cross validation," Journal of Econometrics, Elsevier, vol. 215(1), pages 118-130.
    19. Fangting Zhou & Kejun He & Kunbo Wang & Yanxun Xu & Yang Ni, 2023. "Functional Bayesian networks for discovering causality from multivariate functional data," Biometrics, The International Biometric Society, vol. 79(4), pages 3279-3293, December.
    20. Yang, Yuehan & Xia, Siwei & Yang, Hu, 2023. "Multivariate sparse Laplacian shrinkage for joint estimation of two graphical structures," Computational Statistics & Data Analysis, Elsevier, vol. 178(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:61:y:2020:i:5:d:10.1007_s00362-018-1027-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.