IDEAS home Printed from https://ideas.repec.org/a/oup/biomet/v109y2022i1p85-102..html
   My bibliography  Save this article

Dimension reduction for covariates in network data
[On semidefinite relaxations for the block model]

Author

Listed:
  • Junlong Zhao
  • Xiumin Liu
  • Hansheng Wang
  • Chenlei Leng

Abstract

SummaryA problem of major interest in network data analysis is to explain the strength of connections using context information. To achieve this, we introduce a novel approach, called network-supervised dimension reduction, in which covariates are projected onto low-dimensional spaces to reveal the linkage pattern without assuming a model. We propose a new loss function for estimating the parameters in the resulting linear projection, based on the notion that closer proximity in the low-dimension projection corresponds to stronger connections. Interestingly, the convergence rate of our estimator is found to depend on a network effect factor, which is the smallest number that can partition a graph in a manner similar to the graph colouring problem. Our method has interesting connections to principal component analysis and linear discriminant analysis, which we exploit for clustering and community detection. The proposed approach is further illustrated by numerical experiments and analysis of a pulsar candidates dataset from astronomy.

Suggested Citation

  • Junlong Zhao & Xiumin Liu & Hansheng Wang & Chenlei Leng, 2022. "Dimension reduction for covariates in network data [On semidefinite relaxations for the block model]," Biometrika, Biometrika Trust, vol. 109(1), pages 85-102.
  • Handle: RePEc:oup:biomet:v:109:y:2022:i:1:p:85-102.
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1093/biomet/asab006
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ting Yan & Binyan Jiang & Stephen E. Fienberg & Chenlei Leng, 2019. "Statistical Inference in a Directed Network Model With Covariates," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 857-868, April.
    2. Emily M. Jin & Michelle Girvan & M. E. J. Newman, 2001. "The Structure of Growing Social Networks," Working Papers 01-06-032, Santa Fe Institute.
    3. N. Binkiewicz & J. T. Vogelstein & K. Rohe, 2017. "Covariate-assisted spectral clustering," Biometrika, Biometrika Trust, vol. 104(2), pages 361-377.
    4. Bryan S. Graham, 2017. "An Econometric Model of Network Formation With Degree Heterogeneity," Econometrica, Econometric Society, vol. 85, pages 1033-1063, July.
    5. Lam, Clifford & Yao, Qiwei, 2012. "Factor modeling for high-dimensional time series: inference for the number of factors," LSE Research Online Documents on Economics 45684, London School of Economics and Political Science, LSE Library.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ma, Shujie & Su, Liangjun & Zhang, Yichong, 2020. "Detecting Latent Communities in Network Formation Models," Economics and Statistics Working Papers 12-2020, Singapore Management University, School of Economics.
    2. Qiuping Wang & Yuan Zhang & Ting Yan, 2023. "Asymptotic theory in network models with covariates and a growing number of node parameters," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(2), pages 369-392, April.
    3. Junhui Cai & Dan Yang & Wu Zhu & Haipeng Shen & Linda Zhao, 2021. "Network regression and supervised centrality estimation," Papers 2111.12921, arXiv.org.
    4. Chen, Mingli & Fernández-Val, Iván & Weidner, Martin, 2021. "Nonlinear factor models for network and panel data," Journal of Econometrics, Elsevier, vol. 220(2), pages 296-324.
    5. David W. Hughes, 2021. "Estimating Nonlinear Network Data Models with Fixed Effects," Boston College Working Papers in Economics 1058, Boston College Department of Economics.
    6. Gao, Wayne Yuan & Li, Ming & Xu, Sheng, 2023. "Logical differencing in dyadic network formation models with nontransferable utilities," Journal of Econometrics, Elsevier, vol. 235(1), pages 302-324.
    7. Andreas Dzemski, 2019. "An Empirical Model of Dyadic Link Formation in a Network with Unobserved Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 101(5), pages 763-776, December.
    8. Bartolucci, Francesco & Pigini, Claudia & Valentini, Francesco, 2021. "MCMC Conditional Maximum Likelihood for the two-way fixed-effects logit," MPRA Paper 110034, University Library of Munich, Germany.
    9. Mingli Chen & Kengo Kato & Chenlei Leng, 2021. "Analysis of networks via the sparse β‐model," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(5), pages 887-910, November.
    10. Gao, Jiti & Liu, Fei & Peng, Bin & Yan, Yayi, 2023. "Binary response models for heterogeneous panel data with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 235(2), pages 1654-1679.
    11. Tadao Hoshino, 2020. "A Pairwise Strategic Network Formation Model with Group Heterogeneity: With an Application to International Travel," Papers 2012.14886, arXiv.org, revised Feb 2021.
    12. Luis E. Candelaria, 2020. "A Semiparametric Network Formation Model with Unobserved Linear Heterogeneity," Papers 2007.05403, arXiv.org, revised Aug 2020.
    13. Jing Luo & Tour Liu & Qiuping Wang, 2022. "Affiliation weighted networks with a differentially private degree sequence," Statistical Papers, Springer, vol. 63(2), pages 367-395, April.
    14. Candelaria, Luis E., 2020. "A Semiparametric Network Formation Model with Unobserved Linear Heterogeneity," The Warwick Economics Research Paper Series (TWERPS) 1279, University of Warwick, Department of Economics.
    15. S Chandna & S C Olhede & P J Wolfe, 2022. "Local linear graphon estimation using covariates [Representations for partially exchangeable arrays of random variables]," Biometrika, Biometrika Trust, vol. 109(3), pages 721-734.
    16. Marco Battaglini & Eleonora Patacchini & Edoardo Rainone, 2019. "Endogenous Social Connections in Legislatures," NBER Working Papers 25988, National Bureau of Economic Research, Inc.
    17. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    18. Mathieu Lambotte & Sandrine Mathy & Anna Risch & Carole Treibich, 2022. "Spreading active transportation: peer effects and key players in the workplace," Working Papers 2022-02, Grenoble Applied Economics Laboratory (GAEL).
    19. Yuefeng Han & Rong Chen & Dan Yang & Cun-Hui Zhang, 2020. "Tensor Factor Model Estimation by Iterative Projection," Papers 2006.02611, arXiv.org, revised May 2022.
    20. Jianqing Fan & Xu Han, 2017. "Estimation of the false discovery proportion with unknown dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1143-1164, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:biomet:v:109:y:2022:i:1:p:85-102.. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press (email available below). General contact details of provider: https://academic.oup.com/biomet .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.