IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.03171.html

Flexible Imputation of Incomplete Network Data

Author

Listed:
  • Ge Sun
  • Weisheng Zhang

Abstract

Sampled network data are widely used in empirical research because collecting complete network information is costly. However, empirical analyses based on sampled networks may lead to biased estimators. We propose a nonparametric imputation method for sampled networks and show that empirical analyses based on imputed networks yield consistent estimates. Our approach imputes missing network links by combining a projection onto covariates with a local two-way fixed-effects regression. The method avoids parametric assumptions, does not rely on low-rank restrictions, and flexibly accommodates both observed covariates and unobserved heterogeneity. We establish entrywise convergence rates for the imputed matrix and prove the consistency of generalized method of moments (GMM) estimators based on imputed networks. We further derive the convergence rate of the corresponding estimator in the linear-in-means peer-effects model. Simulations show strong performance of our method both in terms of imputation accuracy and in downstream empirical analysis. We illustrate our method with an application to the microfinance network data of Banerjee et al. (2013).

Suggested Citation

  • Ge Sun & Weisheng Zhang, 2026. "Flexible Imputation of Incomplete Network Data," Papers 2604.03171, arXiv.org, revised May 2026.
  • Handle: RePEc:arx:papers:2604.03171
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2604.03171
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sebastian Calonico & Matias D. Cattaneo & Max H. Farrell, 2018. "On the Effect of Bias Estimation on Coverage Accuracy in Nonparametric Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 767-779, April.
    2. Martin Mugnier, 2022. "A Simple and Computationally Trivial Estimator for Grouped Fixed Effects Models," Papers 2203.08879, arXiv.org, revised Apr 2025.
    3. Martin Mugnier, 2025. "A simple and computationally trivial estimator for grouped fixed effects models," Post-Print halshs-05163274, HAL.
    4. Angelo Mele, 2017. "A Structural Model of Dense Network Formation," Econometrica, Econometric Society, vol. 85, pages 825-850, May.
    5. Andreas Dzemski, 2019. "An Empirical Model of Dyadic Link Formation in a Network with Unobserved Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 101(5), pages 763-776, December.
    6. Beaman, Lori & Dillon, Andrew, 2018. "Diffusion of agricultural information within social networks: Evidence on gender inequalities from Mali," Journal of Development Economics, Elsevier, vol. 133(C), pages 147-161.
    7. Mugnier, Martin, 2025. "A simple and computationally trivial estimator for grouped fixed effects models," Journal of Econometrics, Elsevier, vol. 250(C).
    8. Tianxi Cai & T. Tony Cai & Anru Zhang, 2016. "Structured Matrix Completion with Applications to Genomic Data Integration," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 621-633, April.
    9. Alan Griffith, 2022. "Name Your Friends, but Only Five? The Importance of Censoring in Peer Effects Estimates Using Social Network Data," Journal of Labor Economics, University of Chicago Press, vol. 40(4), pages 779-805.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrei Zeleneev & Weisheng Zhang, 2025. "Tractable Estimation of Nonlinear Panels with Interactive Fixed Effects," Papers 2511.15427, arXiv.org, revised Jun 2026.
    2. Aristide Houndetoungan & Cristelle Kouame & Michael Vlassopoulos, 2024. "Identifying Peer Effects in Networks with Unobserved Effort and Isolated Students," Papers 2405.06850, arXiv.org, revised Apr 2026.
    3. Alex Centeno, 2022. "A Structural Model for Detecting Communities in Networks," Papers 2209.08380, arXiv.org, revised Oct 2022.
    4. Gao, Wayne Yuan & Li, Ming & Xu, Sheng, 2023. "Logical differencing in dyadic network formation models with nontransferable utilities," Journal of Econometrics, Elsevier, vol. 235(1), pages 302-324.
    5. Graham, Bryan S., 2020. "Network data," Handbook of Econometrics,, Elsevier.
    6. Patacchini, Eleonora & Hsieh, Chih-Sheng & Lin, Xu, 2019. "Social Interaction Methods," CEPR Discussion Papers 14141, Centre for Economic Policy Research.
    7. Tadao Hoshino, 2020. "A Pairwise Strategic Network Formation Model with Group Heterogeneity: With an Application to International Travel," Papers 2012.14886, arXiv.org, revised Feb 2021.
    8. Luis E. Candelaria, 2020. "A Semiparametric Network Formation Model with Unobserved Linear Heterogeneity," Papers 2007.05403, arXiv.org, revised Aug 2020.
    9. Ming Li & Zhentao Shi & Yapeng Zheng, 2024. "Bagging the Network," Papers 2410.23852, arXiv.org, revised May 2026.
    10. Candelaria, Luis E. & Ura, Takuya, 2023. "Identification and inference of network formation games with misclassified links," Journal of Econometrics, Elsevier, vol. 235(2), pages 862-891.
    11. Candelaria, Luis E., 2020. "A Semiparametric Network Formation Model with Unobserved Linear Heterogeneity," The Warwick Economics Research Paper Series (TWERPS) 1279, University of Warwick, Department of Economics.
    12. Vincent Boucher & Aristide Houndetoungan, 2025. "Estimating Peer Effects Using Partial Network Data," Papers 2509.08145, arXiv.org.
    13. L. Sanna Stephan, 2024. "Semiparametric Estimation of Individual Coefficients in a Dyadic Link Formation Model Lacking Observable Characteristics," Papers 2408.04552, arXiv.org.
    14. S Anukriti & Catalina Herrera‐Almanza & Praveen K. Pathak & Mahesh Karra, 2020. "Curse of the Mummy‐ji: The Influence of Mothers‐in‐Law on Women in India†," American Journal of Agricultural Economics, John Wiley & Sons, vol. 102(5), pages 1328-1351, October.
    15. Awudu Abdulai, 2023. "Information acquisition and the adoption of improved crop varieties," American Journal of Agricultural Economics, John Wiley & Sons, vol. 105(4), pages 1049-1062, August.
    16. Rahul Singh & Moses Stewart, 2025. "Placebo Discontinuity Design," Papers 2507.12693, arXiv.org.
    17. Markus Kinateder & Luca Paolo Merlino, 2021. "The Evolution of Networks and Local Public Good Provision: A Potential Approach," Games, MDPI, vol. 12(3), pages 1-12, July.
    18. Cl'ement de Chaisemartin & Diego Ciccia & Xavier D'Haultf{oe}uille & Felix Knau, 2024. "Difference-in-Differences Estimators When No Unit Remains Untreated," Papers 2405.04465, arXiv.org, revised Apr 2026.
    19. Balila Acurio & Alessandro Tomarchio, 2024. "The Effects of Business Credit Support Programs: Evidence from a Regression Discontinuity Design," IHEID Working Papers 20-2024, Economics Section, The Graduate Institute of International Studies.
    20. Britto, Diogo G.C. & Fiorin, Stefano, 2020. "Corruption and legislature size: Evidence from Brazil," European Journal of Political Economy, Elsevier, vol. 65(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.03171. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.