IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2511.00612.html
   My bibliography  Save this paper

Improving control over unobservables with network data

Author

Listed:
  • Vincent Starck

Abstract

This paper develops a method to conduct causal inference in the presence of unobserved confounders by leveraging networks with homophily, a frequently observed tendency to form edges with similar nodes. I introduce a concept of asymptotic homophily, according to which individuals' selectivity scales with the size of the potential connection pool. It contributes to the network formation literature with a model that can accommodate common empirical features such as homophily, degree heterogeneity, sparsity, and clustering, and provides a framework to obtain consistent estimators of treatment effects that are robust to selection on unobservables. I also consider an alternative setting that accommodates dense networks and show how selecting linked individuals whose observed characteristics made such a connection less likely delivers an estimator with similar properties. In an application, I recover an estimate of the effect of parental involvement on students' test scores that is greater than that of OLS, arguably due to the estimator's ability to account for unobserved ability.

Suggested Citation

  • Vincent Starck, 2025. "Improving control over unobservables with network data," Papers 2511.00612, arXiv.org.
  • Handle: RePEc:arx:papers:2511.00612
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2511.00612
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Bryan S. Graham, 2017. "An econometric model of network formation with degree heterogeneity," CeMMAP working papers 08/17, Institute for Fiscal Studies.
    2. Andreas Dzemski, 2019. "An Empirical Model of Dyadic Link Formation in a Network with Unobserved Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 101(5), pages 763-776, December.
    3. Eric Auerbach, 2022. "Identification and Estimation of a Partially Linear Regression Model Using Network Data," Econometrica, Econometric Society, vol. 90(1), pages 347-365, January.
    4. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    5. Vincent Boucher & Ismael Mourifié, 2017. "My friend far, far away: a random field approach to exponential random graph models," Econometrics Journal, Royal Economic Society, vol. 20(3), pages 14-46, October.
    6. Sergio Currarini & Matthew O. Jackson & Paolo Pin, 2009. "An Economic Model of Friendship: Homophily, Minorities, and Segregation," Econometrica, Econometric Society, vol. 77(4), pages 1003-1045, July.
    7. Bekker, Paul A, 1994. "Alternative Approximations to the Distributions of Instrumental Variable Estimators," Econometrica, Econometric Society, vol. 62(3), pages 657-681, May.
    8. Bryan S. Graham, 2017. "An Econometric Model of Network Formation With Degree Heterogeneity," Econometrica, Econometric Society, vol. 85, pages 1033-1063, July.
    9. Guido W. Imbens, 2004. "Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 4-29, February.
    10. Angelo Mele, 2022. "A Structural Model of Homophily and Clustering in Social Networks," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(3), pages 1377-1389, June.
    11. Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881, January.
    12. Lin, Zhexiao & Han, Fang, 2025. "On regression-adjusted imputation estimators of average treatment effects," Journal of Econometrics, Elsevier, vol. 251(C).
    13. Chih‐Sheng Hsieh & Lung Fei Lee, 2016. "A Social Interactions Model with Endogenous Friendship Formation and Selectivity," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(2), pages 301-319, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gao, Wayne Yuan & Li, Ming & Xu, Sheng, 2023. "Logical differencing in dyadic network formation models with nontransferable utilities," Journal of Econometrics, Elsevier, vol. 235(1), pages 302-324.
    2. Graham, Bryan S., 2020. "Network data," Handbook of Econometrics,, Elsevier.
    3. Zuckerman, David, 2024. "Multidimensional homophily," Journal of Economic Behavior & Organization, Elsevier, vol. 218(C), pages 486-513.
    4. Candelaria, Luis E. & Ura, Takuya, 2023. "Identification and inference of network formation games with misclassified links," Journal of Econometrics, Elsevier, vol. 235(2), pages 862-891.
    5. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    6. Chih‐Sheng Hsieh & Lung‐Fei Lee & Vincent Boucher, 2020. "Specification and estimation of network formation and network interaction models with the exponential probability distribution," Quantitative Economics, Econometric Society, vol. 11(4), pages 1349-1390, November.
    7. Alex Centeno, 2022. "A Structural Model for Detecting Communities in Networks," Papers 2209.08380, arXiv.org, revised Oct 2022.
    8. Aristide Houndetoungan, 2024. "Count Data Models with Heterogeneous Peer Effects under Rational Expectations," Papers 2405.17290, arXiv.org, revised Dec 2025.
    9. Mugnier, Martin, 2025. "A simple and computationally trivial estimator for grouped fixed effects models," Journal of Econometrics, Elsevier, vol. 250(C).
    10. Ming Li & Zhentao Shi & Yapeng Zheng, 2024. "Bagging the Network," Papers 2410.23852, arXiv.org, revised Sep 2025.
    11. Luis Alvarez & Cristine Pinto & Vladimir Ponczek, 2022. "Homophily in preferences or meetings? Identifying and estimating an iterative network formation model," Papers 2201.06694, arXiv.org, revised Mar 2025.
    12. Chernozhukov, Victor & Fernández-Val, Iván & Weidner, Martin, 2024. "Network and panel quantile effects via distribution regression," Journal of Econometrics, Elsevier, vol. 240(2).
    13. Gaonkar, Shweta & Mele, Angelo, 2023. "A model of inter-organizational network formation," Journal of Economic Behavior & Organization, Elsevier, vol. 214(C), pages 82-104.
    14. Guido W. Imbens, 2022. "Causality in Econometrics: Choice vs Chance," Econometrica, Econometric Society, vol. 90(6), pages 2541-2566, November.
    15. Michael P. Leung, 2019. "Inference in Models of Discrete Choice with Social Interactions Using Network Data," Papers 1911.07106, arXiv.org.
    16. Claudia Pigini & Alessandro Pionati & Francesco Valentini, 2025. "Grouped fixed effects regularization for binary choice models," Papers 2502.06446, arXiv.org, revised Nov 2025.
    17. Rahul Singh & Liyuan Xu & Arthur Gretton, 2020. "Kernel Methods for Causal Functions: Dose, Heterogeneous, and Incremental Response Curves," Papers 2010.04855, arXiv.org, revised Oct 2022.
    18. Lin, Zhexiao & Han, Fang, 2025. "On regression-adjusted imputation estimators of average treatment effects," Journal of Econometrics, Elsevier, vol. 251(C).
    19. Guido W. Imbens, 2015. "Matching Methods in Practice: Three Examples," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 373-419.
    20. Qiuping Wang & Yuan Zhang & Ting Yan, 2023. "Asymptotic theory in network models with covariates and a growing number of node parameters," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(2), pages 369-392, April.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2511.00612. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.