IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2405.17787.html

Dyadic Regression with Sample Selection

Author

Listed:
  • Kensuke Sakamoto

Abstract

This paper addresses the sample selection problem in panel dyadic regression analysis. Dyadic data often include many zeros in the main outcomes due to the underlying network formation process. This not only contaminates popular estimators used in practice but also complicates the inference due to the dyadic dependence structure. We extend Kyriazidou (1997)'s approach to dyadic data and characterize the asymptotic distribution of our proposed estimator. The convergence rates are $\sqrt{n}$ or $\sqrt{n^{2}h_{n}}$, depending on the degeneracy of the H\'{a}jek projection part of the estimator, where $n$ is the number of nodes and $h_{n}$ is a bandwidth. We propose a bias-corrected confidence interval and a variance estimator that adapts to the degeneracy. A Monte Carlo simulation shows the good finite sample performance of our estimator and highlights the importance of bias correction in both asymptotic regimes when the fraction of zeros in outcomes varies. We illustrate our procedure using data from Moretti and Wilson (2017)'s paper on migration.

Suggested Citation

  • Kensuke Sakamoto, 2024. "Dyadic Regression with Sample Selection," Papers 2405.17787, arXiv.org, revised Sep 2025.
  • Handle: RePEc:arx:papers:2405.17787
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2405.17787
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Gabriel Jiménez & Steven Ongena & José‐Luis Peydró & Jesús Saurina, 2014. "Hazardous Times for Monetary Policy: What Do Twenty‐Three Million Bank Loans Say About the Effects of Monetary Policy on Credit Risk‐Taking?," Econometrica, Econometric Society, vol. 82(2), pages 463-505, March.
    2. Luis E. Candelaria, 2020. "A Semiparametric Network Formation Model with Unobserved Linear Heterogeneity," Papers 2007.05403, arXiv.org, revised Aug 2020.
    3. Horowitz, Joel L, 1992. "A Smoothed Maximum Score Estimator for the Binary Response Model," Econometrica, Econometric Society, vol. 60(3), pages 505-531, May.
    4. Candelaria, Luis E., 2020. "A Semiparametric Network Formation Model with Unobserved Linear Heterogeneity," The Warwick Economics Research Paper Series (TWERPS) 1279, University of Warwick, Department of Economics.
    5. Ahn, Hyungtaik & Powell, James L., 1993. "Semiparametric estimation of censored selection models with a nonparametric selection mechanism," Journal of Econometrics, Elsevier, vol. 58(1-2), pages 3-29, July.
    6. Eric Auerbach, 2022. "Identification and Estimation of a Partially Linear Regression Model Using Network Data," Econometrica, Econometric Society, vol. 90(1), pages 347-365, January.
    7. Gary Chamberlain, 1980. "Analysis of Covariance with Qualitative Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 47(1), pages 225-238.
    8. Elhanan Helpman & Marc Melitz & Yona Rubinstein, 2008. "Estimating Trade Flows: Trading Partners and Trading Volumes," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 123(2), pages 441-487.
    9. Hall, Peter, 1984. "Central limit theorem for integrated square error of multivariate nonparametric density estimators," Journal of Multivariate Analysis, Elsevier, vol. 14(1), pages 1-16, February.
    10. Head, Keith & Mayer, Thierry, 2014. "Gravity Equations: Workhorse,Toolkit, and Cookbook," Handbook of International Economics, in: Gopinath, G. & Helpman, . & Rogoff, K. (ed.), Handbook of International Economics, edition 1, volume 4, chapter 0, pages 131-195, Elsevier.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Charlier, Erwin & Melenberg, Bertrand & van Soest, Arthur, 2001. "An analysis of housing expenditure using semiparametric models and panel data," Journal of Econometrics, Elsevier, vol. 101(1), pages 71-107, March.
    2. David W. Hughes, 2022. "A jackknife bias correction for nonlinear network data models with fixed effects," Papers 2203.15603, arXiv.org, revised Nov 2025.
    3. Mugnier, Martin & Wang, Ao, 2024. "Fixed Effects Nonlinear Panel Models with Heterogeneous Slopes : Identification and Consistency," The Warwick Economics Research Paper Series (TWERPS) 1531, University of Warwick, Department of Economics.
    4. Aradillas-Lopez, Andres, 2012. "Pairwise-difference estimation of incomplete information games," Journal of Econometrics, Elsevier, vol. 168(1), pages 120-140.
    5. Malikov, Emir & Kumbhakar, Subal C. & Sun, Yiguo, 2016. "Varying coefficient panel data model in the presence of endogenous selectivity and fixed effects," Journal of Econometrics, Elsevier, vol. 190(2), pages 233-251.
    6. David W. Hughes, 2021. "Estimating Nonlinear Network Data Models with Fixed Effects," Boston College Working Papers in Economics 1058, Boston College Department of Economics.
    7. Manuel Arellano & Stéphane Bonhomme, 2017. "Sample Selection in Quantile Regression: A Survey," Working Papers wp2018_1702, CEMFI.
    8. Manuel Arellano & Stéphane Bonhomme, 2017. "Sample Selection in Quantile Regression: A Survey," Working Papers wp2017_1702, CEMFI.
    9. Xin, Kai & Zhang, ZhengYu & Zhou, YaHong & Zhu, PingFang, 2021. "Time-varying individual effects in a panel data probit model with an application to female labor force participation," Economic Modelling, Elsevier, vol. 95(C), pages 181-191.
    10. Mugnier, Martin & Wang, Ao, 2022. "Identification and (Fast) Estimation of Large Nonlinear Panel Models with Two-Way Fixed Effects," The Warwick Economics Research Paper Series (TWERPS) 1422, University of Warwick, Department of Economics.
    11. Alejandro Sanchez-Becerra, 2022. "The Network Propensity Score: Spillovers, Homophily, and Selection into Treatment," Papers 2209.14391, arXiv.org.
    12. Candelaria, Luis E. & Ura, Takuya, 2023. "Identification and inference of network formation games with misclassified links," Journal of Econometrics, Elsevier, vol. 235(2), pages 862-891.
    13. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355, December.
    14. Joseph G. Altonji & Rosa L. Matzkin, 2001. "Panel Data Estimators for Nonseparable Models with Endogenous Regressors," NBER Technical Working Papers 0267, National Bureau of Economic Research, Inc.
    15. Liang Chen & Garrett Johnson & Yao Luo, 2015. "Great and Small Walls of China: Distance & Chinese E-Commerce," Working Papers 15-14, NET Institute.
    16. Anirudh Shingal & Malte Ehrich, 2019. "Trade effects of standards harmonization in the EU: improved access for non-EU partners," Indian Council for Research on International Economic Relations (ICRIER) Working Paper 372, Indian Council for Research on International Economic Relations (ICRIER), New Delhi, India.
    17. Agnosteva, Delina E. & Anderson, James E. & Yotov, Yoto V., 2019. "Intra-national trade costs: Assaying regional frictions," European Economic Review, Elsevier, vol. 112(C), pages 32-50.
    18. Lahiri, Kajal & Yang, Liu, 2013. "Forecasting Binary Outcomes," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1025-1106, Elsevier.
    19. Esposito, Federico, 2022. "Demand risk and diversification through international trade," Journal of International Economics, Elsevier, vol. 135(C).
    20. Chernozhukov, Victor & Fernández-Val, Iván & Weidner, Martin, 2024. "Network and panel quantile effects via distribution regression," Journal of Econometrics, Elsevier, vol. 240(2).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2405.17787. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.