Sparse Network Asymptotics for Logistic Regression Under Possible Misspecification

My bibliography Save this article

Sparse Network Asymptotics for Logistic Regression Under Possible Misspecification

Author

Listed:

Bryan S. Graham

Registered:

Abstract

Consider a bipartite network where N consumers choose to buy or not to buy M different products. This paper considers the properties of the logit fit of the N × M array of “i‐buys‐j” purchase decisions, Y=[Yij]1≤i≤N,1≤j≤M, onto a vector of known functions of consumer and product attributes under asymptotic sequences where (i) both N and M grow large, (ii) the average number of products purchased per consumer is finite in the limit, (iii) there exists dependence across elements in the same row or same column of Y (i.e., dyadic dependence), and (iv) the true conditional probability of making a purchase may, or may not, take the assumed logit form. Condition (ii) implies that the limiting network of purchases is sparse: only a vanishing fraction of all possible purchases are actually made. Under sparse network asymptotics, I show that the parameter indexing the logit approximation solves a particular Kullback–Leibler Information Criterion (KLIC) minimization problem (defined with respect to a certain Poisson population). This finding provides a simple characterization of the logit pseudo‐true parameter under general misspecification (analogous to a (mean squared error (MSE) minimizing) linear predictor approximation of a general conditional expectation function (CEF)). With respect to sampling theory, sparseness implies that the first and last terms in an extended Hoeffding‐type variance decomposition of the score of the logit pseudo composite log‐likelihood are of equal order. In contrast, under dense network asymptotics, the last term is asymptotically negligible. Asymptotic normality of the logistic regression coefficients is shown using a martingale central limit theorem (CLT) for triangular arrays. Unlike in the dense case, the normality result derived here also holds under degeneracy of the network graphon. Relatedly, when there “happens to be” no dyadic dependence in the data set in hand, it specializes to recently derived results on the behavior of logistic regression with rare events and i.i.d. data. Simulation results suggest that sparse network asymptotics better approximate the finite network distribution of the logit estimator. A short empirical illustration, and additional calibrated Monte Carlo experiments, further illustrate the main theoretical ideas.

Suggested Citation

Bryan S. Graham, 2024. "Sparse Network Asymptotics for Logistic Regression Under Possible Misspecification," Econometrica, Econometric Society, vol. 92(6), pages 1837-1868, November.

Handle: RePEc:wly:emetrp:v:92:y:2024:i:6:p:1837-1868
DOI: 10.3982/ECTA19051

Download full text from publisher

References listed on IDEAS

Fafchamps, Marcel & Gubert, Flore, 2007. "The formation of risk sharing networks," Journal of Development Economics, Elsevier, vol. 83(2), pages 326-350, July.
- Marcel Fafchamps & Flore Gubert, 2005. "The Formation of Risk Sharing Networks," Working Papers DT/2005/13, DIAL (Développement, Institutions et Mondialisation).
- Marcel Fafchamps & Flore Gubert & IRD-Paris & DIAL, 2005. "The Formation of Risk Sharing Networks," Economics Series Working Papers GPRG-WPS-037, University of Oxford, Department of Economics.
Cattaneo, Matias D. & Crump, Richard K. & Jansson, Michael, 2014. "Small Bandwidth Asymptotics For Density-Weighted Average Derivatives," Econometric Theory, Cambridge University Press, vol. 30(1), pages 176-200, February.
- Matias D. Cattaneo & Richard K. Crump & Michael Jansson, 2008. "Small Bandwidth Asymptotics for Density-Weighted Average Derivatives," CREATES Research Papers 2008-24, Department of Economics and Business Economics, Aarhus University.
- Cattaneo, Matias D & Crump, Richard K & Jansson, Michael, 2014. "Small Bandwidth Asymptotics For Density-Weighted Average Derivatives," Department of Economics, Working Paper Series qt3jd237cg, Department of Economics, Institute for Business and Economic Research, UC Berkeley.
repec:dau:papers:123456789/4392 is not listed on IDEAS
repec:dau:papers:123456789/10840 is not listed on IDEAS
Marcel Fafchamps & Flore Gubert, 2007. "Risk Sharing and Network Formation," American Economic Review, American Economic Association, vol. 97(2), pages 75-79, May.
- Marcel Fafchamps & Flore Gubert & DIAL, 2007. "Risk Sharing and Network Formation," Economics Series Working Papers GPRG-WPS-067, University of Oxford, Department of Economics.
Laurent Davezies & Xavier D’haultfœuille & Yannick Guyonvarch, 2021. "Empirical process results for exchangeable arrays," Post-Print hal-04430851, HAL.
Aldous, David J., 1981. "Representations for partially exchangeable arrays of random variables," Journal of Multivariate Analysis, Elsevier, vol. 11(4), pages 581-598, December.
Isaiah Andrews & James H. Stock & Liyang Sun, 2019. "Weak Instruments in Instrumental Variables Regression: Theory and Practice," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 727-753, August.
Konrad Menzel, 2016. "Inference for Games with Many Players," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 83(1), pages 306-337.
Roussille, Nina & Scuderi, Benjamin, 2023. "Bidding for Talent: A Test of Conduct in a High-Wage Labor Market," IZA Discussion Papers 16352, Institute of Labor Economics (IZA).

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Susan Athey & Guido Imbens, 2025. "Identification of Average Treatment Effects in Nonparametric Panel Models," Papers 2503.19873, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Bryan S. Graham, 2020. "Sparse network asymptotics for logistic regression," Papers 2010.04703, arXiv.org.
Graham, Bryan S., 2020. "Network data," Handbook of Econometrics,, Elsevier.
Graham, Bryan S. & Niu, Fengshi & Powell, James L., 2024. "Kernel density estimation for undirected dyadic data," Journal of Econometrics, Elsevier, vol. 240(2).
- Bryan S. Graham & Fengshi Niu & James L. Powell, 2019. "Kernel density estimation for undirected dyadic data," CeMMAP working papers CWP39/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Bryan S. Graham & Fengshi Niu & James L. Powell, 2019. "Kernel Density Estimation for Undirected Dyadic Data," Papers 1907.13630, arXiv.org.
Bryan S. Graham, 2019. "Network Data," CeMMAP working papers CWP71/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Bryan S. Graham, 2019. "Network Data," Papers 1912.06346, arXiv.org.
- Bryan S. Graham, 2019. "Network Data," NBER Working Papers 26577, National Bureau of Economic Research, Inc.
Bryan S. Graham, 2019. "Dyadic Regression," Papers 1908.09029, arXiv.org.
Harold D. Chiang & Kengo Kato & Yuya Sasaki, 2023. "Inference for High-Dimensional Exchangeable Arrays," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 1595-1605, July.
- Harold D. Chiang & Kengo Kato & Yuya Sasaki, 2020. "Inference for high-dimensional exchangeable arrays," Papers 2009.05150, arXiv.org, revised Jul 2021.
Alejandro Sanchez-Becerra, 2022. "The Network Propensity Score: Spillovers, Homophily, and Selection into Treatment," Papers 2209.14391, arXiv.org.
Cui Zhang & Dandan Zhang, 2023. "Spatial Interactions and the Spread of COVID-19: A Network Perspective," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 383-405, June.
Sriroop Chaudhuri & Mimi Roy & Louis M. McDonald & Yves Emendack, 2021. "Reflections on farmers’ social networks: a means for sustainable agricultural development?," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 23(3), pages 2973-3008, March.
Kerui Du & Qilin Huang & Presley K. Wesseh, 2025. "Domestic Pollution Havens: Linking Interregional Capital Flight and Water Pollution Regulation in China," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 88(1), pages 125-161, January.
Chakraborty, Tanika & Pandey, Manish, 2021. "Temporary International Migration, Shocks and Informal Insurance: Analysis using panel data," GLO Discussion Paper Series 759, Global Labor Organization (GLO).
- Chakraborty, Tanika & Pandey, Manish, 2021. "Temporary International Migration, Shocks and Informal Insurance: Analysis Using Panel Data," IZA Discussion Papers 14051, Institute of Labor Economics (IZA).
Bryan S. Graham, 2017. "An econometric model of network formation with degree heterogeneity," CeMMAP working papers 08/17, Institute for Fiscal Studies.
Fredriksson, Per G. & Mohanty, Aatishya, 2021. "Sunlight and Culture," Journal of Economic Behavior & Organization, Elsevier, vol. 188(C), pages 757-782.
Alia Aghajanian & Patricia Justino & Jean-Pierre Tranchant, 2020. "Riots and social capital in urban India," WIDER Working Paper Series wp-2020-42, World Institute for Development Economic Research (UNU-WIDER).
- Alia Aghajanian & Patricia Justino & Jean-Pierre Tranchant, 2020. "Riots and social capital in urban India," HiCN Working Papers 325, Households in Conflict Network.
Christian Ahlin & Hyeok Jeong, 2021. "A conditional Gini: measure, estimation, and application," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 19(2), pages 363-384, June.
Aoyagi, Keitaro & Sawada, Yasuyuki & Shoji, Masahiro, 2022. "Irrigation infrastructure and trust: Evidence from natural and lab-in-the-field experiments in rural communities," World Development, Elsevier, vol. 156(C).
Joseph B. Ajefu & Ayse Demir & Padmali Rodrigo, 2023. "Covid-19-induced Shocks, Access to Basic Needs and Coping Strategies," The European Journal of Development Research, Palgrave Macmillan;European Association of Development Research and Training Institutes (EADI), vol. 35(6), pages 1347-1368, December.
Atsebi, Jean-Marc Bédhat & Ferrer-i-Carbonell, Ada, 2022. "Relative deprivation in Tanzania: Relative concerns and empathy," Journal of Economic Behavior & Organization, Elsevier, vol. 198(C), pages 389-408.
Ronak Jain & Vatsal Khandelwal, 2024. "Silent networks: the role of inaccurate beliefs in reducing useful social interactions," ECON - Working Papers 455, Department of Economics - University of Zurich.
- Ronak Jain & Vatsal Khandelwal, 2024. "Silent networks: The role of inaccurate beliefs in reducing useful social interactions," CSAE Working Paper Series 2024-06, Centre for the Study of African Economies, University of Oxford.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:emetrp:v:92:y:2024:i:6:p:1837-1868. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/essssea.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Sparse Network Asymptotics for Logistic Regression Under Possible Misspecification

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data