IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v107y2012i499p1119-1128.html
   My bibliography  Save this article

A Consistent Adjacency Spectral Embedding for Stochastic Blockmodel Graphs

Author

Listed:
  • Daniel L. Sussman
  • Minh Tang
  • Donniell E. Fishkind
  • Carey E. Priebe

Abstract

We present a method to estimate block membership of nodes in a random graph generated by a stochastic blockmodel. We use an embedding procedure motivated by the random dot product graph model, a particular example of the latent position model. The embedding associates each node with a vector; these vectors are clustered via minimization of a square error criterion. We prove that this method is consistent for assigning nodes to blocks, as only a negligible number of nodes will be misassigned. We prove consistency of the method for directed and undirected graphs. The consistent block assignment makes possible consistent parameter estimation for a stochastic blockmodel. We extend the result in the setting where the number of blocks grows slowly with the number of nodes. Our method is also computationally feasible even for very large graphs. We compare our method with Laplacian spectral clustering through analysis of simulated data and a graph derived from Wikipedia documents.

Suggested Citation

  • Daniel L. Sussman & Minh Tang & Donniell E. Fishkind & Carey E. Priebe, 2012. "A Consistent Adjacency Spectral Embedding for Stochastic Blockmodel Graphs," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1119-1128, September.
  • Handle: RePEc:taf:jnlasa:v:107:y:2012:i:499:p:1119-1128 DOI: 10.1080/01621459.2012.699795
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2012.699795
    Download Restriction: Access to full text is restricted to subscribers.

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hilary W. Hoynes & Diane Whitmore Schanzenbach, 2009. "Consumption Responses to In-Kind Transfers: Evidence from the Introduction of the Food Stamp Program," American Economic Journal: Applied Economics, American Economic Association, vol. 1(4), pages 109-139, October.
    2. Charles F. Manski, 1997. "Monotone Treatment Response," Econometrica, Econometric Society, vol. 65(6), pages 1311-1334, November.
    3. Nord, Mark & Andrews, Margaret S. & Carlson, Steven, 2008. "Household Food Security in the United States, 2007," Economic Research Report 56483, United States Department of Agriculture, Economic Research Service.
    4. Moffitt, Robert, 1983. "An Economic Model of Welfare Stigma," American Economic Review, American Economic Association, pages 1023-1035.
    5. Anderson, Simon P. & Renault, Regis, 2003. "Efficiency and surplus bounds in Cournot competition," Journal of Economic Theory, Elsevier, pages 253-264.
    6. Kreider, Brent & Pepper, John V., 2007. "Disability and Employment: Reevaluating the Evidence in Light of Reporting Errors," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 432-441, June.
    7. Craig Gundersen & Victor Oliveira, 2001. "The Food Stamp Program and Food Insufficiency," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 83(4), pages 875-887.
    8. Whitney K. Newey & Richard J. Smith, 2004. "Higher Order Properties of Gmm and Generalized Empirical Likelihood Estimators," Econometrica, Econometric Society, pages 219-255.
    9. Kaushal, N., 2007. "Do food stamps cause obesity?: Evidence from immigrant experience," Journal of Health Economics, Elsevier, vol. 26(5), pages 968-991, September.
    10. Bollinger, Christopher R., 1996. "Bounding mean regressions when a binary regressor is mismeasured," Journal of Econometrics, Elsevier, vol. 73(2), pages 387-399, August.
    11. Sheetal Sekhri, 2011. "Public Provision and Protection of Natural Resources: Groundwater Irrigation in Rural India," American Economic Journal: Applied Economics, American Economic Association, pages 29-55.
    12. Brent Kreider & John Pepper, 2008. "Inferring disability status from corrupt data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(3), pages 329-349.
    13. John V. Pepper, 2000. "The Intergenerational Transmission Of Welfare Receipt: A Nonparametric Bounds Analysis," The Review of Economics and Statistics, MIT Press, vol. 82(3), pages 472-488, August.
    14. Brent Kreider & Steven C. Hill, 2009. "Partially Identifying Treatment Effects with an Application to Covering the Uninsured," Journal of Human Resources, University of Wisconsin Press, vol. 44(2).
    15. Bhattacharya, Jayanta & Currie, Janet & Haider, Steven, 2004. "Poverty, food insecurity, and nutritional outcomes in children and adults," Journal of Health Economics, Elsevier, vol. 23(4), pages 839-862, July.
    16. Craig Gundersen & Susan Offutt, 2005. "Farm Poverty and Safety Nets," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 87(4), pages 885-899.
    17. Anne Case & Darren Lubotsky & Christina Paxson, 2002. "Economic Status and Health in Childhood: The Origins of the Gradient," American Economic Review, American Economic Association, pages 1308-1334.
    18. Chad D. Meyerhoefer & Yuriy Pylypchuk, 2008. "Does Participation in the Food Stamp Program Increase the Prevalence of Obesity and Health Care Spending?," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 90(2), pages 287-305.
    19. Molinari, Francesca, 2008. "Partial identification of probability distributions with misclassified data," Journal of Econometrics, Elsevier, pages 81-117.
    20. Charles F. Manski & John V. Pepper, 2000. "Monotone Instrumental Variables, with an Application to the Returns to Schooling," Econometrica, Econometric Society, vol. 68(4), pages 997-1012, July.
    21. Guido W. Imbens & Charles F. Manski, 2004. "Confidence Intervals for Partially Identified Parameters," Econometrica, Econometric Society, vol. 72(6), pages 1845-1857, November.
    22. Molinari, Francesca, 2010. "Missing Treatments," Journal of Business & Economic Statistics, American Statistical Association, vol. 28(1), pages 82-95.
    23. Janet Currie, 2003. "U.S. Food and Nutrition Programs," NBER Chapters,in: Means-Tested Transfer Programs in the United States, pages 199-290 National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. A. Athreya & C. E. Priebe & M. Tang & V. Lyzinski & D. J. Marchette & D. L. Sussman, 2016. "A Limit Theorem for Scaled Eigenvectors of Random Dot Product Graphs," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 78(1), pages 1-18, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:107:y:2012:i:499:p:1119-1128. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Chris Longhurst). General contact details of provider: http://www.tandfonline.com/UASA20 .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.