IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v73y2017i4p1189-1198.html
   My bibliography  Save this article

Modeling and analyzing respondent‐driven sampling as a counting process

Author

Listed:
  • Yakir Berchenko
  • Jonathan D. Rosenblatt
  • Simon D. W. Frost

Abstract

Respondent‐driven sampling (RDS) is an approach to sampling design and analysis which utilizes the networks of social relationships that connect members of the target population, using chain‐referral. RDS sampling will typically oversample participants with many acquaintances. Naïve estimators, such as the sample average, will thus be biased towards the state of the most highly connected individuals. Current methodology cannot estimate population size from RDS, and promotes inverse probability weighted estimators for population parameters such as HIV prevalence. We propose to use the timing of recruitment, typically collected and discarded, in order to estimate the population size via a counting process model. Once population size and degree frequencies are made available, prevalence can be debiased in a post‐stratified framework. We adapt methods developed for inference in epidemiology and software reliability to estimate the population size, degree counts and frequencies. A fundamental advantage of our approach is that it makes the assumptions of the sampling design explicit. This enables verification of the assumptions, maximum likelihood estimation, extension with covariates, and model selection. We develop large‐sample theory, proving consistency and asymptotic normality. We further compare our estimators to other estimators in the RDS literature, through simulation and real‐world data. In both cases, we find our estimators to outperform current methods. The likelihood problem in the model we present is separable, and thus efficiently solvable. We implement these estimators in an accompanying R package, chords, available on CRAN.

Suggested Citation

  • Yakir Berchenko & Jonathan D. Rosenblatt & Simon D. W. Frost, 2017. "Modeling and analyzing respondent‐driven sampling as a counting process," Biometrics, The International Biometric Society, vol. 73(4), pages 1189-1198, December.
  • Handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1189-1198
    DOI: 10.1111/biom.12678
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12678
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.12678?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gile, Krista J., 2011. "Improved Inference for Respondent-Driven Sampling Data With Application to HIV Prevalence Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 135-146.
    2. Traud, Amanda L. & Mucha, Peter J. & Porter, Mason A., 2012. "Social structure of Facebook networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(16), pages 4165-4180.
    3. T. Britton, 1998. "Estimation in multitype epidemics," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(4), pages 663-679.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xin Xu & Yang Lu & Yupeng Zhou & Zhiguo Fu & Yanjie Fu & Minghao Yin, 2021. "An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks," Mathematics, MDPI, vol. 9(15), pages 1-14, July.
    2. Ian E. Fellows & Mark S. Handcock, 2023. "Modeling of networked populations when data is sampled or missing," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 21-35, April.
    3. Jiashun Jin & Zheng Tracy Ke & Shengming Luo, 2022. "Improvements on SCORE, Especially for Weak Signals," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 127-162, June.
    4. Fatemi, Samira & Salehi, Mostafa & Veisi, Hadi & Jalili, Mahdi, 2018. "A fuzzy logic based estimator for respondent driven sampling of complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 510(C), pages 42-51.
    5. Saxena, Rakhi & Kaur, Sharanjit & Bhatnagar, Vasudha, 2019. "Identifying similar networks using structural hierarchy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    6. Luca Braghieri & Ro'ee Levy & Alexey Makarin, 2022. "Social Media and Mental Health," American Economic Review, American Economic Association, vol. 112(11), pages 3660-3693, November.
    7. Chien-Min Huang & F. Jay Breidt, 2023. "A dual-frame approach for estimation with respondent-driven samples," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 65-81, April.
    8. Ma, Shujie & Su, Liangjun & Zhang, Yichong, 2020. "Detecting Latent Communities in Network Formation Models," Economics and Statistics Working Papers 12-2020, Singapore Management University, School of Economics.
    9. Ciotti, Valerio & Bianconi, Ginestra & Capocci, Andrea & Colaiori, Francesca & Panzarasa, Pietro, 2015. "Degree correlations in signed social networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 422(C), pages 25-39.
    10. Wang, Mingyan & Zeng, An & Cui, Xiaohua, 2022. "Collective user switching behavior reveals the influence of TV channels and their hidden community structure," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 606(C).
    11. Lisa Avery & Alison Macpherson & Sarah Flicker & Michael Rotondi, 2021. "A review of reported network degree and recruitment characteristics in respondent driven sampling implications for applied researchers and methodologists," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-19, April.
    12. Jason Jung, 2014. "Understanding information propagation on online social tagging systems: a case study on Flickr," Quality & Quantity: International Journal of Methodology, Springer, vol. 48(2), pages 745-754, March.
    13. Dongah Kim & Krista J. Gile & Honoria Guarino & Pedro Mateu‐Gelabert, 2021. "Inferring bivariate association from respondent‐driven sampling data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(2), pages 415-433, March.
    14. Yuan, Wei-Guo & Liu, Yun, 2015. "A mixing evolution model for bidirectional microblog user networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 432(C), pages 167-179.
    15. Yi-Shan Sung & Dashun Wang & Soundar Kumara, 0. "Uncovering the effect of dominant attributes on community topology: A case of facebook networks," Information Systems Frontiers, Springer, vol. 0, pages 1-12.
    16. Karimi, Fariba & Ramenzoni, Verónica C. & Holme, Petter, 2014. "Structural differences between open and direct communication in an online community," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 263-273.
    17. Florence Samkange-Zeeb & Ronja Foraita & Stefan Rach & Tilman Brand, 2019. "Feasibility of using respondent-driven sampling to recruit participants in superdiverse neighbourhoods for a general health survey," International Journal of Public Health, Springer;Swiss School of Public Health (SSPH+), vol. 64(3), pages 451-459, April.
    18. Malmros Jens & Masuda Naoki & Britton Tom, 2016. "Random Walks on Directed Networks: Inference and Respondent-Driven Sampling," Journal of Official Statistics, Sciendo, vol. 32(2), pages 433-459, June.
    19. Li, Jin-Yue & Li, Xiang & Li, Cong, 2021. "The Kronecker-clique model for higher-order clustering coefficients," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 582(C).
    20. Sun, Xin & Dong, Junyu & Tang, Ruichun & Xu, Mantao & Qi, Lin & Cai, Yang, 2015. "Topological evolution of virtual social networks by modeling social activities," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 433(C), pages 259-267.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1189-1198. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.