IDEAS home Printed from https://ideas.repec.org/a/ibn/ijspjl/v8y2019i5p66.html
   My bibliography  Save this article

Bayesian Analysis of Sparse Counts Obtained From the Unrelated Question Design

Author

Listed:
  • Balgobin Nandram
  • Yuan Yu

Abstract

In sample surveys with sensitive items, sampled units may not respond or they respond untruthfully. Usually a negative answer is given when it is actually positive, thereby leading to an estimate of the population proportion of positives (sensitive proportion) that is too small. In our study, we have binary data obtained from the unrelated-question design, and both the sensitive proportion and the nonsensitive proportion are of interest. A respondent answers the sensitive item with a known probability, and to avoid non-identifiable parameters, at least two (not necessarily exactly two) different random mechanisms are used, but only one for each cluster of respondents. The key point here is that the counts are sparse (very small sample sizes), and we show how to overcome some of the problems associated with the unrelated question design. A standard approach to this problem is to use the expectation-maximization (EM) algorithm. However, because we consider only small sample sizes (sparse counts), the EM algorithm may not converge and asymptotic theory, which can permit normality assumptions for inference, is not appropriate; so we develop a Bayesian method. To compare the EM algorithm and the Bayesian method, we have presented an example with sparse data on college cheating and a simulation study to illustrate the properties of our procedure. Finally, we discuss two extensions to accommodate finite population sampling and optional responses.

Suggested Citation

  • Balgobin Nandram & Yuan Yu, 2019. "Bayesian Analysis of Sparse Counts Obtained From the Unrelated Question Design," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 8(5), pages 66-84, September.
  • Handle: RePEc:ibn:ijspjl:v:8:y:2019:i:5:p:66
    as

    Download full text from publisher

    File URL: http://www.ccsenet.org/journal/index.php/ijsp/article/download/0/0/40449/41895
    Download Restriction: no

    File URL: http://www.ccsenet.org/journal/index.php/ijsp/article/view/0/40449
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Balgobin Nandram & Yuan Yu, 2019. "Bayesian Analysis of a Sensitive Proportion for a Small Area," International Statistical Review, International Statistical Institute, vol. 87(S1), pages 104-120, May.
    2. Tasos Christofides, 2005. "Randomized response technique for two sensitive characteristics at the same time," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 62(1), pages 53-63, September.
    3. Heiko Groenitz, 2015. "Using prior information in privacy-protecting survey designs for categorical sensitive variables," Statistical Papers, Springer, vol. 56(1), pages 167-189, February.
    4. Graeme Blair & Kosuke Imai & Yang-Yang Zhou, 2015. "Design and Analysis of the Randomized Response Technique," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1304-1319, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shen‐Ming Lee & Truong‐Nhat Le & Phuoc‐Loc Tran & Chin‐Shang Li, 2022. "Investigating the association of a sensitive attribute with a random variable using the Christofides generalised randomised response design and Bayesian methods," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1471-1502, November.
    2. Truong-Nhat Le & Shen-Ming Lee & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Randomized Response Techniques: A Systematic Review from the Pioneering Work of Warner (1965) to the Present," Mathematics, MDPI, vol. 11(7), pages 1-26, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Truong-Nhat Le & Shen-Ming Lee & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Randomized Response Techniques: A Systematic Review from the Pioneering Work of Warner (1965) to the Present," Mathematics, MDPI, vol. 11(7), pages 1-26, April.
    2. Shen‐Ming Lee & Truong‐Nhat Le & Phuoc‐Loc Tran & Chin‐Shang Li, 2022. "Investigating the association of a sensitive attribute with a random variable using the Christofides generalised randomised response design and Bayesian methods," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1471-1502, November.
    3. Shen-Ming Lee & Phuoc-Loc Tran & Truong-Nhat Le & Chin-Shang Li, 2023. "Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model," Mathematics, MDPI, vol. 11(2), pages 1-21, January.
    4. Sylvain Chassang & Christian Zehnder, 2019. "Secure Survey Design in Organizations: Theory and Experiments," Working Papers 2019-22, Princeton University. Economics Department..
    5. Lucio Barabesi & Giancarlo Diana & Pier Perri, 2013. "Design-based distribution function estimation for stigmatized populations," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(7), pages 919-935, October.
    6. Burgstaller, Lilith & Feld, Lars P. & Pfeil, Katharina, 2022. "Working in the shadow: Survey techniques for measuring and explaining undeclared work," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 661-671.
    7. Marc Höglinger & Ben Jann, 2018. "More is not always better: An experimental individual-level validation of the randomized response technique and the crosswise model," PLOS ONE, Public Library of Science, vol. 13(8), pages 1-22, August.
    8. Matthew J. Schneider & Dawn Iacobucci, 2020. "Protecting survey data on a consumer level," Journal of Marketing Analytics, Palgrave Macmillan, vol. 8(1), pages 3-17, March.
    9. Shu-Hui Hsieh & Shen-Ming Lee & Su-Hao Tu, 2018. "Randomized response techniques for a multi-level attribute using a single sensitive question," Statistical Papers, Springer, vol. 59(1), pages 291-306, March.
    10. Chuang, Erica & Dupas, Pascaline & Huillery, Elise & Seban, Juliette, 2021. "Sex, lies, and measurement: Consistency tests for indirect response survey methods," Journal of Development Economics, Elsevier, vol. 148(C).
    11. Liu, Yin & Tian, Guo-Liang, 2013. "A variant of the parallel model for sample surveys with sensitive characteristics," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 115-135.
    12. Chu, Amanda M.Y. & Omori, Yasuhiro & So, Hing-yu & So, Mike K.P., 2023. "A Multivariate Randomized Response Model for Sensitive Binary Data," Econometrics and Statistics, Elsevier, vol. 27(C), pages 16-35.
    13. Sarah Brierley, 2020. "Unprincipled Principals: Co‐opted Bureaucrats and Corruption in Ghana," American Journal of Political Science, John Wiley & Sons, vol. 64(2), pages 209-222, April.
    14. Sylvain Chassang & Christian Zehnder, 2019. "Secure Survey Design in Organizations: Theory and Experiments," NBER Working Papers 25918, National Bureau of Economic Research, Inc.
    15. Lee, Cheon-Sig & Sedory, Stephen A. & Singh, Sarjinder, 2013. "Estimating at least seven measures of qualitative variables from a single sample using randomized response technique," Statistics & Probability Letters, Elsevier, vol. 83(1), pages 399-409.
    16. Johanna Choumert-Nkolo & Pascale Phelinas, 2018. "New paradigms for household surveys in low and middle income countries [Nouveaux paradigmes d'élaboration des enquêtes ménages dans les pays du Sud]," CERDI Working papers halshs-01888609, HAL.
    17. Kim, Sahrok & Praveen Parboteeah, K. & Cullen, John B. & Jeong, Nara, 2022. "Social institutions approach to women’s firm ownership and firm bribery activity: A study of small-sized firms in emerging markets," Journal of Business Research, Elsevier, vol. 144(C), pages 1333-1349.
    18. Ivar Krumpal & Thomas Voss, 2020. "Sensitive Questions and Trust: Explaining Respondents’ Behavior in Randomized Response Surveys," SAGE Open, , vol. 10(3), pages 21582440209, July.
    19. Amanda M. Y. Chu & Mike K. P. So & Ray S. W. Chung, 2018. "Applying the Randomized Response Technique in Business Ethics Research: The Misuse of Information Systems Resources in the Workplace," Journal of Business Ethics, Springer, vol. 151(1), pages 195-212, August.
    20. Ó Ceallaigh, Diarmaid & Timmons, Shane & Robertson, Deirdre & Lunn, Pete, 2023. "Problem gambling: A narrative review of important policy-relevant issues," Research Series, Economic and Social Research Institute (ESRI), number SUSTAT119, June.

    More about this item

    Keywords

    data augmentation; EM algorithm; Gibbs sampler; latent variables; non-identifiable parameters; proportion; Rao-Blackwellized estimates;
    All these keywords.

    JEL classification:

    • R00 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General - - - General
    • Z0 - Other Special Topics - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ibn:ijspjl:v:8:y:2019:i:5:p:66. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Canadian Center of Science and Education (email available below). General contact details of provider: https://edirc.repec.org/data/cepflch.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.