IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i13p2134-d1690730.html
   My bibliography  Save this article

Kernel Density Estimation for Joint Scrambling in Sensitive Surveys

Author

Listed:
  • Alvan Caleb Arulandu

    (Department of Mathematics, Harvard University, 33 Lowell Mail Center, 10 Holyoke Place, Cambridge, MA 02138, USA)

  • Sat Gupta

    (Department of Mathematics and Statistics, University of North Carolina at Greensboro, 116 Petty Building, Greensboro, NC 27412, USA)

Abstract

Randomized response models aim to protect respondent privacy when sampling sensitive variables but consequently compromise estimator efficiency. We propose a new sampling method, titled joint scrambling, which preserves all true responses while protecting privacy by asking each respondent to jointly speak both their true response and multiple random responses in an arbitrary order. We give a kernel density estimator for the density function with asymptotically equivalent mean squared error for the optimal bandwidth yet greater generality than existing techniques for randomized response models. We also give consistent, unbiased estimators for a general class of estimands including the mean. For the cumulative distribution function, this estimator is more computationally efficient with asymptotically lower mean squared error than existing approaches. All results are verified via simulation and evaluated with respect to natural generalizations of existing privacy notions.

Suggested Citation

  • Alvan Caleb Arulandu & Sat Gupta, 2025. "Kernel Density Estimation for Joint Scrambling in Sensitive Surveys," Mathematics, MDPI, vol. 13(13), pages 1-17, June.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:13:p:2134-:d:1690730
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/13/2134/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/13/2134/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Giancarlo Diana & Pier Perri, 2011. "A class of estimators for quantitative sensitive data," Statistical Papers, Springer, vol. 52(3), pages 633-650, August.
    2. Sat Gupta & Michael Parker & Sadia Khalil, 2024. "A Ratio Estimator for the Mean Using a Mixture Optional Enhance Trust (MOET) Randomized Response Model," Mathematics, MDPI, vol. 12(22), pages 1-17, November.
    3. Sayed A. Mostafa & Ibrahim A. Ahmad, 2019. "Kernel density estimation from complex surveys in the presence of complete auxiliary information," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 82(3), pages 295-338, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lucio Barabesi & Giancarlo Diana & Pier Perri, 2013. "Design-based distribution function estimation for stigmatized populations," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(7), pages 919-935, October.
    2. Sat Gupta & Michael Parker & Sadia Khalil, 2024. "A Ratio Estimator for the Mean Using a Mixture Optional Enhance Trust (MOET) Randomized Response Model," Mathematics, MDPI, vol. 12(22), pages 1-17, November.
    3. María del Mar García Rueda & Pier Francesco Perri & Beatriz Rodríguez Cobo, 2018. "Advances in estimation by the item sum technique using auxiliary information in complex surveys," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(3), pages 455-478, July.
    4. Lucio Barabesi & Giancarlo Diana & Pier Perri, 2015. "Gini index estimation in randomized response surveys," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 99(1), pages 45-62, January.
    5. Kumari Priyanka & Pidugu Trisandhya & Richa Mittal, 2018. "Dealing sensitive characters on successive occasions through a general class of estimators using scrambled response techniques," METRON, Springer;Sapienza Università di Roma, vol. 76(2), pages 203-230, August.
    6. Priyanka Kumari & Trisandhya Pidugu, 2019. "Modelling Sensitive Issues On Successive Waves," Statistics in Transition New Series, Statistics Poland, vol. 20(1), pages 41-65, March.
    7. Javid Shabbir & Zabihullah Movaheedi, 2024. "Use of generalized randomized response model for enhancement of finite population variance: A simulation approach," PLOS ONE, Public Library of Science, vol. 19(12), pages 1-21, December.
    8. Muhammad Azeem & Sidra Ali, 2023. "A neutral comparative analysis of additive, multiplicative, and mixed quantitative randomized response models," PLOS ONE, Public Library of Science, vol. 18(4), pages 1-11, April.
    9. Muhammad Azeem & Javid Shabbir & Najma Salahuddin & Sundus Hussain & Musarrat Ijaz, 2023. "A comparative study of randomized response techniques using separate and combined metrics of efficiency and privacy," PLOS ONE, Public Library of Science, vol. 18(10), pages 1-15, October.
    10. Sayed A. Mostafa & Ibrahim A. Ahmad, 2021. "Kernel Density Estimation Based on the Distinct Units in Sampling with Replacement," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(2), pages 507-547, November.
    11. Muhammad Azeem & Sundus Hussain & Musarrat Ijaz & Najma Salahuddin, 2024. "An improved quantitative randomized response technique for data collection in sensitive surveys," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(1), pages 329-341, February.
    12. Andreas Quatember, 2012. "An extension of the standardized randomized response technique to a multi-stage setup," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 21(4), pages 475-484, November.
    13. Kumari Priyanka & Pidugu Trisandhya, 2019. "Modelling Sensitive Issues On Successive Waves," Statistics in Transition New Series, Polish Statistical Association, vol. 20(1), pages 41-65, March.
    14. Mausumi Bose, 2015. "Respondent privacy and estimation efficiency in randomized response surveys for discrete-valued sensitive variables," Statistical Papers, Springer, vol. 56(4), pages 1055-1069, November.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:13:p:2134-:d:1690730. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.