Printed from https://ideas.repec.org/a/eee/ecosta/v31y2024icp81-99.html

Differentially Private Goodness-of-Fit Tests for Continuous Variables

Authors

  • Kwak, Seung Woo
  • Ahn, Jeongyoun
  • Lee, Jaewoo
  • Park, Cheolwoo

Abstract

Data privacy is a growing concern in modern data analyses as more and more types of information about individuals are collected and shared. Privacy-aware statistical analysis is thus becoming an exciting area of research. Differential privacy provides a means of measuring the stochastic risk of violating the privacy of individuals that can result from conducting an analysis, such as a simple database query or a hypothesis test. The main interest of this work is a goodness-of-fit test that compares the sampled data to a known distribution. Many differentially private goodness-of-fit tests have been proposed for discrete random variables, but little work has been done for continuous variables. The objective is to review some existing tests that guarantee differential privacy for discrete random variables, and to propose an extension to continuous cases via a discretization process. The proposed test procedures are demonstrated through simulated examples and applied to the Household Financial Welfare Survey of South Korea in 2018.
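
The general idea in the abstract (discretize a continuous variable, then run a differentially private test on the resulting counts) can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a generic Laplace mechanism on the histogram, a chi-square statistic, and a Monte Carlo null distribution, and it treats the sample size n as public. The function names, the number of bins `k`, and the privacy budget `epsilon` are all hypothetical choices made for illustration.

```python
import random
from statistics import NormalDist


def laplace_noise(scale, rng):
    # The difference of two iid exponentials with mean `scale` is Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def bin_counts(data, cdf, k):
    # Discretization step: under H0, cdf(x) is Uniform(0, 1), so k equal-width
    # bins on [0, 1] are equiprobable under the hypothesized distribution.
    counts = [0] * k
    for x in data:
        counts[min(int(cdf(x) * k), k - 1)] += 1
    return counts


def noisy_chi2(counts, epsilon, rng):
    # Assumption: neighbouring datasets differ by replacing one record, which
    # changes two bin counts by one each, so the L1 sensitivity is 2.
    n, k = sum(counts), len(counts)
    expected = n / k
    noisy = [c + laplace_noise(2.0 / epsilon, rng) for c in counts]
    return sum((c - expected) ** 2 / expected for c in noisy)


def dp_gof_test(data, cdf, k=10, epsilon=1.0, n_sim=500, alpha=0.05, seed=0):
    rng = random.Random(seed)
    n = len(data)
    stat = noisy_chi2(bin_counts(data, cdf, k), epsilon, rng)
    # Monte Carlo null: simulate equiprobable multinomial counts and add the
    # same noise, so the reference distribution accounts for the privacy noise.
    null_stats = []
    for _ in range(n_sim):
        sim = [0] * k
        for _ in range(n):
            sim[rng.randrange(k)] += 1
        null_stats.append(noisy_chi2(sim, epsilon, rng))
    p_value = (1 + sum(s >= stat for s in null_stats)) / (n_sim + 1)
    return stat, p_value, p_value < alpha


if __name__ == "__main__":
    gen = random.Random(1)
    sample = [gen.gauss(1.5, 1.0) for _ in range(300)]  # not standard normal
    print(dp_gof_test(sample, NormalDist(0.0, 1.0).cdf))
```

Because the test statistic is computed only from the noisy counts, it inherits the privacy guarantee of the Laplace mechanism by post-processing. Calibrating against a simulated null that includes the noise, rather than a plain chi-square table, is one simple way to keep the test level close to `alpha`; other calibrations are possible.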

Suggested Citation

  • Kwak, Seung Woo & Ahn, Jeongyoun & Lee, Jaewoo & Park, Cheolwoo, 2024. "Differentially Private Goodness-of-Fit Tests for Continuous Variables," Econometrics and Statistics, Elsevier, vol. 31(C), pages 81-99.
  • Handle: RePEc:eee:ecosta:v:31:y:2024:i:c:p:81-99
    DOI: 10.1016/j.ecosta.2021.09.007

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S2452306221001143
    Download Restriction: Full text for ScienceDirect subscribers only. Contains open access articles

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Walker, Douglas O., 2007. "Patterns of income distribution among world regions," Journal of Policy Modeling, Elsevier, vol. 29(4), pages 643-655.
    2. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    3. Jing Lei & Anne‐Sophie Charest & Aleksandra Slavkovic & Adam Smith & Stephen Fienberg, 2018. "Differentially private model selection with penalized and constrained likelihood," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(3), pages 609-633, June.
    4. Roberto Dell’Anno & Jorge Martinez-Vazquez, 2013. "A Behavioral Local Public Finance Perspective on the Renter’s Illusion Hypothesis," International Center for Public Policy Working Paper Series, at AYSPS, GSU paper1303, International Center for Public Policy, Andrew Young School of Policy Studies, Georgia State University.
    5. Dawid, H. & Harting, P. & Neugart, M., 2018. "Cohesion policy and inequality dynamics: Insights from a heterogeneous agents macroeconomic model," Journal of Economic Behavior & Organization, Elsevier, vol. 150(C), pages 220-255.
    6. Marinko Škare & Saša Stjepanovic, 2014. "Income Distribution Determinants and Inequality – International Comparison," The AMFITEATRU ECONOMIC journal, Academy of Economic Studies - Bucharest, Romania, vol. 16(37), pages 980-980, August.
    7. Claire McKay Bowen & Fang Liu & Bingyue Su, 2021. "Differentially private data release via statistical election to partition sequentially," METRON, Springer;Sapienza Università di Roma, vol. 79(1), pages 1-31, April.
    8. Ryan Cumings-Menon, 2022. "Differentially Private Estimation via Statistical Depth," Papers 2207.12602, arXiv.org.
    9. Ron S. Jarmin & John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Nathan Goldschlag & Michael B. Hawes & Sallie Ann Keller & Daniel Kifer & Philip Leclerc & Jerome P. Reiter & Rolando A. Rodrígue, 2023. "An in-depth examination of requirements for disclosure risk assessment," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 120(43), pages 2220558120-, October.
    10. Dean, James W., 2007. "National welfare and individual happiness: Income distribution and beyond," Journal of Policy Modeling, Elsevier, vol. 29(4), pages 567-575.
    11. Salvatore, Dominick & Campano, Fred, 2022. "Regional differences in inequality and income distribution in the United States," Journal of Policy Modeling, Elsevier, vol. 44(4), pages 780-789.
    12. Cao, Zilong & Wu, Shisong & Li, Xuanang & Zhang, Hai, 2025. "Differentially private histogram with valid statistics," Statistics & Probability Letters, Elsevier, vol. 219(C).
    13. Chongliang Luo & Md. Nazmul Islam & Natalie E. Sheils & John Buresh & Jenna Reps & Martijn J. Schuemie & Patrick B. Ryan & Mackenzie Edmondson & Rui Duan & Jiayi Tong & Arielle Marks-Anglin & Jiang Bi, 2022. "DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    14. Shaughnessy, Timothy M. & White, Mary L. & Brendler, Michael D., 2010. "The Income Distribution Effect of Natural Disasters: An Analysis of Hurricane Katrina," Journal of Regional Analysis and Policy, Mid-Continent Regional Science Association, vol. 40(01), pages 1-12.
    15. Salvatore, Dominick, 2010. "Growth or stagnation after recession for the U.S. and other large advanced economies," Journal of Policy Modeling, Elsevier, vol. 32(5), pages 637-647, September.
    16. Cárdenas-Retamal, Roberto & Dresdner-Cid, Jorge & Ceballos-Concha, Adams, 2021. "Impact assessment of salmon farming on income distribution in remote coastal areas: The Chilean case," Food Policy, Elsevier, vol. 101(C).
    17. Vishesh Karwa & Pavel N. Krivitsky & Aleksandra B. Slavković, 2017. "Sharing social network data: differentially private estimation of exponential family random-graph models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(3), pages 481-500, April.
    18. John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Simson Garfinkel & Micah Heineck & Christine Heiss & Robert Johns & Daniel Kifer & Philip Leclerc & Ashwin Machanavajjhala & Brett Moran & William, 2022. "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Papers 2204.08986, arXiv.org.
    19. James W. Dean & G. Robert Ross, 2006. "Paradoxes and Puzzles in Our Globalized World Public Support of Trade Policy, International Outsourcing Trade Liberalization, Globalization," Carleton Economic Papers 06-07, Carleton University, Department of Economics.
    20. Chiang, Yen-Sheng, 2015. "Inequality measures perform differently in global and local assessments: An exploratory computational experiment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 437(C), pages 1-11.
