IDEAS home Printed from https://ideas.repec.org/a/bla/jorssc/v69y2020i5p1251-1268.html
   My bibliography  Save this article

Adding measurement error to location data to protect subject confidentiality while allowing for consistent estimation of exposure effects

Author

Listed:
  • Mahesh Karra
  • David Canning
  • Ryoko Sato

Abstract

In public use data sets, it is desirable not to report a respondent's location precisely to protect subject confidentiality. However, the direct use of perturbed location data to construct explanatory exposure variables for regression models will generally make naive estimates of all parameters biased and inconsistent. We propose an approach where a perturbation vector, consisting of a random distance at a random angle, is added to a respondent's reported geographic co‐ordinates. We show that, as long as the distribution of the perturbation is public and there is an underlying prior population density map, external researchers can construct unbiased and consistent estimates of location‐dependent exposure effects by using numerical integration techniques over all possible actual locations, although coefficient confidence intervals are wider than if the true location data were known. We examine our method by using a Monte Carlo simulation exercise and apply it to a real world example using data on perceived and actual distance to a health facility in Tanzania.

Suggested Citation

  • Mahesh Karra & David Canning & Ryoko Sato, 2020. "Adding measurement error to location data to protect subject confidentiality while allowing for consistent estimation of exposure effects," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1251-1268, November.
  • Handle: RePEc:bla:jorssc:v:69:y:2020:i:5:p:1251-1268
    DOI: 10.1111/rssc.12439
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssc.12439
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssc.12439?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. James W. Hardin & Henrik Schmeidiche & Raymond J. Carroll, 2003. "The regression-calibration method for fitting generalized linear models with additive measurement error," Stata Journal, StataCorp LP, vol. 3(4), pages 373-385, December.
    2. Aigner, Dennis J., 1973. "Regression with a binary independent variable subject to errors of observation," Journal of Econometrics, Elsevier, vol. 1(1), pages 49-59, March.
    3. Jerry Hausman, 2001. "Mismeasured Variables in Econometric Analysis: Problems from the Right and Problems from the Left," Journal of Economic Perspectives, American Economic Association, vol. 15(4), pages 57-67, Fall.
    4. Sophia Rabe-Hesketh & Anders Skrondal & Andrew Pickles, 2003. "Maximum likelihood estimation of generalized linear models with covariate measurement error," Stata Journal, StataCorp LP, vol. 3(4), pages 386-411, December.
    5. James W. Hardin & Henrik Schmeidiche & Raymond J. Carroll, 2003. "The regression-calibration method for fitting generalized linear models with additive measurement error," Stata Journal, StataCorp LP, vol. 3(4), pages 361-372, December.
    6. AIGNER, Dennis J., 1973. "Regression with a binary independent variable subject to errors of observation," LIDAM Reprints CORE 130, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    7. Giuseppe Arbia & Giuseppe Espa & Diego Giuliani, 2015. "Measurement Errors Arising When Using Distances in Microeconometric Modelling and the Individuals’ Position Is Geo-Masked for Confidentiality," Econometrics, MDPI, vol. 3(4), pages 1-10, October.
    8. Graeme Blair & Kosuke Imai & Yang-Yang Zhou, 2015. "Design and Analysis of the Randomized Response Technique," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1304-1319, September.
    9. Imai, Kosuke & Park, Bethany & Greene, Kenneth F., 2015. "Using the Predicted Responses from List Experiments as Explanatory Variables in Regression Models," Political Analysis, Cambridge University Press, vol. 23(2), pages 180-196, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Adele Bergin, 2015. "Employer Changes and Wage Changes: Estimation with Measurement Error in a Binary Variable," LABOUR, CEIS, vol. 29(2), pages 194-223, June.
    2. Marianne Page, 2006. "Father's Education and Children's Human Capital: Evidence from the World War II GI Bill," Working Papers 84, University of California, Davis, Department of Economics.
    3. Philip Oreopoulos & Marianne E. Page, 2006. "The Intergenerational Effects of Compulsory Schooling," Journal of Labor Economics, University of Chicago Press, vol. 24(4), pages 729-760, October.
    4. Dan A. Black & Lars Skipper & Jeffrey A. Smith & Jeffrey Andrew Smith, 2023. "Firm Training," CESifo Working Paper Series 10268, CESifo.
    5. Leah K. Lakdawala & David Simon, 2016. "The Intergenerational Consequences of Tobacco Policy," Working papers 2016-27, University of Connecticut, Department of Economics.
    6. Mengke Qiao & Ke-Wei Huang, 2021. "Correcting Misclassification Bias in Regression Models with Variables Generated via Data Mining," Information Systems Research, INFORMS, vol. 32(2), pages 462-480, June.
    7. Daniel Kaufmann, 2016. "Is Deflation Costly After All? Evidence from Noisy Historical Data," KOF Working papers 16-421, KOF Swiss Economic Institute, ETH Zurich.
    8. Adele Bergin, 2013. "Job Changes and Wage Changes: Estimation with Measurement Error in a Binary Variable," Economics Department Working Paper Series n240-13.pdf, Department of Economics, National University of Ireland - Maynooth.
    9. Mariana Carrera & Heather Royer & Mark Stehr & Justin Sydnor & Dmitry Taubinsky, 2022. "Who Chooses Commitment? Evidence and Welfare Implications [Self-Control and Demand for Commitment in Online Game Playing: Evidence from a Field Experiment]," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 89(3), pages 1205-1244.
    10. Daniel Kaufmann, 2020. "Is deflation costly after all? The perils of erroneous historical classifications," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 35(5), pages 614-628, August.
    11. Nguimkeu, Pierre & Denteh, Augustine & Tchernis, Rusty, 2019. "On the estimation of treatment effects with endogenous misreporting," Journal of Econometrics, Elsevier, vol. 208(2), pages 487-506.
    12. Brent Kreider & Steven C. Hill, 2009. "Partially Identifying Treatment Effects with an Application to Covering the Uninsured," Journal of Human Resources, University of Wisconsin Press, vol. 44(2).
    13. Acerenza, Santiago & Ban, Kyunghoon & Kedagni, Desire, 2021. "Marginal Treatment Effects with Misclassified Treatment," ISU General Staff Papers 202106180700001132, Iowa State University, Department of Economics.
    14. Charles Courtemanche & Augustine Denteh & Rusty Tchernis, 2019. "Estimating the Associations between SNAP and Food Insecurity, Obesity, and Food Purchases with Imperfect Administrative Measures of Participation," Southern Economic Journal, John Wiley & Sons, vol. 86(1), pages 202-228, July.
    15. Miaari, Sami H. & Lee, Ines, 2020. "Obstacles on the Road to School: The Impacts of Mobility Restrictions on Educational Performance," IZA Discussion Papers 13563, Institute of Labor Economics (IZA).
    16. DiTraglia, Francis J. & García-Jimeno, Camilo, 2019. "Identifying the effect of a mis-classified, binary, endogenous regressor," Journal of Econometrics, Elsevier, vol. 209(2), pages 376-390.
    17. Holmlund, Helena, 2007. "A Researcher's Guide to the Swedish Compulsory School Reform," Working Paper Series 9/2007, Stockholm University, Swedish Institute for Social Research.
    18. Kyung Min Kang & Robert A. Moffitt, 2019. "The Effect of SNAP and School Food Programs on Food Security, Diet Quality, and Food Spending: Sensitivity to Program Reporting Error," Southern Economic Journal, John Wiley & Sons, vol. 86(1), pages 156-201, July.
    19. Abdurrahman Aydemir & George J. Borjas, 2011. "Attenuation Bias in Measuring the Wage Impact of Immigration," Journal of Labor Economics, University of Chicago Press, vol. 29(1), pages 69-113, January.
    20. Gittleman Maury, 2011. "Medicaid and Wealth: A Re-Examination," The B.E. Journal of Economic Analysis & Policy, De Gruyter, vol. 11(1), pages 1-25, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssc:v:69:y:2020:i:5:p:1251-1268. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.