IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/23534.html
   My bibliography  Save this paper

A Framework for Sharing Confidential Research Data, Applied to Investigating Differential Pay by Race in the U. S. Government

Author

Listed:
  • Andrés F. Barrientos
  • Alexander Bolton
  • Tom Balmat
  • Jerome P. Reiter
  • John M. de Figueiredo
  • Ashwin Machanavajjhala
  • Yan Chen
  • Charles Kneifel
  • Mark DeLong

Abstract

Data stewards seeking to provide access to large-scale social science data face a difficult challenge. They have to share data in ways that protect privacy and confidentiality, are informative for many analyses and purposes, and are relatively straightforward to use by data analysts. We present a framework for addressing this challenge. The framework uses an integrated system that includes fully synthetic data intended for wide access, coupled with means for approved users to access the confidential data via secure remote access solutions, glued together by verification servers that allow users to assess the quality of their analyses with the synthetic data. We apply this framework to data on the careers of employees of the U. S. federal government, studying differentials in pay by race. The integrated system performs as intended, allowing users to explore the synthetic data for potential pay differentials and learn through verifications which findings in the synthetic data hold up in the confidential data and which do not. We find differentials across races; for example, the gap between black and white female federal employees' pay increased over the time period. We present models for generating synthetic careers and differentially private algorithms for verification of regression results.

Suggested Citation

  • Andrés F. Barrientos & Alexander Bolton & Tom Balmat & Jerome P. Reiter & John M. de Figueiredo & Ashwin Machanavajjhala & Yan Chen & Charles Kneifel & Mark DeLong, 2017. "A Framework for Sharing Confidential Research Data, Applied to Investigating Differential Pay by Race in the U. S. Government," NBER Working Papers 23534, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:23534
    Note: LE LS PE TWP
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w23534.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ryoichi Sakano, 2002. "Are black and white income distributions converging? time series analysis," The Review of Black Political Economy, Springer;National Economic Association, vol. 30(1), pages 91-106, June.
    2. David Card & Thomas Lemieux, 1994. "Changing Wage Structure and Black-White Wage Differentials: A Longitudinal Analysis," Working Papers 701, Princeton University, Department of Economics, Industrial Relations Section..
    3. A. Colin Cameron & Douglas L. Miller, 2015. "A Practitioner’s Guide to Cluster-Robust Inference," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 317-372.
    4. Altonji, Joseph G. & Blank, Rebecca M., 1999. "Race and gender in the labor market," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 3, chapter 48, pages 3143-3259, Elsevier.
    5. Drechsler, Jörg & Dundler, Agnes & Bender, Stefan & Rässler, Susanne & Zwick, Thomas, 2007. "A new approach for disclosure control in the IAB Establishment Panel : multiple imputation for a better data access," IAB-Discussion Paper 200711, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    6. Borjas, George J, 1980. "Wage Determination in the Federal Government: The Role of Constituents and Bureaucrats," Journal of Political Economy, University of Chicago Press, vol. 88(6), pages 1110-1147, December.
    7. repec:eee:labchp:v:3:y:1999:i:pc:p:3143-3259 is not listed on IDEAS
    8. Gary A. Hoover & Ryan A. Compton & Daniel C. Giedeman, 2015. "The Impact of Economic Freedom on the Black/White Income Gap," American Economic Review, American Economic Association, vol. 105(5), pages 587-592, May.
    9. Reiter, Jerome P. & Raghunathan, Trivellore E., 2007. "The Multiple Adaptations of Multiple Imputation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1462-1471, December.
    10. Card, David & Lemieux, Thomas, 1994. "Changing Wage Structure and Black-White Wage Differentials," American Economic Review, American Economic Association, vol. 84(2), pages 29-33, May.
    11. Jerome P. Reiter, 2005. "Releasing multiply imputed, synthetic public use microdata: an illustration and empirical study," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 168(1), pages 185-205, January.
    12. Borjas, George J, 1982. "The Politics of Employment Discrimination in the Federal Bureaucracy," Journal of Law and Economics, University of Chicago Press, vol. 25(2), pages 271-299, October.
    13. Reiter, Jerome P. & Oganian, Anna & Karr, Alan F., 2009. "Verification servers: Enabling analysts to assess the quality of inferences from public use data," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1475-1482, February.
    14. Alexander Bolton & John M. de Figueiredo & David E. Lewis, 2016. "Elections, Ideology, and Turnover in the U.S. Federal Government," NBER Working Papers 22932, National Bureau of Economic Research, Inc.
    15. Dan Black & Natalia Kolesnikova & Seth Sanders & Lowell Taylor, 2013. "The role of location in evaluating racial wage disparity," IZA Journal of Labor Economics, Springer;Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 2(1), pages 1-18, December.
    16. David Card & Thomas Lemieux, 1994. "Changing Wage Structure and Black-White Wage Differentials: A Longitudinal Analysis," Working Papers 701, Princeton University, Department of Economics, Industrial Relations Section..
    17. George J. Borjas, 1983. "The Measurement of Race and Gender Wage Differentials: Evidence from the Federal Sector," ILR Review, Cornell University, ILR School, vol. 37(1), pages 79-91, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lex Borghans & Bas Ter Weel & Bruce A. Weinberg, 2014. "People Skills and the Labor-Market Outcomes of Underrepresented Groups," ILR Review, Cornell University, ILR School, vol. 67(2), pages 287-334, April.
    2. Jerome P. Reiter, 2009. "Using Multiple Imputation to Integrate and Disseminate Confidential Microdata," International Statistical Review, International Statistical Institute, vol. 77(2), pages 179-195, August.
    3. Lex Borghans & Bas Ter Weel & Bruce A. Weinberg, 2014. "People Skills and the Labor-Market Outcomes of Underrepresented Groups," ILR Review, Cornell University, ILR School, vol. 67(2), pages 287-334, April.
    4. Song Han, 2004. "Discrimination in Lending: Theory and Evidence," The Journal of Real Estate Finance and Economics, Springer, vol. 29(1), pages 5-46, July.
    5. Drechsler, Jörg & Reiter, Jerome P., 2011. "An empirical evaluation of easily implemented, nonparametric methods for generating synthetic datasets," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3232-3243, December.
    6. Becky Pettit & Stephanie Ewert, 2009. "Employment gains and wage declines: The erosion of black women’s relative wages since 1980," Demography, Springer;Population Association of America (PAA), vol. 46(3), pages 469-492, August.
    7. Richey, Jeremiah & Tromp, Nikolas, 2016. "Decomposing Black-White Wage Gaps Across Distributions: Young U.S. Men and Women in 1990 vs. 2011," MPRA Paper 74335, University Library of Munich, Germany.
    8. Borghans, Lex & ter Weel, Bas & Weinberg, Bruce A., 2005. "People People: Social Capital and the Labor-Market Outcomes of Underrepresented Groups," IZA Discussion Papers 1494, Institute of Labor Economics (IZA).
    9. Marco FUGAZZA, 2003. "Racial discrimination: Theories, facts and policy," International Labour Review, International Labour Organization, vol. 142(4), pages 507-541, December.
    10. repec:eee:labchp:v:3:y:1999:i:pc:p:3143-3259 is not listed on IDEAS
    11. Matteo Iacoviello, 2008. "Household Debt and Income Inequality, 1963–2003," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 40(5), pages 929-965, August.
    12. Longhi, Simonetta, 2017. "Spatial-Ethnic Inequalities: The Role of Location in the Estimation of Ethnic Wage Differentials," IZA Discussion Papers 11073, Institute of Labor Economics (IZA).
    13. Klein Martin & Sinha Bimal, 2013. "Statistical Analysis of Noise-Multiplied Data Using Multiple Imputation," Journal of Official Statistics, Sciendo, vol. 29(3), pages 425-465, June.
    14. Woodcock, Simon D. & Benedetto, Gary, 2009. "Distribution-preserving statistical disclosure limitation," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4228-4242, October.
    15. Sylvie Démurger & Eric A. Hanushek & Lei Zhang, 2019. "Employer Learning and the Dynamics of Returns to Universities: Evidence from Chinese Elite Education during University Expansion," NBER Working Papers 25955, National Bureau of Economic Research, Inc.
    16. Richard B. Freeman & Lawrence F. Katz, 1995. "Introduction and Summary," NBER Chapters, in: Differences and Changes in Wage Structures, pages 1-22, National Bureau of Economic Research, Inc.
    17. Joshua Snoke & Gillian M. Raab & Beata Nowok & Chris Dibben & Aleksandra Slavkovic, 2018. "General and specific utility measures for synthetic data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(3), pages 663-688, June.
    18. Yi Qian & Hui Xie, 2013. "Drive More Effective Data-Based Innovations: Enhancing the Utility of Secure Databases," NBER Working Papers 19586, National Bureau of Economic Research, Inc.
    19. Roland G. Fryer, Jr. & Devah Pager & Jörg L. Spenkuch, 2013. "Racial Disparities in Job Finding and Offered Wages," Journal of Law and Economics, University of Chicago Press, vol. 56(3), pages 633-689.
    20. Barry T. Hirsch & John V. Winters, 2014. "An Anatomy Of Racial and Ethnic Trends in Male Earnings in the U.S," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 60(4), pages 930-947, December.
    21. Amelie F. Constant & Martin Kahanec & Klaus F. Zimmermann, 2012. "The Russian–Ukrainian earnings divide," The Economics of Transition, The European Bank for Reconstruction and Development, vol. 20(1), pages 1-35, January.

    More about this item

    JEL classification:

    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • J15 - Labor and Demographic Economics - - Demographic Economics - - - Economics of Minorities, Races, Indigenous Peoples, and Immigrants; Non-labor Discrimination
    • J45 - Labor and Demographic Economics - - Particular Labor Markets - - - Public Sector Labor Markets

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:23534. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.