IDEAS home Printed from https://ideas.repec.org/a/inm/ormksc/v42y2023i1p189-207.html
   My bibliography  Save this article

Using Deep Learning to Overcome Privacy and Scalability Issues in Customer Data Transfer

Author

Listed:
  • Piyush Anand

    (Marketing Department, Jones Graduate School of Business, Rice University, Houston, Texas 77005)

  • Clarence Lee

    (Eisengard AI, San Francisco, California 94108)

Abstract

Customer privacy is increasingly important to marketers. High-profile breaches of databases containing sensitive customer information, and the growing need to build the infrastructure required to support analysis of big data present nontrivial obstacles to researchers seeking individual-level customer data from firms. In this paper, we show that recent developments in machine learning may enable firms to transfer a generative model instead of data , thus potentially obviating the process of anonymizing and sampling customer data for release for use in a variety of analytic use cases. We demonstrate the efficacy of a specific deep learning model, generative adversarial networks (GANs), in preserving desired characteristics of original data. We validate in real-world settings and find that GANs outperform benchmarks on the accuracy-privacy tradeoff. We also demonstrate that GANs can be used to solve marketing problems of price markups for optimal profits and customer targeting. Finally, we demonstrate that GANs have volume and velocity advantages, as the size of informational transfer grows according to model complexity, and it can readily handle real-time data streams.

Suggested Citation

  • Piyush Anand & Clarence Lee, 2023. "Using Deep Learning to Overcome Privacy and Scalability Issues in Customer Data Transfer," Marketing Science, INFORMS, vol. 42(1), pages 189-207, January.
  • Handle: RePEc:inm:ormksc:v:42:y:2023:i:1:p:189-207
    DOI: 10.1287/mksc.2022.1365
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mksc.2022.1365
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mksc.2022.1365?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Artem Timoshenko & John R. Hauser, 2019. "Identifying Customer Needs from User-Generated Content," Marketing Science, INFORMS, vol. 38(1), pages 1-20, January.
    2. Igal Hendel & Aviv Nevo, 2006. "Sales and consumer inventory," RAND Journal of Economics, RAND Corporation, vol. 37(3), pages 543-561, September.
    3. Reiter, Jerome P., 2005. "Estimating Risks of Identification Disclosure in Microdata," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1103-1112, December.
    4. Avi Goldfarb & Catherine Tucker, 2011. "Online Display Advertising: Targeting and Obtrusiveness," Marketing Science, INFORMS, vol. 30(3), pages 389-404, 05-06.
    5. Lingxiao Huang & K. Sudhir & Nisheeth K. Vishnoi, 2020. "Coresets for Regressions with Panel Data," Papers 2011.00981, arXiv.org, revised Nov 2020.
    6. John M. Abowd & Kaj Gittings & Kevin L. McKinney & Bryce E. Stephens & Lars Vilhuber & Simon Woodcock, 2012. "Dynamically Consistent Noise Infusion and Partially Synthetic Data as Confidentiality Protection Measures for Related Time Series," Working Papers 12-13, Center for Economic Studies, U.S. Census Bureau.
    7. Matthew J. Schneider & Sharan Jagpal & Sachin Gupta & Shaobo Li & Yan Yu, 2018. "A Flexible Method for Protecting Marketing Data: An Application to Point-of-Sale Data," Marketing Science, INFORMS, vol. 37(1), pages 153-171, January.
    8. Steven Tenn, 2006. "Avoiding aggregation bias in demand estimation: A multivariate promotional disaggregation approach," Quantitative Marketing and Economics (QME), Springer, vol. 4(4), pages 383-405, December.
    9. Omid Rafieian & Hema Yoganarasimhan, 2021. "Targeting and Privacy in Mobile Advertising," Marketing Science, INFORMS, vol. 40(2), pages 193-218, March.
    10. Leeflang, P.S.H. & Wittink, Dick R., 2000. "Building models for marketing decisions: past, present and future," Research Report 00F20, University of Groningen, Research Institute SOM (Systems, Organisations and Management).
    11. Avi Goldfarb & Catherine Tucker, 2011. "Rejoinder--Implications of "Online Display Advertising: Targeting and Obtrusiveness"," Marketing Science, INFORMS, vol. 30(3), pages 413-415, 05-06.
    12. Aron Culotta & Jennifer Cutler, 2016. "Mining Brand Perceptions from Twitter Social Networks," Marketing Science, INFORMS, vol. 35(3), pages 343-362, May.
    13. Dinesh Puranam & Vishal Narayan & Vrinda Kadiyali, 2017. "The Effect of Calorie Posting Regulation on Consumer Opinion: A Flexible Latent Dirichlet Allocation Model with Informative Priors," Marketing Science, INFORMS, vol. 36(5), pages 726-746, September.
    14. repec:dgr:rugsom:00f20 is not listed on IDEAS
    15. Chang Hee Park & Young-Hoon Park, 2016. "Investigating Purchase Conversion by Uncovering Online Visit Patterns," Marketing Science, INFORMS, vol. 35(6), pages 894-914, November.
    16. Eguchi, Shinto & Copas, John, 2006. "Interpreting Kullback-Leibler divergence with the Neyman-Pearson lemma," Journal of Multivariate Analysis, Elsevier, vol. 97(9), pages 2034-2040, October.
    17. Pradeep Chintagunta & Dominique M. Hanssens & John R. Hauser, 2016. "Editorial—Marketing Science and Big Data," Marketing Science, INFORMS, vol. 35(3), pages 341-342, May.
    18. Matthew J. Schneider & John M. Abowd, 2015. "A new method for protecting interrelated time series with Bayesian prior distributions and synthetic data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(4), pages 963-975, October.
    19. Thomas J. Steenburgh & Andrew Ainslie & Peder Hans Engebretson, 2003. "Massively Categorical Variables: Revealing the Information in Zip Codes," Marketing Science, INFORMS, vol. 22(1), pages 40-57, August.
    20. Xiao Liu & Param Vir Singh & Kannan Srinivasan, 2016. "A Structured Analysis of Unstructured Big Data by Leveraging Cloud Computing," Marketing Science, INFORMS, vol. 35(3), pages 363-388, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wieringa, Jaap & Kannan, P.K. & Ma, Xiao & Reutterer, Thomas & Risselada, Hans & Skiera, Bernd, 2021. "Data analytics in a privacy-concerned world," Journal of Business Research, Elsevier, vol. 122(C), pages 915-925.
    2. Ming-Hui Huang & Roland T. Rust, 2021. "A strategic framework for artificial intelligence in marketing," Journal of the Academy of Marketing Science, Springer, vol. 49(1), pages 30-50, January.
    3. Matthew J. Schneider & Sharan Jagpal & Sachin Gupta & Shaobo Li & Yan Yu, 2018. "A Flexible Method for Protecting Marketing Data: An Application to Point-of-Sale Data," Marketing Science, INFORMS, vol. 37(1), pages 153-171, January.
    4. Vinay Singh & Brijesh Nanavati & Arpan Kumar Kar & Agam Gupta, 2023. "How to Maximize Clicks for Display Advertisement in Digital Marketing? A Reinforcement Learning Approach," Information Systems Frontiers, Springer, vol. 25(4), pages 1621-1638, August.
    5. David A. Schweidel & Yakov Bart & J. Jeffrey Inman & Andrew T. Stephen & Barak Libai & Michelle Andrews & Ana Babić Rosario & Inyoung Chae & Zoey Chen & Daniella Kupor & Chiara Longoni & Felipe Thomaz, 2022. "How consumer digital signals are reshaping the customer journey," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1257-1276, November.
    6. Harikesh S. Nair & Sanjog Misra & William J. Hornbuckle IV & Ranjan Mishra & Anand Acharya, 2017. "Big Data and Marketing Analytics in Gaming: Combining Empirical Models and Field Experimentation," Marketing Science, INFORMS, vol. 36(5), pages 699-725, September.
    7. Paramveer S. Dhillon & Sinan Aral, 2021. "Modeling Dynamic User Interests: A Neural Matrix Factorization Approach," Marketing Science, INFORMS, vol. 40(6), pages 1059-1080, November.
    8. Xiang Hui & Meng Liu & Tat Chan, 2023. "Targeted incentives, broad impacts: Evidence from an E-commerce platform," Quantitative Marketing and Economics (QME), Springer, vol. 21(4), pages 493-517, December.
    9. Alantari, Huwail J. & Currim, Imran S. & Deng, Yiting & Singh, Sameer, 2022. "An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews," International Journal of Research in Marketing, Elsevier, vol. 39(1), pages 1-19.
    10. Ning Zhong & David A. Schweidel, 2020. "Capturing Changes in Social Media Content: A Multiple Latent Changepoint Topic Model," Marketing Science, INFORMS, vol. 39(4), pages 827-846, July.
    11. Robert W. Palmatier & Andrew T. Crecelius, 2019. "The “first principles” of marketing strategy," AMS Review, Springer;Academy of Marketing Science, vol. 9(1), pages 5-26, June.
    12. Beke, Frank T. & Eggers, Felix & Verhoef, Peter C. & Wieringa, Jaap E., 2022. "Consumers’ privacy calculus: The PRICAL index development and validation," International Journal of Research in Marketing, Elsevier, vol. 39(1), pages 20-41.
    13. Xiang Hui & Meng Liu & Tat Chan, 2022. "Targeted Incentives, Broad Impacts: Evidence from an E-commerce Platform," CESifo Working Paper Series 9894, CESifo.
    14. Mengxia Zhang & Lan Luo, 2023. "Can Consumer-Posted Photos Serve as a Leading Indicator of Restaurant Survival? Evidence from Yelp," Management Science, INFORMS, vol. 69(1), pages 25-50, January.
    15. Shun-Yang Lee & Julian Runge & Daniel Yoo & Yakov Bart & Anett Gyurak & J. W. Schneider, 2023. "COVID-19 Demand Shocks Revisited: Did Advertising Technology Help Mitigate Adverse Consequences for Small and Midsize Businesses?," Papers 2307.09035, arXiv.org, revised Jan 2024.
    16. Potoglou, Dimitris & Palacios, Juan & Feijoo, Claudio & Gómez Barroso, Jose-Luis, 2015. "The supply of personal information: A study on the determinants of information provision in e-commerce scenarios," 26th European Regional ITS Conference, Madrid 2015 127174, International Telecommunications Society (ITS).
    17. Yanwen Wang & Chunhua Wu & Ting Zhu, 2019. "Mobile Hailing Technology and Taxi Driving Behaviors," Marketing Science, INFORMS, vol. 38(5), pages 734-755, September.
    18. Yi Yang & Kunpeng Zhang & Yangyang Fan, 2023. "sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics," Information Systems Research, INFORMS, vol. 34(1), pages 137-156, March.
    19. Randall Lewis & Dan Nguyen, 2015. "Display advertising’s competitive spillovers to consumer search," Quantitative Marketing and Economics (QME), Springer, vol. 13(2), pages 93-115, June.
    20. Bag, Sujoy & Tiwari, Manoj Kumar & Chan, Felix T.S., 2019. "Predicting the consumer's purchase intention of durable goods: An attribute-level analysis," Journal of Business Research, Elsevier, vol. 94(C), pages 408-419.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormksc:v:42:y:2023:i:1:p:189-207. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.