IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2510.15214.html
   My bibliography  Save this paper

How to Sell High-Dimensional Data Optimally

Author

Listed:
  • Andrew Li
  • R. Ravi
  • Karan Singh
  • Zihong Yi
  • Weizhong Zhang

Abstract

Motivated by the problem of selling large, proprietary data, we consider an information pricing problem proposed by Bergemann et al. that involves a decision-making buyer and a monopolistic seller. The seller has access to the underlying state of the world that determines the utility of the various actions the buyer may take. Since the buyer gains greater utility through better decisions resulting from more accurate assessments of the state, the seller can therefore promise the buyer supplemental information at a price. To contend with the fact that the seller may not be perfectly informed about the buyer's private preferences (or utility), we frame the problem of designing a data product as one where the seller designs a revenue-maximizing menu of statistical experiments. Prior work by Cai et al. showed that an optimal menu can be found in time polynomial in the state space, whereas we observe that the state space is naturally exponential in the dimension of the data. We propose an algorithm which, given only sampling access to the state space, provably generates a near-optimal menu with a number of samples independent of the state space. We then analyze a special case of high-dimensional Gaussian data, showing that (a) it suffices to consider scalar Gaussian experiments, (b) the optimal menu of such experiments can be found efficiently via a semidefinite program, and (c) full surplus extraction occurs if and only if a natural separation condition holds on the set of potential preferences of the buyer.

Suggested Citation

  • Andrew Li & R. Ravi & Karan Singh & Zihong Yi & Weizhong Zhang, 2025. "How to Sell High-Dimensional Data Optimally," Papers 2510.15214, arXiv.org.
  • Handle: RePEc:arx:papers:2510.15214
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2510.15214
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Dirk Bergemann & Alessandro Bonatti & Alex Smolin, 2018. "The Design and Price of Information," American Economic Review, American Economic Association, vol. 108(1), pages 1-48, January.
    2. Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asu Ozdaglar, 2022. "Too Much Data: Prices and Inefficiencies in Data Markets," American Economic Journal: Microeconomics, American Economic Association, vol. 14(4), pages 218-256, November.
    3. Dirk Bergemann & Stephen Morris, 2019. "Information Design: A Unified Perspective," Journal of Economic Literature, American Economic Association, vol. 57(1), pages 44-95, March.
    4. Emir Kamenica, 2019. "Bayesian Persuasion and Information Design," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 249-272, August.
    5. Charles I. Jones & Christopher Tonetti, 2020. "Nonrivalry and the Economics of Data," American Economic Review, American Economic Association, vol. 110(9), pages 2819-2858, September.
    6. Péter Eső & Balázs Szentes, 2007. "Optimal Information Disclosure in Auctions and the Handicap Auction," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(3), pages 705-731.
    7. Alonso, Ricardo & Câmara, Odilon, 2016. "Bayesian persuasion with heterogeneous priors," Journal of Economic Theory, Elsevier, vol. 165(C), pages 672-706.
    8. John Riley & Richard Zeckhauser, 1983. "Optimal Selling Strategies: When to Haggle, When to Hold Firm," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 98(2), pages 267-289.
    9. Roger B. Myerson, 1981. "Optimal Auction Design," Mathematics of Operations Research, INFORMS, vol. 6(1), pages 58-73, February.
    10. Admati, Anat R. & Pfleiderer, Paul, 1986. "A monopolistic market for information," Journal of Economic Theory, Elsevier, vol. 39(2), pages 400-438, August.
    11. Admati, Anat R & Pfleiderer, Paul, 1990. "Direct and Indirect Sale of Information," Econometrica, Econometric Society, vol. 58(4), pages 901-928, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bonatti, Alessandro & Dahleh, Munther & Horel, Thibaut & Nouripour, Amir, 2024. "Selling information in competitive environments," Journal of Economic Theory, Elsevier, vol. 216(C).
    2. Bergemann, Dirk & Ottaviani, Marco, 2021. "Information Markets and Nonmarkets," CEPR Discussion Papers 16459, C.E.P.R. Discussion Papers.
    3. Yingkai Li, 2021. "Selling Data to an Agent with Endogenous Information," Papers 2103.05788, arXiv.org, revised Aug 2023.
    4. Agarwal, Anish & Dahleh, Munther & Horel, Thibaut & Rui, Maryann, 2024. "Towards data auctions with externalities," Games and Economic Behavior, Elsevier, vol. 148(C), pages 323-356.
    5. Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asu Ozdaglar, 2022. "Too Much Data: Prices and Inefficiencies in Data Markets," American Economic Journal: Microeconomics, American Economic Association, vol. 14(4), pages 218-256, November.
    6. Jiadong Gu, 2024. "Data Trade and Consumer Privacy," Papers 2406.12457, arXiv.org, revised Jul 2024.
    7. Teddy Mekonnen & Bobak Pakzad-Hurson, 2024. "Competition, Persuasion, and Search," Papers 2411.11183, arXiv.org, revised Sep 2025.
    8. David Bounies & Antoine Dubus & Patrick Waelbroeck, 2020. "Market for Information and Selling Mechanisms," Working Papers ECARES 2020-07, ULB -- Universite Libre de Bruxelles.
    9. Elliott, M. & Galeotti., A. & Koh., A. & Li, W., 2021. "Market Segmentation Through Information," Janeway Institute Working Papers 2114, Faculty of Economics, University of Cambridge.
    10. Alex Smolin, 2023. "Disclosure and pricing of attributes," RAND Journal of Economics, RAND Corporation, vol. 54(4), pages 570-597, December.
    11. Wang, Han, 2025. "Contracting with heterogeneous researchers," Games and Economic Behavior, Elsevier, vol. 150(C), pages 278-294.
    12. Shih-Tang Su & Vijay G. Subramanian, 2022. "Order of Commitments in Bayesian Persuasion with Partial-informed Senders," Papers 2202.06479, arXiv.org.
    13. Alessandro Bonatti, 2023. "The Platform Dimension of Digital Privacy," NBER Chapters, in: The Economics of Privacy, pages 73-96, National Bureau of Economic Research, Inc.
    14. Galperti, Simone & Trevino, Isabel, 2020. "Coordination motives and competition for attention in information markets," Journal of Economic Theory, Elsevier, vol. 188(C).
    15. Liu, Ernest & Ma, Song & Veldkamp, Laura, 2025. "Data sales and data dilution," Journal of Financial Economics, Elsevier, vol. 169(C).
    16. Christa Gibbs & Benedict Guttman-Kenney & Donghoon Lee & Scott Nelson & Wilbert van der Klaauw & Jialan Wang, 2025. "Consumer Credit Reporting Data," Journal of Economic Literature, American Economic Association, vol. 63(2), pages 598-636, June.
    17. Mierendorff, Konrad, 2016. "Optimal dynamic mechanism design with deadlines," Journal of Economic Theory, Elsevier, vol. 161(C), pages 190-222.
    18. Andreas A. Haupt & Nicole Immorlica & Brendan Lucier, 2023. "Certification Design for a Competitive Market," Papers 2301.13449, arXiv.org.
    19. Chan, Jimmy & Gupta, Seher & Li, Fei & Wang, Yun, 2019. "Pivotal persuasion," Journal of Economic Theory, Elsevier, vol. 180(C), pages 178-202.
      • Jimmy Chan & Seher Gupta & Fei Li & Yun Wang, 2018. "Pivotal Persuasion," Working Papers 2018-11-03, Wang Yanan Institute for Studies in Economics (WISE), Xiamen University.
    20. Dirk Bergemann & Tibor Heumann & Stephen Morris, 2022. "Screening with Persuasion," Cowles Foundation Discussion Papers 2338, Cowles Foundation for Research in Economics, Yale University.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2510.15214. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.