How to Sell High-Dimensional Data Optimally

How to Sell High-Dimensional Data Optimally

Author

Listed:

Andrew Li
R. Ravi
Karan Singh
Zihong Yi
Weizhong Zhang

Abstract

Motivated by the problem of selling large, proprietary data, we consider an information pricing problem proposed by Bergemann et al. that involves a decision-making buyer and a monopolistic seller. The seller has access to the underlying state of the world that determines the utility of the various actions the buyer may take. Since the buyer gains greater utility through better decisions resulting from more accurate assessments of the state, the seller can therefore promise the buyer supplemental information at a price. To contend with the fact that the seller may not be perfectly informed about the buyer's private preferences (or utility), we frame the problem of designing a data product as one where the seller designs a revenue-maximizing menu of statistical experiments. Prior work by Cai et al. showed that an optimal menu can be found in time polynomial in the state space, whereas we observe that the state space is naturally exponential in the dimension of the data. We propose an algorithm which, given only sampling access to the state space, provably generates a near-optimal menu with a number of samples independent of the state space. We then analyze a special case of high-dimensional Gaussian data, showing that (a) it suffices to consider scalar Gaussian experiments, (b) the optimal menu of such experiments can be found efficiently via a semidefinite program, and (c) full surplus extraction occurs if and only if a natural separation condition holds on the set of potential preferences of the buyer.

Suggested Citation

Andrew Li & R. Ravi & Karan Singh & Zihong Yi & Weizhong Zhang, 2025. "How to Sell High-Dimensional Data Optimally," Papers 2510.15214, arXiv.org.

Handle: RePEc:arx:papers:2510.15214

Download full text from publisher

References listed on IDEAS

Emir Kamenica, 2019. "Bayesian Persuasion and Information Design," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 249-272, August.
Charles I. Jones & Christopher Tonetti, 2020. "Nonrivalry and the Economics of Data," American Economic Review, American Economic Association, vol. 110(9), pages 2819-2858, September.
- Charles Jones & Christopher Tonetti, 2018. "Nonrivalry and the Economics of Data," 2018 Meeting Papers 477, Society for Economic Dynamics.
- Charles I. Jones & Christopher Tonetti, 2019. "Nonrivalry and the Economics of Data," NBER Working Papers 26260, National Bureau of Economic Research, Inc.
Péter Eső & Balázs Szentes, 2007. "Optimal Information Disclosure in Auctions and the Handicap Auction," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(3), pages 705-731.
Alonso, Ricardo & Câmara, Odilon, 2016. "Bayesian persuasion with heterogeneous priors," Journal of Economic Theory, Elsevier, vol. 165(C), pages 672-706.
- Alonso, Ricardo & Câmara, Odilon, 2016. "Bayesian persuasion with heterogeneous priors," LSE Research Online Documents on Economics 67950, London School of Economics and Political Science, LSE Library.
John Riley & Richard Zeckhauser, 1983. "Optimal Selling Strategies: When to Haggle, When to Hold Firm," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 98(2), pages 267-289.
Roger B. Myerson, 1981. "Optimal Auction Design," Mathematics of Operations Research, INFORMS, vol. 6(1), pages 58-73, February.
- Roger B. Myerson, 1978. "Optimal Auction Design," Discussion Papers 362, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
Admati, Anat R. & Pfleiderer, Paul, 1986. "A monopolistic market for information," Journal of Economic Theory, Elsevier, vol. 39(2), pages 400-438, August.
Dirk Bergemann & Alessandro Bonatti & Alex Smolin, 2018. "The Design and Price of Information," American Economic Review, American Economic Association, vol. 108(1), pages 1-48, January.
- Dirk Bergemann & Alessandro Bonatti & Alex Smolin, 2016. "The Design and Price of Information," Cowles Foundation Discussion Papers 2049, Cowles Foundation for Research in Economics, Yale University.
- Bergemann, Dirk & Bonatti, Alessandro & Smolin, Alex, 2016. "The Design and Price of Information," CEPR Discussion Papers 11412, C.E.P.R. Discussion Papers.
- Dirk Bergemann & Alessandro Bonatti & Alex Smolin, 2017. "The Design and Price of Information," Cowles Foundation Discussion Papers 2049R, Cowles Foundation for Research in Economics, Yale University.
Dirk Bergemann & Stephen Morris, 2019. "Information Design: A Unified Perspective," Journal of Economic Literature, American Economic Association, vol. 57(1), pages 44-95, March.
- Dirk Bergemann & Stephen Morris, 2017. "Information Design: A Unified Perspective," Working Papers 089_2017, Princeton University, Department of Economics, Econometric Research Program..
- Dirk Bergemann & Stephen Morris, 2017. "Information Design: A Unified Perspective," Cowles Foundation Discussion Papers 2075R, Cowles Foundation for Research in Economics, Yale University, revised Mar 2017.
- Dirk Bergemann & Stephen Morris, 2017. "Information Design: A Unified Perspective," Cowles Foundation Discussion Papers 2075, Cowles Foundation for Research in Economics, Yale University.
- Dirk Bergemann & Stephen Morris, 2017. "Information Design: A Unified Perspective," Cowles Foundation Discussion Papers 2075R3, Cowles Foundation for Research in Economics, Yale University, revised Mar 2018.
- Bergemann, Dirk & Morris, Stephen, 2017. "Information Design: A Unified Perspective," CEPR Discussion Papers 11867, C.E.P.R. Discussion Papers.
- Dirk Bergemann & Stephen Morris, 2017. "Information Design: A Unified Perspective," Cowles Foundation Discussion Papers 2075R2, Cowles Foundation for Research in Economics, Yale University, revised Nov 2017.
Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asu Ozdaglar, 2022. "Too Much Data: Prices and Inefficiencies in Data Markets," American Economic Journal: Microeconomics, American Economic Association, vol. 14(4), pages 218-256, November.
- Acemoglu, Daron & Makhdoumi, Ali & Ozdaglar, Asuman & Malekian, Azarakhsh, 2019. "Too Much Data: Prices and Inefficiencies in Data Markets," CEPR Discussion Papers 14225, C.E.P.R. Discussion Papers.
- Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asuman Ozdaglar, 2019. "Too Much Data: Prices and Inefficiencies in Data Markets," NBER Working Papers 26296, National Bureau of Economic Research, Inc.
Admati, Anat R & Pfleiderer, Paul, 1990. "Direct and Indirect Sale of Information," Econometrica, Econometric Society, vol. 58(4), pages 901-928, July.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Bonatti, Alessandro & Dahleh, Munther & Horel, Thibaut & Nouripour, Amir, 2024. "Selling information in competitive environments," Journal of Economic Theory, Elsevier, vol. 216(C).
Bergemann, Dirk & Ottaviani, Marco, 2021. "Information Markets and Nonmarkets," CEPR Discussion Papers 16459, C.E.P.R. Discussion Papers.
- Dirk Bergemann & Marco Ottaviani, 2021. "Information Markets and Nonmarkets," Cowles Foundation Discussion Papers 2296, Cowles Foundation for Research in Economics, Yale University.
Yingkai Li, 2021. "Selling Data to an Agent with Endogenous Information," Papers 2103.05788, arXiv.org, revised Aug 2023.
Agarwal, Anish & Dahleh, Munther & Horel, Thibaut & Rui, Maryann, 2024. "Towards data auctions with externalities," Games and Economic Behavior, Elsevier, vol. 148(C), pages 323-356.
Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asu Ozdaglar, 2022. "Too Much Data: Prices and Inefficiencies in Data Markets," American Economic Journal: Microeconomics, American Economic Association, vol. 14(4), pages 218-256, November.
- Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asuman Ozdaglar, 2019. "Too Much Data: Prices and Inefficiencies in Data Markets," NBER Working Papers 26296, National Bureau of Economic Research, Inc.
- Acemoglu, Daron & Makhdoumi, Ali & Ozdaglar, Asuman & Malekian, Azarakhsh, 2019. "Too Much Data: Prices and Inefficiencies in Data Markets," CEPR Discussion Papers 14225, C.E.P.R. Discussion Papers.
Jiadong Gu, 2024. "Data Trade and Consumer Privacy," Papers 2406.12457, arXiv.org, revised Jan 2026.
Teddy Mekonnen & Bobak Pakzad-Hurson, 2024. "Competition, Persuasion, and Search," Papers 2411.11183, arXiv.org, revised Sep 2025.
Elliott, M. & Galeotti., A. & Koh., A. & Li, W., 2021. "Market Segmentation Through Information," Janeway Institute Working Papers 2114, Faculty of Economics, University of Cambridge.
Wang, Han, 2025. "Contracting with heterogeneous researchers," Games and Economic Behavior, Elsevier, vol. 150(C), pages 278-294.
Shih-Tang Su & Vijay G. Subramanian, 2022. "Order of Commitments in Bayesian Persuasion with Partial-informed Senders," Papers 2202.06479, arXiv.org.
Alessandro Bonatti, 2023. "The Platform Dimension of Digital Privacy," NBER Chapters, in: The Economics of Privacy, pages 73-96, National Bureau of Economic Research, Inc.
Galperti, Simone & Trevino, Isabel, 2020. "Coordination motives and competition for attention in information markets," Journal of Economic Theory, Elsevier, vol. 188(C).
Liu, Ernest & Ma, Song & Veldkamp, Laura, 2025. "Data sales and data dilution," Journal of Financial Economics, Elsevier, vol. 169(C).
Mierendorff, Konrad, 2016. "Optimal dynamic mechanism design with deadlines," Journal of Economic Theory, Elsevier, vol. 161(C), pages 190-222.
Andreas A. Haupt & Nicole Immorlica & Brendan Lucier, 2023. "Certification Design for a Competitive Market," Papers 2301.13449, arXiv.org.
Chan, Jimmy & Gupta, Seher & Li, Fei & Wang, Yun, 2019. "Pivotal persuasion," Journal of Economic Theory, Elsevier, vol. 180(C), pages 178-202.
- Jimmy Chan & Seher Gupta & Fei Li & Yun Wang, 2018. "Pivotal Persuasion," Working Papers 2018-11-03, Wang Yanan Institute for Studies in Economics (WISE), Xiamen University.
Arlindo Skenderaj, 2025. "Selling supplemental information," Papers 2511.14103, arXiv.org.
David Bounies & Antoine Dubus & Patrick Waelbroeck, 2020. "Market for Information and Selling Mechanisms," Working Papers ECARES 2020-07, ULB -- Universite Libre de Bruxelles.
- David Bounie & Antoine Dubus & Patrick Waelbroeck, 2022. "Market for Information and Selling Mechanisms," CER-ETH Economics working paper series 22/367, CER-ETH - Center of Economic Research (CER-ETH) at ETH Zurich.
- David Bounie & Antoine Dubus & Patrick Waelbroeck, 2020. "Market for Information and Selling Mechanisms," CESifo Working Paper Series 8307, CESifo.
Dirk Bergemann & Alessandro Bonatti & Alex Smolin, 2018. "The Design and Price of Information," American Economic Review, American Economic Association, vol. 108(1), pages 1-48, January.
- Bergemann, Dirk & Bonatti, Alessandro & Smolin, Alex, 2016. "The Design and Price of Information," CEPR Discussion Papers 11412, C.E.P.R. Discussion Papers.
- Dirk Bergemann & Alessandro Bonatti & Alex Smolin, 2016. "The Design and Price of Information," Cowles Foundation Discussion Papers 2049, Cowles Foundation for Research in Economics, Yale University.
- Dirk Bergemann & Alessandro Bonatti & Alex Smolin, 2017. "The Design and Price of Information," Cowles Foundation Discussion Papers 2049R, Cowles Foundation for Research in Economics, Yale University.
Wu, Haoyang, 2022. "A type-adjustable mechanism where the designer may obtain more payoffs by optimally controlling distributions of agents' types," MPRA Paper 113150, University Library of Munich, Germany.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-DES-2025-10-27 (Economic Design)
NEP-MIC-2025-10-27 (Microeconomics)
NEP-UPT-2025-10-27 (Utility Models and Prospect Theory)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2510.15214. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

How to Sell High-Dimensional Data Optimally

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data