IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/32686.html
   My bibliography  Save this paper

Will User-Contributed AI Training Data Eat Its Own Tail?

Author

Listed:
  • Joshua S. Gans

Abstract

This paper examines and finds that the answer is likely to be no. The environment examined starts with users who contribute based on their motives to create a public good. Their own actions determine the quality of that public good but also embed a free-rider problem. When AI is trained on that data, it can generate similar contributions to the public good. It is shown that this increases the incentive of human users to provide contributions that are more costly to supply. Thus, the overall quality of contributions from both AI and humans rises compared to human-only contributions. In situations where platform providers want to generate more contributions using explicit incentives, the rate of return on such incentives is shown to be lower in this environment.

Suggested Citation

  • Joshua S. Gans, 2024. "Will User-Contributed AI Training Data Eat Its Own Tail?," NBER Working Papers 32686, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:32686
    Note: PR
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w32686.pdf
    Download Restriction: Access to the full text is generally limited to series subscribers, however if the top level domain of the client browser is in a developing country or transition economy free access is provided. More information about subscriptions and free access is available at http://www.nber.org/wwphelp.html. Free access is also available to older working papers.
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Lena Abou El-Komboz & Moritz Goldbeck, 2023. "Career Concerns As Public Good: The Role of Signaling for Open Source Software Development," Rationality and Competition Discussion Paper Series 453, CRC TRR 190 Rationality and Competition.
    2. Josh Lerner & Jean Tirole, 2002. "Some Simple Economics of Open Source," Journal of Industrial Economics, Wiley Blackwell, vol. 50(2), pages 197-234, June.
    3. Barbosu, Sandra & Gans, Joshua S., 2022. "Storm crowds: Evidence from Zooniverse on crowd contribution design," Research Policy, Elsevier, vol. 51(1).
    4. Lei Xu & Tingting Nian & Luis Cabral, 2018. "What Makes Geeks Tick? A Study of Stack Overflow Careers," Working Papers 18-04, New York University, Leonard N. Stern School of Business, Department of Economics.
    5. Engers, Maxim & Gans, Joshua S, 1998. "Why Referees Are Not Paid (Enough)," American Economic Review, American Economic Association, vol. 88(5), pages 1341-1349, December.
    6. Lei Xu & Tingting Nian & Luís Cabral, 2020. "What Makes Geeks Tick? A Study of Stack Overflow Careers," Management Science, INFORMS, vol. 66(2), pages 587-604, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jing Wang & Gen Li & Kai-Lung Hui, 2022. "Monetary Incentives and Knowledge Spillover: Evidence from a Natural Experiment," Management Science, INFORMS, vol. 68(5), pages 3549-3572, May.
    2. Charles Ayoubi & Boris Thurm, 2023. "Knowledge diffusion and morality: Why do we freely share valuable information with Strangers?," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 32(1), pages 75-99, January.
    3. Erdem Dogukan Yilmaz & Tim Meyer & Milan Miric, 2023. "Preventing Others from Commercializing Your Innovation: Evidence from Creative Commons Licenses," Papers 2309.00536, arXiv.org.
    4. Chowdhury Mohammad Sakib Anwar & Konstantinos Georgalos, 2024. "Position uncertainty in a sequential public goods game: an experiment," Experimental Economics, Springer;Economic Science Association, vol. 27(4), pages 820-853, September.
    5. Jiang, Zhi-Qiang & Wang, Peng & Ma, Jun-Chao & Zhu, Peican & Han, Zhen & Podobnik, Boris & Stanley, H. Eugene & Zhou, Wei-Xing & Alfaro-Bittner, Karin & Boccaletti, Stefano, 2023. "Unraveling the effects of network, direct and indirect reciprocity in online societies," Chaos, Solitons & Fractals, Elsevier, vol. 169(C).
    6. Miric, Milan & Jeppesen, Lars Bo, 2023. "How does competition influence innovative effort within a platform-based ecosystem? Contrasting paid and unpaid contributors," Research Policy, Elsevier, vol. 52(7).
    7. Yi Yang & Kunpeng Zhang & Yangyang Fan, 2023. "sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics," Information Systems Research, INFORMS, vol. 34(1), pages 137-156, March.
    8. Engelhardt, Sebastian v. & Freytag, Andreas, 2013. "Institutions, culture, and open source," Journal of Economic Behavior & Organization, Elsevier, vol. 95(C), pages 90-110.
    9. Graziella Marzi, 2009. "If not for money for what? Digging into the OS/FS contributors’ motivations," Working Papers 166, University of Milano-Bicocca, Department of Economics, revised Jul 2009.
    10. Bitzer, Jürgen & Geishecker, Ingo, 2010. "Who contributes voluntarily to OSS? An investigation among German IT employees," Research Policy, Elsevier, vol. 39(1), pages 165-172, February.
    11. Arthur Schram & Boris Van Leeuwen & Theo Offerman, 2013. "Superstars Need Social Benefits: An Experiment on Network Formation," Working Papers 1306, Departament Empresa, Universitat Autònoma de Barcelona, revised Jul 2013.
    12. David, Paul A. & Shapiro, Joseph S., 2008. "Community-based production of open-source software: What do we know about the developers who participate?," Information Economics and Policy, Elsevier, vol. 20(4), pages 364-398, December.
    13. Liuan Wang & Lu (Lucy) Yan & Tongxin Zhou & Xitong Guo & Gregory R. Heim, 2020. "Understanding Physicians’ Online-Offline Behavior Dynamics: An Empirical Study," Information Systems Research, INFORMS, vol. 31(2), pages 537-555, June.
    14. Dolata, Ulrich, 2014. "Märkte und Macht der Internetkonzerne: Konzentration - Konkurrenz - Innovationsstrategien," Research Contributions to Organizational Sociology and Innovation Studies, SOI Discussion Papers 2014-04, University of Stuttgart, Institute for Social Sciences, Department of Organizational Sociology and Innovation Studies.
    15. Massimiliano Gambardella, 2011. "The Scope of Open Licenses in Cultural Contents Production and Distribution," Working Papers hal-04140977, HAL.
    16. James, Jennifer S. & Pardey, Philip G. & Alston, Julian M., 2008. "Agricultural R&D Policy: A Tragedy of the International Commons," Staff Papers 43094, University of Minnesota, Department of Applied Economics.
    17. Justus Haucap & Johannes Muck, 2015. "What drives the relevance and reputation of economics journals? An update from a survey among economists," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(3), pages 849-877, June.
    18. Li, Yung-Ming & Lee, Yi-Lin, 2010. "Pricing peer-produced services: Quality, capacity, and competition issues," European Journal of Operational Research, Elsevier, vol. 207(3), pages 1658-1668, December.
    19. Bruno Frey, 2005. "Problems with Publishing: Existing State and Solutions," European Journal of Law and Economics, Springer, vol. 19(2), pages 173-190, April.
    20. Alison J. Bianchi & Soong Moon Kang & Daniel Stewart, 2012. "The Organizational Selection of Status Characteristics: Status Evaluations in an Open Source Community," Organization Science, INFORMS, vol. 23(2), pages 341-354, April.

    More about this item

    JEL classification:

    • D70 - Microeconomics - - Analysis of Collective Decision-Making - - - General
    • H44 - Public Economics - - Publicly Provided Goods - - - Publicly Provided Goods: Mixed Markets
    • O31 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Innovation and Invention: Processes and Incentives

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:32686. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.