IDEAS home Printed from https://ideas.repec.org/p/ces/ceswps/_12521.html

The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse

Author

Listed:
  • Mahyar Habibi
  • Dirk Hovy
  • Carlo Rasmus Schwarz

Abstract

There is an ongoing debate about how to moderate toxic speech on social media and the impact of content moderation on online discourse. This paper proposes and validates a methodology for measuring the content-moderation-induced distortions in online discourse using text embeddings from computational linguistics. Applying the method to a representative sample of 5 million US political Tweets, we find that removing toxic Tweets significantly alters the semantic composition of content. The magnitudes of the distortions are comparable to removing 4 out of 67 topics from the online discourse at random. This finding is consistent across different embedding models, toxicity metrics, and samples. Importantly, we demonstrate that these effects are not solely driven by toxic language but by the removal of topics often expressed in toxic form. We propose an alternative approach to content moderation that uses generative Large Language Models to rephrase toxic Tweets, preserving their salvageable content rather than removing them entirely. We show that this rephrasing strategy reduces toxicity while mitigating distortions in online content.

Suggested Citation

  • Mahyar Habibi & Dirk Hovy & Carlo Rasmus Schwarz, 2026. "The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse," CESifo Working Paper Series 12521, CESifo.
  • Handle: RePEc:ces:ceswps:_12521
    as

    Download full text from publisher

    File URL: https://www.ifo.de/DocDL/cesifo1_wp12521.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ruben Enikolopov & Maria Petrova & Ekaterina Zhuravskaya, 2011. "Media and Political Persuasion: Evidence from Russia," American Economic Review, American Economic Association, vol. 101(7), pages 3253-3285, December.
    2. Kok Choon Tay & Calvin M. L. Chan, 2021. "Digital Transformation of Banks: The Case of DBS," World Scientific Book Chapters, in: David Kuo Chuen Lee & Ding Ding & Chong Guan (ed.), Financial Management in the Digital Economy, chapter 8, pages 141-161, World Scientific Publishing Co. Pte. Ltd..
    3. , Darmadi & Sari, Ratna, 2021. "Gaya Kepemimpinan Transformasional Dan Motivasi Kerja," Thesis Commons 9mcyn, Center for Open Science.
    4. Leonardo Bursztyn & Georgy Egorov & Ruben Enikolopov & Maria Petrova, 2019. "Social Media and Xenophobia: Evidence from Russia," NBER Working Papers 26567, National Bureau of Economic Research, Inc.
    5. Chi Wing Chu & Tony Sit & Gongjun Xu, 2021. "Transformed Dynamic Quantile Regression on Censored Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(534), pages 874-886, April.
    6. repec:nas:journl:v:115:y:2018:p:e3635-e3644 is not listed on IDEAS
    7. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2024. "The Effect of Social Media on Elections: Evidence from The United States," Journal of the European Economic Association, European Economic Association, vol. 22(3), pages 1495-1539.
    8. Sergei Guriev & Daniel Treisman, 2019. "Informational Autocrats," Journal of Economic Perspectives, American Economic Association, vol. 33(4), pages 100-127, Fall.
    9. Kung-Jeng Wang & Diwanda Ageng Rizqi & Hong-Phuc Nguyen, 2021. "Skill transfer support model based on deep learning," Journal of Intelligent Manufacturing, Springer, vol. 32(4), pages 1129-1146, April.
    10. Elliott Ash & Stephen Hansen, 2023. "Text Algorithms in Economics," Annual Review of Economics, Annual Reviews, vol. 15(1), pages 659-688, September.
    11. Holmstrom, Bengt & Milgrom, Paul, 1991. "Multitask Principal-Agent Analyses: Incentive Contracts, Asset Ownership, and Job Design," The Journal of Law, Economics, and Organization, Oxford University Press, vol. 7(0), pages 24-52, Special I.
    12. Karsten Müller & Carlo Schwarz, 2021. "Fanning the Flames of Hate: Social Media and Hate Crime [Radio and the Rise of The Nazis in Prewar Germany]," Journal of the European Economic Association, European Economic Association, vol. 19(4), pages 2131-2167.
    13. Ekaterina Zhuravskaya & Maria Petrova & Ruben Enikolopov, 2020. "Political Effects of the Internet and Social Media," Annual Review of Economics, Annual Reviews, vol. 12(1), pages 415-438, August.
    14. Ruben Enikolopov & Maria Petrova & Konstantin Sonin, 2018. "Social Media and Corruption," American Economic Journal: Applied Economics, American Economic Association, vol. 10(1), pages 150-174, January.
    15. , Yangriani, 2021. "Yangriani - Managing Digital Transformation - GSLC 1," OSF Preprints 4btj6, Center for Open Science.
    16. Emeric Henry & Ekaterina Zhuravskaya & Sergei Guriev, 2022. "Checking and Sharing Alt-Facts," American Economic Journal: Economic Policy, American Economic Association, vol. 14(3), pages 55-86, August.
    17. Li, Tianya & Wang, Kejian & Wang, Jihao & Liu, Yueqi & Han, Yufen & Xu, Zhiyang & Lin, Guangyi & Liu, Yong, 2021. "Optimization of GDL to improve water transferability," Renewable Energy, Elsevier, vol. 179(C), pages 2086-2093.
    18. Egorov, Georgy & Guriev, Sergei & Sonin, Konstantin, 2009. "Why Resource-poor Dictators Allow Freer Media: A Theory and Evidence from Panel Data," American Political Science Review, Cambridge University Press, vol. 103(4), pages 645-668, November.
    19. Emeric Henry & Ekaterina Zhuravskaya & Sergei Guriev, 2022. "Checking and Sharing Alt-Facts," PSE-Ecole d'économie de Paris (Postprint) halshs-03342759, HAL.
    20. Asim Patra & Mohammed K. A. Kaabar & Sergejs Solovjovs, 2021. "Catalan Transform of k-Balancing Sequences," International Journal of Mathematics and Mathematical Sciences, Hindawi, vol. 2021, pages 1-6, December.
    21. Jonah Busch & Irene Ring & Monique Akullo & Oyut Amarjargal & Maud Borie & Rodrigo S. Cassola & Annabelle Cruz-Trinidad & Nils Droste & Joko Tri Haryanto & Ulan Kasymov & Nataliia Viktorivna Kotenko &, 2021. "A global review of ecological fiscal transfers," Nature Sustainability, Nature, vol. 4(9), pages 756-765, September.
    22. Roberto Mosquera & Mofioluwasademi Odunowo & Trent McNamara & Xiongfei Guo & Ragan Petrie, 2020. "The economic effects of Facebook," Experimental Economics, Springer;Economic Science Association, vol. 23(2), pages 575-602, June.
    23. repec:osf:osfxxx:4btj6_v1 is not listed on IDEAS
    24. Kun Wang & Christopher W. Johnson & Kane C. Bennett & Paul A. Johnson, 2021. "Predicting fault slip via transfer learning," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    25. Emeric Henry & Ekaterina Zhuravskaya & Sergei Guriev, 2022. "Checking and Sharing Alt-Facts," Post-Print halshs-03342759, HAL.
    26. Ruben Enikolopov & Alexey Makarin & Maria Petrova, 2023. "Online Corrigendum to “Social Media and Protest Participation: Evidence From Russia”," Econometrica, Econometric Society, vol. 91(3), pages 1-24, May.
    27. George Beknazar-Yuzbashev & Rafael Jiménez-Durán & Mateusz Stalinski, 2024. "A Model of Harmful Yet Engaging Content on Social Media," AEA Papers and Proceedings, American Economic Association, vol. 114, pages 678-683, May.
    28. Robert M. Bond & Christopher J. Fariss & Jason J. Jones & Adam D. I. Kramer & Cameron Marlow & Jaime E. Settle & James H. Fowler, 2012. "A 61-million-person experiment in social influence and political mobilization," Nature, Nature, vol. 489(7415), pages 295-298, September.
    29. Dirk Bergemann & Stephen Morris, 2019. "Information Design: A Unified Perspective," Journal of Economic Literature, American Economic Association, vol. 57(1), pages 44-95, March.
    30. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2021. "The Effect of Social Media on Elections: Evidence from the United States," NBER Working Papers 28849, National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Habibi, Mahyar & Hovy, Dirk & Schwarz, Carlo, 2026. "The Content Moderators Dilemma: Removal of Toxic Content and Distortions to Online Discourse," CAGE Online Working Paper Series 793, Competitive Advantage in the Global Economy (CAGE).
    2. Li, Dan & Li, Yijun & Wang, Chaoqun & Chen, Min & Wu, Qi, 2023. "Forecasting carbon prices based on real-time decomposition and causal temporal convolutional networks," Applied Energy, Elsevier, vol. 331(C).
    3. Jiménez Durán, Rafael & Müller, Karsten & Schwarz, Carlo, 2022. "The Effect of Content Moderation on Online and Offline Hate: Evidence from Germany's NetzDG," CEPR Discussion Papers 17554, C.E.P.R. Discussion Papers.
    4. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2021. "The Effect of Social Media on Elections: Evidence from the United States," NBER Working Papers 28849, National Bureau of Economic Research, Inc.
    5. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2024. "The Effect of Social Media on Elections: Evidence from The United States," Journal of the European Economic Association, European Economic Association, vol. 22(3), pages 1495-1539.
    6. Beknazar-Yuzbashev, George & Jiménez Durán, Rafael & McCrosky, Jesse & Stalinski, Mateusz, 2025. "Toxic content and user engagement on social media: Evidence from a field experiment," Working Papers 359, The University of Chicago Booth School of Business, George J. Stigler Center for the Study of the Economy and the State.
    7. Luca Braghieri & Ro'ee Levy & Alexey Makarin, 2022. "Social Media and Mental Health," American Economic Review, American Economic Association, vol. 112(11), pages 3660-3693, November.
    8. Beknazar-Yuzbashev, George & Jiménez-Durán, Rafael & McCrosky, Jesse & Stalinski, Mateusz, 2025. "Toxic Content and User Engagement on Social Media : Evidence from a Field Experiment," The Warwick Economics Research Paper Series (TWERPS) 1543, University of Warwick, Department of Economics.
    9. Ek, Claes & Samahita, Margaret, 2023. "Too much commitment? An online experiment with tempting YouTube content," Journal of Economic Behavior & Organization, Elsevier, vol. 208(C), pages 21-38.
    10. Cariolle, Joël & Elkhateeb, Yasmine & Maurel, Mathilde, 2024. "Misinformation technology: Internet use and political misperceptions in Africa," Journal of Comparative Economics, Elsevier, vol. 52(2), pages 400-433.
    11. Andres Raphaela & Berger Lara Marie, 2025. "Digitale Medienmärkte: Was tun gegen Hassrede und Falschinformationen?," Wirtschaftsdienst, Sciendo, vol. 105(3), pages 161-166.
    12. Gisli Gylfason, 2023. "From Tweets to the Streets: Twitter and Extremist Protests in the United States," PSE Working Papers halshs-04188189, HAL.
    13. Geraci, Andrea & Nardotto, Mattia & Reggiani, Tommaso & Sabatini, Fabio, 2022. "Broadband Internet and social capital," Journal of Public Economics, Elsevier, vol. 206(C).
    14. Garz, Marcel & Szucs, Ferenc, 2023. "Algorithmic selection and supply of political news on Facebook," Information Economics and Policy, Elsevier, vol. 62(C).
    15. Beknazar-Yuzbashev, George & Jiménez-Durán, Rafael & McCrosky, Jesse & Stalinski, Mateusz, 2025. "Toxic Content and User Engagement on Social Media: Evidence from a Field Experiment," CAGE Online Working Paper Series 741, Competitive Advantage in the Global Economy (CAGE).
    16. Guy Aridor & Rafael Jiménez-Durán & Ro'ee Levy & Lena Song, 2024. "The Economics of Social Media," Journal of Economic Literature, American Economic Association, vol. 62(4), pages 1422-1474, December.
    17. Gross, Ronit D. & Halevi, Tal & Koresh, Ella & Tzach, Yarden & Kanter, Ido, 2025. "Low-latency vision transformers via large-scale multi-head attention," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 675(C).
    18. George Beknazar-Yuzbashev & Rafael Jiménez-Durán & Jesse McCrosky & Mateusz Stalinski, 2025. "Toxic Content and User Engagement on Social Media: Evidence from a Field Experiment," CESifo Working Paper Series 11644, CESifo.
    19. Leopoldo Fergusson & Carlos Molina, 2020. "Facebook Causes Protests," HiCN Working Papers 323, Households in Conflict Network.
    20. Nicolás Ajzenman & Bruno Ferman & Pedro C. Sant’Anna, 2023. "Rooting for the Same Team: On the Interplay between Political and Social Identities in the Formation of Social Ties," Working Papers 231, Red Nacional de Investigadores en Economía (RedNIE).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • L82 - Industrial Organization - - Industry Studies: Services - - - Entertainment; Media

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ces:ceswps:_12521. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Klaus Wohlrabe (email available below). General contact details of provider: https://edirc.repec.org/data/cesifde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.