Printed from https://ideas.repec.org/a/gam/jftint/v15y2023i12p397-d1297089.html

Methodological Approach for Identifying Websites with Infringing Content via Text Transformers and Dense Neural Networks

Author

Listed:
  • Aldo Hernandez-Suarez

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Gabriel Sanchez-Perez

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Linda Karina Toscano-Medina

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Hector Manuel Perez-Meana

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Jose Portillo-Portillo

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

  • Jesus Olivares-Mercado

    (Instituto Politecnico Nacional, ESIME Culhuacan, Mexico City 04440, Mexico)

Abstract

The rapid evolution of the Internet of Everything (IoE) has significantly enhanced global connectivity and multimedia content sharing, but it has also escalated the unauthorized distribution of multimedia content, posing risks to intellectual property rights. In 2022 alone, about 130 billion accesses to potentially non-compliant websites were recorded, underscoring the challenges for industries reliant on copyright-protected assets. Amid prevailing uncertainties and the need for technical and AI-integrated solutions, this study makes two contributions. First, it establishes a novel taxonomy for safeguarding against and identifying IoE-based content infringements. Second, it proposes an architecture that combines IoE components with automated sensors to compile a dataset reflective of potential copyright breaches. This dataset is analyzed using a Natural Language Processing (NLP) algorithm based on Bidirectional Encoder Representations from Transformers (BERT), further fine-tuned by a dense neural network (DNN), which achieves 98.71% accuracy in identifying websites that violate copyright.
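
The pipeline outlined in the abstract (text embeddings fed into a dense classifier that flags infringing sites) can be illustrated with a minimal sketch. This is not the authors' implementation: the hashed bag-of-words `embed` function is a stand-in for a BERT embedding, the layer sizes, learning rate, and the four labelled example pages are hypothetical, and nothing here reproduces the reported 98.71% accuracy.

```python
import math
import random
import zlib

DIM = 16     # size of the stand-in embedding (an assumption; BERT-base uses 768)
HIDDEN = 8   # width of the single hidden dense layer (an assumption)

def embed(text):
    """Stand-in for a transformer embedding: hashed bag of words, L2-normalised."""
    vec = [0.0] * DIM
    for token in text.lower().split():
        vec[zlib.crc32(token.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def init_params(rng):
    """Small random weights for a DIM -> HIDDEN -> 1 dense network."""
    return {
        "W1": [[rng.uniform(-0.5, 0.5) for _ in range(DIM)] for _ in range(HIDDEN)],
        "b1": [0.0] * HIDDEN,
        "W2": [rng.uniform(-0.5, 0.5) for _ in range(HIDDEN)],
        "b2": 0.0,
    }

def forward(params, x):
    """Return (hidden activations, probability that the page is infringing)."""
    h = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(params["W1"], params["b1"])]
    z = sum(w * hi for w, hi in zip(params["W2"], h)) + params["b2"]
    z = max(-30.0, min(30.0, z))  # clamp the logit to keep exp() well-behaved
    return h, 1.0 / (1.0 + math.exp(-z))

def train_step(params, x, y, lr=0.1):
    """One SGD step on binary cross-entropy for a single example."""
    h, p = forward(params, x)
    dz = p - y  # gradient of the loss with respect to the output logit
    for j in range(HIDDEN):
        dh = dz * params["W2"][j] if h[j] > 0 else 0.0  # ReLU gate
        params["W2"][j] -= lr * dz * h[j]
        for i in range(DIM):
            params["W1"][j][i] -= lr * dh * x[i]
        params["b1"][j] -= lr * dh
    params["b2"] -= lr * dz

def accuracy(preds, labels):
    """Fraction of predictions that match the labels."""
    return sum(int(p == t) for p, t in zip(preds, labels)) / len(labels)

# Hypothetical toy data: 1 = potentially infringing, 0 = compliant.
pages = [
    ("watch full movie free stream hd no signup", 1),
    ("download cracked software torrent free", 1),
    ("official store buy licensed music album", 0),
    ("university library catalogue opening hours", 0),
]

rng = random.Random(0)
params = init_params(rng)
data = [(embed(text), label) for text, label in pages]
for _ in range(200):
    for x, y in data:
        train_step(params, x, y)

preds = [int(forward(params, x)[1] >= 0.5) for x, _ in data]
print("training accuracy:", accuracy(preds, [y for _, y in data]))
```

In a realistic setting the `embed` function would be replaced by embeddings from a pretrained transformer, and accuracy would be measured on held-out pages rather than on the training examples.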

Suggested Citation

  • Aldo Hernandez-Suarez & Gabriel Sanchez-Perez & Linda Karina Toscano-Medina & Hector Manuel Perez-Meana & Jose Portillo-Portillo & Jesus Olivares-Mercado, 2023. "Methodological Approach for Identifying Websites with Infringing Content via Text Transformers and Dense Neural Networks," Future Internet, MDPI, vol. 15(12), pages 1-31, December.
  • Handle: RePEc:gam:jftint:v:15:y:2023:i:12:p:397-:d:1297089

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/15/12/397/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/15/12/397/
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian Peukert & Margaritha Windisch, 2023. "The Economics of Copyright in the Digital Age," CESifo Working Paper Series 10687, CESifo.
    2. Luis Aguiar & Jörg Claussen & Christian Peukert, 2018. "Catch Me If You Can: Effectiveness and Consequences of Online Copyright Enforcement," Information Systems Research, INFORMS, vol. 29(3), pages 656-678, September.
    3. Marc Ivaldi & Ambre Nicolle & Frank Verboven & Jiekai Zhang, 2024. "Displacement and complementarity in the recorded music industry: evidence from France," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 48(1), pages 43-94, March.
    4. Steven James Watson & Daniel John Zizzo & Piers Fleming, 2015. "Determinants of Unlawful File Sharing: A Scoping Review," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-23, June.
    5. Essling, Christian & Koenen, Johannes & Peukert, Christian, 2017. "Competition for attention in the digital age: The case of single releases in the recorded music industry," Information Economics and Policy, Elsevier, vol. 40(C), pages 26-40.
    6. Christophe Bellégo & Romain De Nijs, 2020. "The Unintended Consequences of Antipiracy Laws on Markets with Asymmetric Piracy: The Case of the French Movie Industry," Information Systems Research, INFORMS, vol. 31(4), pages 1064-1086, December.
    7. Hong Luo & Julie Holland Mortimer, 2017. "Copyright Enforcement: Evidence from Two Field Experiments," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 26(2), pages 499-528, June.
    8. Lee, Jonathan F., 2018. "Purchase, pirate, publicize: Private-network music sharing and market album sales," Information Economics and Policy, Elsevier, vol. 42(C), pages 35-55.
    9. Wojciech Hardy & Michał Krawczyk, 2023. "Internet piracy and book sales: a field experiment," GRAPE Working Papers 93, GRAPE Group for Research in Applied Economics.
    10. Kanazawa, Kyogo & Kawaguchi, Kohei, 2022. "Displacement effects of public libraries," Journal of the Japanese and International Economies, Elsevier, vol. 66(C).
    11. Peng, Shuxia & Li, Bo & Wu, Shuang, 2023. "Presence of piracy and legal protection: Decisions in the digital goods market under different contracts," European Journal of Operational Research, Elsevier, vol. 309(2), pages 578-596.
    12. Helian Xu & Shiqi Deng, 2024. "Digital Mergers and Acquisitions and Enterprise Innovation Quality: Analysis Based on Research and Development Investment and Overseas Subsidiaries," Sustainability, MDPI, vol. 16(3), pages 1-14, January.
    13. Wojciech Hardy, 2022. "Brace yourselves, pirates are coming! the effects of Game of Thrones leak on TV viewership," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 46(1), pages 27-55, March.
    14. Erdem Dogukan Yilmaz & Tim Meyer & Milan Miric, 2023. "Preventing Others from Commercializing Your Innovation: Evidence from Creative Commons Licenses," Papers 2309.00536, arXiv.org.
    15. Batikas, Michail & Claussen, Jörg & Peukert, Christian, 2019. "Follow the money: Online piracy and self-regulation in the advertising industry," International Journal of Industrial Organization, Elsevier, vol. 65(C), pages 121-151.
    16. Wojciech Hardy & Michal Krawczyk & Joanna Tyrowicz, 2014. "Internet piracy and book sales: A field experiment," Artefactual Field Experiments 00696, The Field Experiments Website.
    17. Frick, Sarah J. & Fletcher, Deborah & Smith, Austin C., 2023. "Pirate and chill: The effect of netflix on illegal streaming," Journal of Economic Behavior & Organization, Elsevier, vol. 209(C), pages 334-347.
    18. Bradley, Wendy A. & Kolev, Julian, 2023. "How does digital piracy affect innovation? Evidence from software firms," Research Policy, Elsevier, vol. 52(3).
    19. Bae Sang Hoo & Yoo Kyeongwon, 2021. "Is Imitation Bad for the Production of Creative Works?," Review of Network Economics, De Gruyter, vol. 19(2), pages 115-144, January.
    20. Tarun Jain & Jishnu Hazra & T. C. Edwin Cheng, 2020. "Illegal Content Monitoring on Social Platforms," Production and Operations Management, Production and Operations Management Society, vol. 29(8), pages 1837-1857, August.
