IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v15y2021i2s1751157721000067.html
   My bibliography  Save this article

An improved patent similarity measurement based on entities and semantic relations

Author

Listed:
  • An, Xin
  • Li, Jinghong
  • Xu, Shuo
  • Chen, Liang
  • Sun, Wei

Abstract

Patent similarity measurement, as one of the fundamental building blocks for patent analysis, is able to derive technical intelligence efficiently, but also can detect the risk of infringement and evaluate whether the invention meets the criteria of novelty and innovation. However, traditional approaches make implicitly several assumptions, such as bag of words in each component, semantic direction irrelevance and so on. In order to relax these assumptions, this study proposes an improved methodology on the basis of entities and semantic relations (functional and non-functional relations), which takes semantic direction of each sequence structure and the word order information of each component into consideration. Meanwhile, an algorithm for calculating the global importance of each sequence structure is put forward. Finally, to verify the effectiveness and performance of the improved semantic analysis, a case study is conducted on the thin film head subfield in the field of hard disk drive. Extensive experimental results show that our approach is significantly more accurate.

Suggested Citation

  • An, Xin & Li, Jinghong & Xu, Shuo & Chen, Liang & Sun, Wei, 2021. "An improved patent similarity measurement based on entities and semantic relations," Journal of Informetrics, Elsevier, vol. 15(2).
  • Handle: RePEc:eee:infome:v:15:y:2021:i:2:s1751157721000067
    DOI: 10.1016/j.joi.2021.101135
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157721000067
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2021.101135?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Janghyeok Yoon & Hyunseok Park & Kwangsoo Kim, 2013. "Identifying technological competition trends for R&D planning using dynamic patent maps: SAO-based content analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(1), pages 313-331, January.
    2. Xuefeng Wang & Huichao Ren & Yun Chen & Yuqin Liu & Yali Qiao & Ying Huang, 2019. "Measuring patent similarity with SAO semantic analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 1-23, October.
    3. An, Jaehyeong & Kim, Kyuwoong & Mortara, Letizia & Lee, Sungjoo, 2018. "Deriving technology intelligence from patents: Preposition-based semantic analysis," Journal of Informetrics, Elsevier, vol. 12(1), pages 217-236.
    4. Liang Chen & Shuo Xu & Lijun Zhu & Jing Zhang & Xiaoping Lei & Guancan Yang, 2020. "A deep learning based method for extracting semantic information from patent documents," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 289-312, October.
    5. Sam Arts & Bruno Cassiman & Juan Carlos Gomez, 2018. "Text matching to measure patent similarity," Strategic Management Journal, Wiley Blackwell, vol. 39(1), pages 62-84, January.
    6. Yuhang Yang & Qin Lu & Tiejun Zhao, 2010. "A delimiter‐based general approach for Chinese term extraction," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(1), pages 111-125, January.
    7. Andrew Rodriguez & Byunghoon Kim & Mehmet Turkoz & Jae-Min Lee & Byoung-Youl Coh & Myong K. Jeong, 2015. "New multi-stage similarity measure for calculation of pairwise patent similarity in a patent citation network," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 565-581, May.
    8. Zhang, Yi & Shang, Lining & Huang, Lu & Porter, Alan L. & Zhang, Guangquan & Lu, Jie & Zhu, Donghua, 2016. "A hybrid similarity measure method for patent portfolio analysis," Journal of Informetrics, Elsevier, vol. 10(4), pages 1108-1130.
    9. Janghyeok Yoon & Kwangsoo Kim, 2011. "Identifying rapidly evolving technological trends for R&D planning using SAO-based semantic patent networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 213-228, July.
    10. Janghyeok Yoon & Kwangsoo Kim, 2012. "Detecting signals of new technological opportunities using semantic patent analysis and outlier detection," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 445-461, February.
    11. Hyunseok Park & Janghyeok Yoon & Kwangsoo Kim, 2012. "Identifying patent infringement using SAO based semantic technological similarities," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 515-529, February.
    12. Henry Small, 1973. "Co‐citation in the scientific literature: A new measure of the relationship between two documents," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 24(4), pages 265-269, July.
    13. Yuhang Yang & Qin Lu & Tiejun Zhao, 2010. "A delimiter-based general approach for Chinese term extraction," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(1), pages 111-125, January.
    14. Shuo Xu & Junwan Liu & Dongsheng Zhai & Xin An & Zheng Wang & Hongshen Pang, 2018. "Overlapping thematic structures extraction with mixed-membership stochastic blockmodel," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 61-84, October.
    15. Shuo Xu & Dongsheng Zhai & Feifei Wang & Xin An & Hongshen Pang & Yirong Sun, 2019. "A novel method for topic linkages between scientific publications and patents," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 70(9), pages 1026-1042, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wei Du & Yibo Wang & Wei Xu & Jian Ma, 2021. "A personalized recommendation system for high-quality patent trading by leveraging hybrid patent analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9369-9391, December.
    2. Shuo Xu & Ling Li & Xin An & Liyuan Hao & Guancan Yang, 2021. "An approach for detecting the commonality and specialty between scientific publications and patents," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(9), pages 7445-7475, September.
    3. Shicheng Tan & Tao Zhang & Shu Zhao & Yanping Zhang, 2023. "Self-supervised scientific document recommendation based on contrastive learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5027-5049, September.
    4. Chen, Liang & Xu, Shuo & Zhu, Lijun & Zhang, Jing & Yang, Guancan & Xu, Haiyun, 2022. "A deep learning based method benefiting from characteristics of patents for semantic relation classification," Journal of Informetrics, Elsevier, vol. 16(3).
    5. Guangtong Li & L. Siddharth & Jianxi Luo, 2023. "Embedding knowledge graph of patent metadata to measure knowledge proximity," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(4), pages 476-490, April.
    6. Jaewoong Choi & Jiho Lee & Janghyeok Yoon & Sion Jang & Jaeyoung Kim & Sungchul Choi, 2022. "A two-stage deep learning-based system for patent citation recommendation," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6615-6636, November.
    7. Jee, Jeonghun & Park, Sanghyun & Lee, Sungjoo, 2022. "Potential of patent image data as technology intelligence source," Journal of Informetrics, Elsevier, vol. 16(2).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Liang & Xu, Shuo & Zhu, Lijun & Zhang, Jing & Yang, Guancan & Xu, Haiyun, 2022. "A deep learning based method benefiting from characteristics of patents for semantic relation classification," Journal of Informetrics, Elsevier, vol. 16(3).
    2. Hain, Daniel S. & Jurowetzki, Roman & Buchmann, Tobias & Wolf, Patrick, 2022. "A text-embedding-based approach to measuring patent-to-patent technological similarity," Technological Forecasting and Social Change, Elsevier, vol. 177(C).
    3. Xuefeng Wang & Huichao Ren & Yun Chen & Yuqin Liu & Yali Qiao & Ying Huang, 2019. "Measuring patent similarity with SAO semantic analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 1-23, October.
    4. Liu, Zhenfeng & Feng, Jian & Uden, Lorna, 2023. "Technology opportunity analysis using hierarchical semantic networks and dual link prediction," Technovation, Elsevier, vol. 128(C).
    5. Xu, Shuo & Hao, Liyuan & Yang, Guancan & Lu, Kun & An, Xin, 2021. "A topic models based framework for detecting and forecasting emerging technologies," Technological Forecasting and Social Change, Elsevier, vol. 162(C).
    6. Hofmann, Peter & Keller, Robert & Urbach, Nils, 2019. "Inter-technology relationship networks: Arranging technologies through text mining," Technological Forecasting and Social Change, Elsevier, vol. 143(C), pages 202-213.
    7. Yang, Chao & Huang, Cui & Su, Jun, 2018. "An improved SAO network-based method for technology trend analysis: A case study of graphene," Journal of Informetrics, Elsevier, vol. 12(1), pages 271-286.
    8. Hyunseok Park & Janghyeok Yoon & Kwangsoo Kim, 2013. "Identification and evaluation of corporations for merger and acquisition strategies using patent information and text mining," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(3), pages 883-909, December.
    9. Jungpyo Lee & So Young Sohn, 2021. "Recommendation system for technology convergence opportunities based on self-supervised representation learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 1-25, January.
    10. Yoon, Janghyeok & Park, Hyunseok & Seo, Wonchul & Lee, Jae-Min & Coh, Byoung-youl & Kim, Jonghwa, 2015. "Technology opportunity discovery (TOD) from existing technologies and products: A function-based TOD framework," Technological Forecasting and Social Change, Elsevier, vol. 100(C), pages 153-167.
    11. Kyuwoong Kim & Kyeongmin Park & Sungjoo Lee, 2019. "Investigating technology opportunities: the use of SAOx analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(1), pages 45-70, January.
    12. Ren, Haiying & Zhao, Yuhui, 2021. "Technology opportunity discovery based on constructing, evaluating, and searching knowledge networks," Technovation, Elsevier, vol. 101(C).
    13. Guo, Junfang & Wang, Xuefeng & Li, Qianrui & Zhu, Donghua, 2016. "Subject–action–object-based morphology analysis for determining the direction of technological change," Technological Forecasting and Social Change, Elsevier, vol. 105(C), pages 27-40.
    14. Liang Chen & Shuo Xu & Lijun Zhu & Jing Zhang & Xiaoping Lei & Guancan Yang, 2020. "A deep learning based method for extracting semantic information from patent documents," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 289-312, October.
    15. Farshad Madani, 2015. "‘Technology Mining’ bibliometrics analysis: applying network analysis and cluster analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(1), pages 323-335, October.
    16. Hanlin You & Mengjun Li & Jiang Jiang & Bingfeng Ge & Xueting Zhang, 2017. "Evolution monitoring for innovation sources using patent cluster analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 693-715, May.
    17. Jiwon Yu & Jong-Gyu Hwang & Jumi Hwang & Sung Chan Jun & Sumin Kang & Chulung Lee & Hyundong Kim, 2020. "Identification of Vacant and Emerging Technologies in Smart Mobility Through the GTM-Based Patent Map Development," Sustainability, MDPI, vol. 12(22), pages 1-22, November.
    18. Janghyeok Yoon & Hyunseok Park & Kwangsoo Kim, 2013. "Identifying technological competition trends for R&D planning using dynamic patent maps: SAO-based content analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(1), pages 313-331, January.
    19. José Manuel López‐Fernández & Mariluz Maté‐Sánchez‐Val & Francisco Manuel Somohano‐Rodriguez, 2021. "The effect of micro‐territorial networks on industrial small and medium enterprises' innovation: A case study in the Spanish region of Cantabria," Papers in Regional Science, Wiley Blackwell, vol. 100(1), pages 51-77, February.
    20. Changyong Lee & Gyumin Lee, 2019. "Technology opportunity analysis based on recombinant search: patent landscape analysis for idea generation," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 603-632, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:15:y:2021:i:2:s1751157721000067. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.