IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v103y2015i2d10.1007_s11192-015-1531-8.html
   My bibliography  Save this article

New multi-stage similarity measure for calculation of pairwise patent similarity in a patent citation network

Author

Listed:
  • Andrew Rodriguez

    (Rutgers University)

  • Byunghoon Kim

    (Rutgers University)

  • Mehmet Turkoz

    (Rutgers University)

  • Jae-Min Lee

    (Korea Institute of Science and Technology Information)

  • Byoung-Youl Coh

    (Korea Institute of Science and Technology Information)

  • Myong K. Jeong

    (Rutgers University)

Abstract

Being able to effectively measure similarity between patents in a complex patent citation network is a crucial task in understanding patent relatedness. In the past, techniques such as text mining and keyword analysis have been applied for patent similarity calculation. The drawback of these approaches is that they depend on word choice and writing style of authors. Most existing graph-based approaches use common neighbor-based measures, which only consider direct adjacency. In this work we propose new similarity measures for patents in a patent citation network using only the patent citation network structure. The proposed similarity measures leverage direct and indirect co-citation links between patents. A challenge is when some patents receive a large number of citations, thus are considered more similar to many other patents in the patent citation network. To overcome this challenge, we propose a normalization technique to account for the case where some pairs are ranked very similar to each other because they both are cited by many other patents. We validate our proposed similarity measures using US class codes for US patents and the well-known Jaccard similarity index. Experiments show that the proposed methods perform well when compared to the Jaccard similarity index.

Suggested Citation

  • Andrew Rodriguez & Byunghoon Kim & Mehmet Turkoz & Jae-Min Lee & Byoung-Youl Coh & Myong K. Jeong, 2015. "New multi-stage similarity measure for calculation of pairwise patent similarity in a patent citation network," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 565-581, May.
  • Handle: RePEc:spr:scient:v:103:y:2015:i:2:d:10.1007_s11192-015-1531-8
    DOI: 10.1007/s11192-015-1531-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-015-1531-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-015-1531-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gamal Atallah & Gabriel Rodríguez, 2006. "Indirect patent citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 67(3), pages 437-465, June.
    2. Byunghoon Kim & Gianluca Gazzola & Jae-Min Lee & Dohyun Kim & Kanghoe Kim & Myong K. Jeong, 2014. "Inter-cluster connectivity analysis for technology opportunity discovery," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(3), pages 1811-1825, March.
    3. M. M. Kessler, 1963. "Bibliographic coupling between scientific papers," American Documentation, Wiley Blackwell, vol. 14(1), pages 10-25, January.
    4. von Wartburg, Iwan & Teichert, Thorsten & Rost, Katja, 2005. "Inventive progress measured by multi-stage patent citation analysis," Research Policy, Elsevier, vol. 34(10), pages 1591-1607, December.
    5. Leo Egghe & Ronald Rousseau, 2002. "Co-citation, bibliographic coupling and a characterization of lattice citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 55(3), pages 349-361, November.
    6. Henry Small, 1973. "Co‐citation in the scientific literature: A new measure of the relationship between two documents," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 24(4), pages 265-269, July.
    7. Euiseok Kim & Yongrae Cho & Wonjoon Kim, 2014. "Dynamic patterns of technological convergence in printed electronics technologies: patent citation network," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(2), pages 975-998, February.
    8. Gnyawali, Devi R. & Park, Byung-Jin (Robert), 2011. "Co-opetition between giants: Collaboration with competitors for technological innovation," Research Policy, Elsevier, vol. 40(5), pages 650-663, June.
    9. Martin G. Moehrle & Jan M. Gerken, 2012. "Measuring textual patent similarity on the basis of combined concepts: design decisions and their consequences," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(3), pages 805-826, June.
    10. Breschi, Stefano & Lissoni, Francesco & Malerba, Franco, 2003. "Knowledge-relatedness in firm technological diversification," Research Policy, Elsevier, vol. 32(1), pages 69-87, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hanlin You & Mengjun Li & Jiang Jiang & Bingfeng Ge & Xueting Zhang, 2017. "Evolution monitoring for innovation sources using patent cluster analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 693-715, May.
    2. Jeon, Eunji & Yoon, Naeun & Sohn, So Young, 2023. "Exploring new digital therapeutics technologies for psychiatric disorders using BERTopic and PatentSBERTa," Technological Forecasting and Social Change, Elsevier, vol. 186(PA).
    3. Hanlin You & Mengjun Li & Keith W. Hipel & Jiang Jiang & Bingfeng Ge & Hante Duan, 2017. "Development trend forecasting for coherent light generator technology based on patent citation network analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 297-315, April.
    4. Joon Hyung Cho & Jungpyo Lee & So Young Sohn, 2021. "Predicting future technological convergence patterns based on machine learning using link prediction," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5413-5429, July.
    5. Kim, Tae San & Sohn, So Young, 2020. "Machine-learning-based deep semantic analysis approach for forecasting new technology convergence," Technological Forecasting and Social Change, Elsevier, vol. 157(C).
    6. Jiang, Hongxun & Fan, Shaokun & Zhang, Nan & Zhu, Bin, 2023. "Deep learning for predicting patent application outcome: The fusion of text and network embeddings," Journal of Informetrics, Elsevier, vol. 17(2).
    7. An, Xin & Li, Jinghong & Xu, Shuo & Chen, Liang & Sun, Wei, 2021. "An improved patent similarity measurement based on entities and semantic relations," Journal of Informetrics, Elsevier, vol. 15(2).
    8. Byunghoon Kim & Gianluca Gazzola & Jaekyung Yang & Jae-Min Lee & Byoung-Youl Coh & Myong K. Jeong & Young-Seon Jeong, 2017. "Two-phase edge outlier detection method for technology opportunity discovery," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 1-16, October.
    9. Hain, Daniel S. & Jurowetzki, Roman & Buchmann, Tobias & Wolf, Patrick, 2022. "A text-embedding-based approach to measuring patent-to-patent technological similarity," Technological Forecasting and Social Change, Elsevier, vol. 177(C).
    10. Juhyun Lee & Sangsung Park & Jiho Kang, 2021. "Introducing Patents with Indirect Connection (PIC) for Establishing Patent Strategies," Sustainability, MDPI, vol. 13(2), pages 1-15, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guan-Can Yang & Gang Li & Chun-Ya Li & Yun-Hua Zhao & Jing Zhang & Tong Liu & Dar-Zen Chen & Mu-Hsuan Huang, 2015. "Using the comprehensive patent citation network (CPC) to evaluate patent value," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1319-1346, December.
    2. Osmo Kuusi & Martin Meyer, 2007. "Anticipating technological breakthroughs: Using bibliographic coupling to explore the nanotubes paradigm," Scientometrics, Springer;Akadémiai Kiadó, vol. 70(3), pages 759-777, March.
    3. Chen, Dar-Zen & Huang, Mu-Hsuan & Hsieh, Hui-Chen & Lin, Chang-Pin, 2011. "Identifying missing relevant patent citation links by using bibliographic coupling in LED illuminating technology," Journal of Informetrics, Elsevier, vol. 5(3), pages 400-412.
    4. Kuan, Chung-Huei & Chen, Dar-Zen & Huang, Mu-Hsuan, 2019. "Bibliographically coupled patents: Their temporal pattern and combined relevance," Journal of Informetrics, Elsevier, vol. 13(4).
    5. Liu, Yunmei & Yang, Liu & Chen, Min, 2021. "A new citation concept: Triangular citation in the literature," Journal of Informetrics, Elsevier, vol. 15(2).
    6. Icaro Romolo Sousa Agostino & Wesley Vieira da Silva & Claudimar Pereira da Veiga & Adriano Mendonça Souza, 2020. "Forecasting models in the manufacturing processes and operations management: Systematic literature review," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(7), pages 1043-1056, November.
    7. Yun, Jinhyuk & Ahn, Sejung & Lee, June Young, 2020. "Return to basics: Clustering of scientific literature using structural information," Journal of Informetrics, Elsevier, vol. 14(4).
    8. Takano, Yasutomo & Kajikawa, Yuya, 2019. "Extracting commercialization opportunities of the Internet of Things: Measuring text similarity between papers and patents," Technological Forecasting and Social Change, Elsevier, vol. 138(C), pages 45-68.
    9. Hofmann, Peter & Keller, Robert & Urbach, Nils, 2019. "Inter-technology relationship networks: Arranging technologies through text mining," Technological Forecasting and Social Change, Elsevier, vol. 143(C), pages 202-213.
    10. Mu-hsuan Huang & Chia-Pin Chang, 2015. "A comparative study on detecting research fronts in the organic light-emitting diode (OLED) field using bibliographic coupling and co-citation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2041-2057, March.
    11. Tahamtan, Iman & Bornmann, Lutz, 2018. "Creativity in science and the link to cited references: Is the creative potential of papers reflected in their cited references?," Journal of Informetrics, Elsevier, vol. 12(3), pages 906-930.
    12. Yun, Jinhyuk, 2022. "Generalization of bibliographic coupling and co-citation using the node split network," Journal of Informetrics, Elsevier, vol. 16(2).
    13. Hsi-Yin Yeh & Yi-Shan Sung & Hsiao-Wen Yang & Wan-Chu Tsai & Dar-Zen Chen, 2013. "The bibliographic coupling approach to filter the cited and uncited patent citations: a case of electric vehicle technology," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(1), pages 75-93, January.
    14. Nassiri, Isar & Masoudi-Nejad, Ali & Jalili, Mahdi & Moeini, Ali, 2013. "Normalized Similarity Index: An adjusted index to prioritize article citations," Journal of Informetrics, Elsevier, vol. 7(1), pages 91-98.
    15. Chihmao Hsieh, 2011. "Explicitly searching for useful inventions: dynamic relatedness and the costs of connecting versus synthesizing," Scientometrics, Springer;Akadémiai Kiadó, vol. 86(2), pages 381-404, February.
    16. Piñeiro-Chousa, Juan & López-Cabarcos, M. Ángeles & Romero-Castro, Noelia María & Pérez-Pico, Ada María, 2020. "Innovation, entrepreneurship and knowledge in the business scientific field: Mapping the research front," Journal of Business Research, Elsevier, vol. 115(C), pages 475-485.
    17. Su, Hsin-Ning & Moaniba, Igam M., 2017. "Investigating the dynamics of interdisciplinary evolution in technology developments," Technological Forecasting and Social Change, Elsevier, vol. 122(C), pages 12-23.
    18. David N. Matzig & Clemens Schmid & Felix Riede, 2023. "Mapping the field of cultural evolutionary theory and methods in archaeology using bibliometric methods," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-17, December.
    19. Rey-Long Liu, 2017. "A new bibliographic coupling measure with descriptive capability," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(2), pages 915-935, February.
    20. Lilian Cervo Cabrera & Carlos Eduardo Caldarelli & Marcia Regina Gabardo Camara, 2020. "Mapping collaboration in international coffee certification research," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(3), pages 2597-2618, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:103:y:2015:i:2:d:10.1007_s11192-015-1531-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.