IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v123y2020i2d10.1007_s11192-020-03414-8.html
   My bibliography  Save this article

Feasibility of activity-based expert profiling using text mining of scientific publications and patents

Author

Listed:
  • Mark Bukowski

    (University Hospital Aachen, RWTH Aachen University)

  • Sandra Geisler

    (Fraunhofer Institute for Applied Information Technology FIT)

  • Thomas Schmitz-Rode

    (University Hospital Aachen, RWTH Aachen University)

  • Robert Farkas

    (University Hospital Aachen, RWTH Aachen University)

Abstract

Research and development (R&D) in many technological areas is characterized by growing complexity. In biomedical engineering, too, interdisciplinary collaboration is regarded as a promising way to master this challenge. Therefore, identifying suitable experts becomes crucial, which is currently being researched, amongst others, by analyzing semantic data. However, previous approaches lack clarity and traceability of the mechanisms for compiling top-n lists of recommended experts, as domain specificity in profiling is insufficient. Moreover, these recommenders are mainly based on scientific publications, while patents are rarely considered as an important outcome of R&D. Thus, we study the feasibility of profiling 16 biomedical engineering experts using both publications and patents. These documents are automatically labeled according to a three-dimensional domain model by machine learning-based classifiers. On this basis, we created various activity-based representations, including author-contribution-weighting. We evaluated the profiling through self- and external-assessments and tested the recommendation compared to scientometric measures in three case studies. All interviewed experts identify themselves among 10 pseudonymous profiles and 96% of all 51 external-assignments are correct. The recommendation over three case studies reaches a high mean average precision of 89% and contrasts with the use of scientometric measures (41%). Moreover, the activity based on patents primarily corresponds to that of publications but patents also introduce new activities. The author-contribution-weighting improves the performance. In conclusion, our findings show that exploiting publications and patents enables comprehensible profiling of biomedical engineering experts that allows visual comparisons and clear selection and ranking of potential R&D collaboration partners along the translational value chain.

Suggested Citation

  • Mark Bukowski & Sandra Geisler & Thomas Schmitz-Rode & Robert Farkas, 2020. "Feasibility of activity-based expert profiling using text mining of scientific publications and patents," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(2), pages 579-620, May.
  • Handle: RePEc:spr:scient:v:123:y:2020:i:2:d:10.1007_s11192-020-03414-8
    DOI: 10.1007/s11192-020-03414-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-020-03414-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-020-03414-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Guan, Jiancheng & Liu, Na, 2016. "Exploitative and exploratory innovations in knowledge network and collaboration network: A patent analysis in the technological field of nano-energy," Research Policy, Elsevier, vol. 45(1), pages 97-112.
    2. Lissoni, Francesco & Montobbio, Fabio & Zirulia, Lorenzo, 2013. "Inventorship and authorship as attribution rights: An enquiry into the economics of scientific credit," Journal of Economic Behavior & Organization, Elsevier, vol. 95(C), pages 49-69.
    3. Anthony F. J. Raan & Jos J. Winnink, 2018. "Do younger Sleeping Beauties prefer a technological prince?," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(2), pages 701-717, February.
    4. Leonardo Costa Ribeiro & Márcia Siqueira Rapini & Leandro Alves Silva & Eduardo Motta Albuquerque, 2018. "Growth patterns of the network of international collaboration in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(1), pages 159-179, January.
    5. Richard Berendsen & Maarten Rijke & Krisztian Balog & Toine Bogers & Antal Bosch, 2013. "On the assessment of expertise profiles," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 64(10), pages 2024-2044, October.
    6. Anne-Wil Harzing & Satu Alakangas, 2016. "Google Scholar, Scopus and the Web of Science: a longitudinal and cross-disciplinary comparison," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(2), pages 787-804, February.
    7. Paula Stephan & Reinhilde Veugelers & Jian Wang, 2017. "Reviewers are blinkered by bibliometrics," Nature, Nature, vol. 544(7651), pages 411-412, April.
    8. Richard Berendsen & Maarten de Rijke & Krisztian Balog & Toine Bogers & Antal van den Bosch, 2013. "On the assessment of expertise profiles," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 64(10), pages 2024-2044, October.
    9. Anthony F. J. van Raan, 2004. "Sleeping Beauties in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 59(3), pages 467-472, March.
    10. Denisa Mindruta, 2013. "Value creation in university-firm research collaborations: A matching approach," Post-Print hal-00818682, HAL.
    11. Rinze Benedictus & Frank Miedema & Mark W. J. Ferguson, 2016. "Fewer numbers, better science," Nature, Nature, vol. 538(7626), pages 453-455, October.
    12. Simon J. D. Schillebeeckx & Yimin Lin & Gerard George, 2019. "When Do Expert Teams Fail to Create Impactful Inventions?," Journal of Management Studies, Wiley Blackwell, vol. 56(6), pages 1073-1104, September.
    13. Haeussler, Carolin & Sauermann, Henry, 2013. "Credit where credit is due? The impact of project contributions and social factors on authorship and inventorship," Research Policy, Elsevier, vol. 42(3), pages 688-703.
    14. Bercovitz, Janet & Feldman, Maryann, 2011. "The mechanisms of collaboration in inventive teams: Composition, social networks, and geography," Research Policy, Elsevier, vol. 40(1), pages 81-93, February.
    15. Shaobo Li & Jie Hu & Yuxin Cui & Jianjun Hu, 2018. "DeepPatent: patent classification with convolutional neural networks and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 721-744, November.
    16. Kuan, Chung-Huei & Huang, Mu-Hsuan & Chen, Dar-Zen, 2011. "Ranking patent assignee performance by h-index and shape descriptors," Journal of Informetrics, Elsevier, vol. 5(2), pages 303-312.
    17. Ding, Ying, 2011. "Scientific collaboration and endorsement: Network analysis of coauthorship and citation networks," Journal of Informetrics, Elsevier, vol. 5(1), pages 187-203.
    18. Thushari Silva & Jian Ma & Chen Yang & Haidan Liang, 2015. "A profile-boosted research analytics framework to recommend journals for manuscripts," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 66(1), pages 180-200, January.
    19. Lin Zhu & Donghua Zhu & Xuefeng Wang & Scott W. Cunningham & Zhinan Wang, 2019. "An integrated solution for detecting rising technology stars in co-inventor networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 137-172, October.
    20. Diana Hicks & Paul Wouters & Ludo Waltman & Sarah de Rijcke & Ismael Rafols, 2015. "Bibliometrics: The Leiden Manifesto for research metrics," Nature, Nature, vol. 520(7548), pages 429-431, April.
    21. Jian Cheng Guan & Xia Gao, 2009. "Exploring the h‐index at patent level," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(1), pages 35-40, January.
    22. Teja Tscharntke & Michael E Hochberg & Tatyana A Rand & Vincent H Resh & Jochen Krauss, 2007. "Author Sequence and Credit for Contributions in Multiauthored Publications," PLOS Biology, Public Library of Science, vol. 5(1), pages 1-2, January.
    23. Denisa Mindruta, 2013. "Value creation in university-firm research collaborations: A matching approach," Strategic Management Journal, Wiley Blackwell, vol. 34(6), pages 644-665, June.
    24. Jianlin Zhou & An Zeng & Ying Fan & Zengru Di, 2018. "The representative works of scientists," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1721-1732, December.
    25. Andi Rexha & Mark Kröll & Hermann Ziak & Roman Kern, 2018. "Authorship identification of documents with high content similarity," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 223-237, April.
    26. Jean O. Lanjouw & Mark Schankerman, 2004. "Patent Quality and Research Productivity: Measuring Innovation with Multiple Indicators," Economic Journal, Royal Economic Society, vol. 114(495), pages 441-465, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carolin Haeussler & Henry Sauermann, 2016. "The Division of Labor in Teams: A Conceptual Framework and Application to Collaborations in Science," NBER Working Papers 22241, National Bureau of Economic Research, Inc.
    2. Simon J. D. Schillebeeckx & Sankalp Chaturvedi & Gerard George & Zella King, 2016. "What do i want? The effects of individual aspiration and relational capability on collaboration preferences," Strategic Management Journal, Wiley Blackwell, vol. 37(7), pages 1493-1506, July.
    3. Xiancheng Li & Luca Verginer & Massimo Riccaboni & P. Panzarasa, 2022. "A network approach to expertise retrieval based on path similarity and credit allocation," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 17(2), pages 501-533, April.
    4. Mors, Marie Louise & Waguespack, David M., 2021. "Fast success and slow failure: The process speed of dispersed research teams," Research Policy, Elsevier, vol. 50(5).
    5. Slavova, Kremena & Jong, Simcha, 2021. "University alliances and firm exploratory innovation: Evidence from therapeutic product development," Technovation, Elsevier, vol. 107(C).
    6. Jonathan P. Tennant & Harry Crane & Tom Crick & Jacinto Davila & Asura Enkhbayar & Johanna Havemann & Bianca Kramer & Ryan Martin & Paola Masuzzo & Andy Nobes & Curt Rice & Bárbara Rivera-López & Tony, 2019. "Ten Hot Topics around Scholarly Publishing," Publications, MDPI, vol. 7(2), pages 1-24, May.
    7. Goossen, Martin C. & Paruchuri, Srikanth, 2022. "Measurement errors and estimation biases with incomplete social networks: replication studies on intra-firm inventor network analysis," Research Policy, Elsevier, vol. 51(1).
    8. Guoli Chen & Sterling Huang & Philipp Meyer‐Doyle & Denisa Mindruta, 2021. "Generalist versus specialist CEOs and acquisitions: Two‐sided matching and the impact of CEO characteristics on firm outcomes," Strategic Management Journal, Wiley Blackwell, vol. 42(6), pages 1184-1214, June.
    9. Michele O’Dwyer & Raffaele Filieri & Lisa O’Malley, 2023. "Establishing successful university–industry collaborations: barriers and enablers deconstructed," The Journal of Technology Transfer, Springer, vol. 48(3), pages 900-931, June.
    10. Jun Hong Park & Hyunseog Chung & Ki Hong Kim & Jin Ju Kim & Chulung Lee, 2021. "The Impact of Technological Capability on Financial Performance in the Semiconductor Industry," Sustainability, MDPI, vol. 13(2), pages 1-20, January.
    11. Yi Zhang & Kaihua Chen & Guilong Zhu & Richard C. M. Yam & Jiancheng Guan, 2016. "Inter-organizational scientific collaborations and policy effects: an ego-network evolutionary perspective of the Chinese Academy of Sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(3), pages 1383-1415, September.
    12. Zhai, Li & Yan, Xiangbin, 2022. "A directed collaboration network for exploring the order of scientific collaboration," Journal of Informetrics, Elsevier, vol. 16(4).
    13. Wildgaard, Lorna, 2016. "A critical cluster analysis of 44 indicators of author-level performance," Journal of Informetrics, Elsevier, vol. 10(4), pages 1055-1078.
    14. Helena H. Zhang & Fred Y. Ye, 2020. "Identifying ‘associated-sleeping-beauties’ in ‘swan-groups’ based on small qualified datasets of physics and economics," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(3), pages 1525-1537, March.
    15. Lanu Kim & Jason H. Portenoy & Jevin D. West & Katherine W. Stovel, 2020. "Scientific journals still matter in the era of academic search engines and preprint archives," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 71(10), pages 1218-1226, October.
    16. Gita Ghiasi & Matthew Harsh & Andrea Schiffauerova, 2018. "Inequality and collaboration patterns in Canadian nanotechnology: implications for pro-poor and gender-inclusive policy," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 785-815, May.
    17. Mladen Djuric & Marina Dobrota & Jovan Filipovic, 2020. "Complexity-based quality indicators for human and social capital in science and research: the case of Serbian Homeland versus Diaspora," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 303-328, July.
    18. Damioli, G. & Van Roy, V. & Vertesy, D. & Vivarelli, M., 2021. "May AI revolution be labour-friendly? Some micro evidence from the supply side," GLO Discussion Paper Series 823, Global Labor Organization (GLO).
    19. Elizabeth Troncoso & Daniel A. López & René Ruby-Figueroa & Dieter Koch & Ricardo Reich, 2024. "Does Quality Matter? Quality Assurance in Research for the Chilean Higher Education System," Publications, MDPI, vol. 12(1), pages 1-20, February.
    20. Nina Su & Zhuqin Shi & Xianqi Zhu & Yunsheng Xin, 2021. "An Evolutionary Game Model of Collaborative Innovation Between Enterprises and Colleges Under Government Participation of China," SAGE Open, , vol. 11(1), pages 21582440219, February.

    More about this item

    Keywords

    Supervised learning; Biomedical engineering domain model; Translational value chain; Research evaluation; Author contribution; Domain-specific recommendation; Self- and external-assessment;
    All these keywords.

    JEL classification:

    • O32 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Management of Technological Innovation and R&D
    • I19 - Health, Education, and Welfare - - Health - - - Other
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • C89 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Other
    • C99 - Mathematical and Quantitative Methods - - Design of Experiments - - - Other

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:123:y:2020:i:2:d:10.1007_s11192-020-03414-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.