IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v129y2024i2d10.1007_s11192-023-04899-9.html
   My bibliography  Save this article

Distributional characteristics of Dimensions concepts: An Empirical Analysis using Zipf’s law

Author

Listed:
  • Solanki Gupta

    (Banaras Hindu University)

  • Vivek Kumar Singh

    (Banaras Hindu University
    University of Delhi)

Abstract

The massive growth in scholarly outputs during the last few decades has resulted into the creation of several scholarly databases to index the outputs. These scholarly databases index publication records and provide different metadata fields for different kinds of usage ranging from retrieval and research evaluation to various scientometric analysis. The ‘author keywords’ is one such important metadata field provided by many databases and used for different text-based and thematic structure analysis. The Dimensions database, however, does not provide ‘author keywords’ metadata field, instead it provides automatically generated terms from the article full texts, called ‘concepts’. Therefore, it is not clear whether different text-based analysis can be done with data provided by Dimensions database. Therefore, this article explores the distributional characteristics of Dimensions concepts. The Dimensions concept data obtained for a sufficiently large sample of scholarly articles is analysed through rank frequency distribution plots in the log–log space. Existence of Zipfian distribution is explored. The results indicate that Dimensions concepts adhere to the Zipfian properties which in turn indicates that Dimensions concepts have similar distributional characteristics as author keywords and hence they may have the same expressive power as that of author or index keywords for scientometric exercises. The study is novel as it is the first study to explore the distributional characteristics of the Dimensions concepts, particularly with respect to Zipfian properties, which provide the statistical foundation for understanding the Dimensions concepts and help to model and analyse them.

Suggested Citation

  • Solanki Gupta & Vivek Kumar Singh, 2024. "Distributional characteristics of Dimensions concepts: An Empirical Analysis using Zipf’s law," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(2), pages 1037-1053, February.
  • Handle: RePEc:spr:scient:v:129:y:2024:i:2:d:10.1007_s11192-023-04899-9
    DOI: 10.1007/s11192-023-04899-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-023-04899-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-023-04899-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Banshal, Sumit Kumar & Gupta, Solanki & Lathabai, Hiran H & Singh, Vivek Kumar, 2022. "Power Laws in altmetrics: An empirical analysis," Journal of Informetrics, Elsevier, vol. 16(3).
    2. Vivek Kumar Singh & Prashasti Singh & Mousumi Karmakar & Jacqueline Leta & Philipp Mayr, 2021. "The journal coverage of Web of Science, Scopus and Dimensions: A comparative analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(6), pages 5113-5142, June.
    3. Michal Brzezinski, 2015. "Power laws in citation distributions: evidence from Scopus," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(1), pages 213-228, April.
    4. Zi-Ke Zhang & Linyuan Lü & Jian-Guo Liu & Tao Zhou, 2008. "Empirical analysis on a keyword-based semantic system," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 66(4), pages 557-561, December.
    5. Juan Zhang & Qi Yu & Fashan Zheng & Chao Long & Zuxun Lu & Zhiguang Duan, 2016. "Comparing keywords plus of WOS and author keywords: A case study of patient adherence research," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(4), pages 967-972, April.
    6. Hiran H. Lathabai & Abhirup Nandy & Vivek Kumar Singh, 2021. "x-index: Identifying core competency and thematic research strengths of institutions using an NLP and network based ranking framework," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9557-9583, December.
    7. Lu, Wei & Liu, Zhifeng & Huang, Yong & Bu, Yi & Li, Xin & Cheng, Qikai, 2020. "How do authors select keywords? A preliminary study of author keyword selection behavior," Journal of Informetrics, Elsevier, vol. 14(4).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gerson Pech & Catarina Delgado & Silvio Paolo Sorella, 2022. "Classifying papers into subfields using Abstracts, Titles, Keywords and KeyWords Plus through pattern detection and optimization procedures: An application in Physics," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(11), pages 1513-1528, November.
    2. Solanki Gupta & Vivek Kumar Singh & Sumit Kumar Banshal, 2024. "Altmetric data quality analysis using Benford’s law," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 4597-4621, July.
    3. Ramona Bran & Laurentiu Tiru & Gabriela Grosseck & Carmen Holotescu & Laura Malita, 2021. "Learning from Each Other—A Bibliometric Review of Research on Information Disorders," Sustainability, MDPI, vol. 13(18), pages 1-39, September.
    4. Jinqing Yang & Zhifeng Liu & Xiufeng Cheng & Guanghui Ye, 2024. "Understanding the keyword adoption behavior patterns of researchers from a functional structure perspective," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(6), pages 3359-3384, June.
    5. de Camargo Fiorini, Paula & Roman Pais Seles, Bruno Michel & Chiappetta Jabbour, Charbel Jose & Barberio Mariano, Enzo & de Sousa Jabbour, Ana Beatriz Lopes, 2018. "Management theory and big data literature: From a review to a research agenda," International Journal of Information Management, Elsevier, vol. 43(C), pages 112-129.
    6. Marek Kwiek & Wojciech Roszka, 2022. "Academic vs. biological age in research on academic careers: a large-scale study with implications for scientifically developing systems," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(6), pages 3543-3575, June.
    7. Vivek Kumar Singh & Abhirup Nandy & Prashasti Singh & Mousumi Karmakar & Aakash Singh & Hiran H. Lathabai & Satya Swarup Srichandan & Anurag Kanaujia, 2022. "Indian Science Reports: a web-based scientometric portal for mapping Indian research competencies at overall and institutional levels," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(7), pages 4227-4236, July.
    8. Sofia Schöbel & Anuschka Schmitt & Dennis Benner & Mohammed Saqr & Andreas Janson & Jan Marco Leimeister, 2024. "Charting the Evolution and Future of Conversational Agents: A Research Agenda Along Five Waves and New Frontiers," Information Systems Frontiers, Springer, vol. 26(2), pages 729-754, April.
    9. BOWERS Dominique & MATLALA Ntswaki & BERHADIEN Moegamat & UMETOR Henry & GONGXEKA Thabo, 2024. "Sustainable Supply Chain Management And Disruptive Theory: A Bibliometric Review," Management of Sustainable Development, Lucian Blaga University of Sibiu, Faculty of Economic Sciences, vol. 16(1), pages 11-23, June.
    10. Klara Fischer & Giulia Vico & Helena Röcklinsberg & Hans Liljenström & Riccardo Bommarco, 2025. "Progress towards sustainable agriculture hampered by siloed scientific discourses," Nature Sustainability, Nature, vol. 8(1), pages 66-74, January.
    11. Zhentao Liang & Jin Mao & Kun Lu & Gang Li, 2021. "Finding citations for PubMed: a large-scale comparison between five freely available bibliographic data sources," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9519-9542, December.
    12. Dušan Nikolić & Dragan Ivanović & Lidija Ivanović, 2024. "An open-source tool for merging data from multiple citation databases," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 4573-4595, July.
    13. Paul Handro & Bogdan Dima, 2024. "Analyzing Financial Markets Efficiency: Insights from a Bibliometric and Content Review," Journal of Financial Studies, Institute of Financial Studies, vol. 16(9), pages 119-175, May.
    14. Antonio De Nicola & Maria Luisa Villani, 2021. "Smart City Ontologies and Their Applications: A Systematic Literature Review," Sustainability, MDPI, vol. 13(10), pages 1-40, May.
    15. Rodrigo Dorantes-Gilardi & Aurora A. Ramírez-Álvarez & Diana Terrazas-Santamaría, 2023. "Is there a differentiated gender effect of collaboration with super-cited authors? Evidence from junior researchers in economics," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(4), pages 2317-2336, April.
    16. Wai Ming To & Billy T. W. Yu, 2025. "Artificial Intelligence Research in Tourism and Hospitality Journals: Trends, Emerging Themes, and the Rise of Generative AI," Tourism and Hospitality, MDPI, vol. 6(2), pages 1-20, April.
    17. Giuseppe Craparo & Elisa Isabel Cano Montero & Jesús Fernando Santos Peñalver, 2024. "Trends in the circular economy applied to the agricultural sector in the framework of the SDGs," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 26(10), pages 26699-26729, October.
    18. Thøgersen, John, 2023. "How does origin labelling on food packaging influence consumer product evaluation and choices? A systematic literature review," Food Policy, Elsevier, vol. 119(C).
    19. Dewan F. Wahid & Elkafi Hassini, 2022. "A Literature Review on Correlation Clustering: Cross-disciplinary Taxonomy with Bibliometric Analysis," SN Operations Research Forum, Springer, vol. 3(3), pages 1-42, September.
    20. Caputo, Andrea & Pizzi, Simone & Pellegrini, Massimiliano M. & Dabić, Marina, 2021. "Digitalization and business models: Where are we going? A science map of the field," Journal of Business Research, Elsevier, vol. 123(C), pages 489-501.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:129:y:2024:i:2:d:10.1007_s11192-023-04899-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.