IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v14y2020i4s1751157720300134.html
   My bibliography  Save this article

How do authors select keywords? A preliminary study of author keyword selection behavior

Author

Listed:
  • Lu, Wei
  • Liu, Zhifeng
  • Huang, Yong
  • Bu, Yi
  • Li, Xin
  • Cheng, Qikai

Abstract

Author keywords for scientific literature are terms selected and created by authors. Although most studies have focused on how to apply author keywords to represent their research interests, little is known about the process of how authors select keywords. To fill this research gap, this study presents a pilot study on author keyword selection behavior. Our empirical results show that the average percentages of author keywords appearing in titles, abstracts, and both titles and abstracts are 31%, 52.1%, and 56.7%, respectively. Meanwhile, we find that keywords also appear in references and high-frequency keywords. The proportions of author-selected keywords appearing in the references and high-frequency keywords are 41.6% and 56.1%, respectively. In addition, keywords of papers written by core authors (productive authors) are found to appear less frequently in titles and abstracts in their papers than that of others, and appear more frequently in references and high-frequency keywords. The percentages of keywords appearing in titles and abstracts in scientific papers are negatively correlated with citation counts of papers. In contrast, the percentages of author keywords appearing in high-frequency keywords are positively associated with citation counts of papers.

Suggested Citation

  • Lu, Wei & Liu, Zhifeng & Huang, Yong & Bu, Yi & Li, Xin & Cheng, Qikai, 2020. "How do authors select keywords? A preliminary study of author keyword selection behavior," Journal of Informetrics, Elsevier, vol. 14(4).
  • Handle: RePEc:eee:infome:v:14:y:2020:i:4:s1751157720300134
    DOI: 10.1016/j.joi.2020.101066
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157720300134
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2020.101066?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Muhammad Farooq & Muhammad Asim & Muhammad Imran & Shahid Imran & Jameel Ahmad & Muhammad Rizwan Younis, 2018. "Mapping past, current and future energy research trend in Pakistan: a scientometric assessment," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1733-1753, December.
    2. Feng Guo & Chao Ma & Qingling Shi & Qingqing Zong, 2018. "Succinct effect or informative effect: the relationship between title length and the number of citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(3), pages 1531-1539, September.
    3. Ya-Ning Chen & Hao-Ren Ke, 2014. "A study on mental models of taggers and experts for article indexing based on analysis of keyword usage," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(8), pages 1675-1694, August.
    4. Uddin, Shahadat & Khan, Arif, 2016. "The impact of author-selected keywords on citation counts," Journal of Informetrics, Elsevier, vol. 10(4), pages 1166-1177.
    5. Isidoro Gil‐Leiva & Adolfo Alonso‐Arroyo, 2007. "Keywords given by authors of scientific articles in database descriptors," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 58(8), pages 1175-1187, June.
    6. Carlos Olmeda-Gómez & Maria-Antonia Ovalle-Perandones & Antonio Perianes-Rodríguez, 2017. "Co-word analysis and thematic landscapes in Spanish information science literature, 1985–2014," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 195-217, October.
    7. Chao Lu & Yi Bu & Jie Wang & Ying Ding & Vetle Torvik & Matthew Schnaars & Chengzhi Zhang, 2019. "Examining scientific writing styles from the perspective of linguistic complexity," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 70(5), pages 462-475, May.
    8. Naser Rashidi & Hussein Meihami, 2018. "Informetrics of Scientometrics abstracts: a rhetorical move analysis of the research abstracts published in Scientometrics journal," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(3), pages 1975-1994, September.
    9. Jiaying Liu & Jiahao Tian & Xiangjie Kong & Ivan Lee & Feng Xia, 2019. "Two decades of information systems: a bibliometric review," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(2), pages 617-643, February.
    10. John Hudson, 2016. "An analysis of the titles of papers submitted to the UK REF in 2014: authors, disciplines, and stylistic details," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(2), pages 871-889, November.
    11. Binglu Wang & Yi Bu & Yang Xu, 2018. "A quantitative exploration on reasons for citing articles from the perspective of cited authors," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 675-687, August.
    12. Lawrence D. Fu & Constantin F. Aliferis, 2010. "Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(1), pages 257-270, October.
    13. S. Lozano & L. Calzada-Infante & B. Adenso-Díaz & S. García, 2019. "Complex network analysis of keywords co-occurrence in the recent efficiency analysis literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(2), pages 609-629, August.
    14. Hao-Ren Ke & Ya-Ning Chen, 2012. "Structure and pattern of social tags for keyword selection behaviors," Scientometrics, Springer;Akadémiai Kiadó, vol. 92(1), pages 43-62, July.
    15. María Pinto & Rosaura Fernández-Pascual & David Caballero-Mariscal & Dora Sales & David Guerrero & Alejandro Uribe, 2019. "Scientific production on mobile information literacy in higher education: a bibliometric analysis (2006–2017)," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 57-85, July.
    16. Munan Li, 2018. "Classifying and ranking topic terms based on a novel approach: role differentiation of author keywords," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(1), pages 77-100, July.
    17. Haunschild, Robin & Leydesdorff, Loet & Bornmann, Lutz & Hellsten, Iina & Marx, Werner, 2019. "Does the public discuss other topics on climate change than researchers? A comparison of explorative networks based on author keywords and hashtags," Journal of Informetrics, Elsevier, vol. 13(2), pages 695-707.
    18. Wei Lu & Yong Huang & Yi Bu & Qikai Cheng, 2018. "Functional structure identification of scientific documents in computer science," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 463-486, April.
    19. Lei Lei & Sheng Yan, 2016. "Readability and citations in information science: evidence from abstracts and articles of four journals (2003–2012)," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(3), pages 1155-1169, September.
    20. Chao Min & Ying Ding & Jiang Li & Yi Bu & Lei Pei & Jianjun Sun, 2018. "Innovation or imitation: The diffusion of citations," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 69(10), pages 1271-1282, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ying Lian & Xiaofeng Lin & Xuefan Dong & Shengjie Hou, 2022. "A Normalized Rich-Club Connectivity-Based Strategy for Keyword Selection in Social Media Analysis," Sustainability, MDPI, vol. 14(13), pages 1-19, June.
    2. Deliu Alexandra & Voicu Bogdan & Glăvan Eugen, 2023. "The COVID-19 pandemic and refugees. A scoping review," Social Change Review, Sciendo, vol. 21(1), pages 1-29, December.
    3. Gerson Pech & Catarina Delgado & Silvio Paolo Sorella, 2022. "Classifying papers into subfields using Abstracts, Titles, Keywords and KeyWords Plus through pattern detection and optimization procedures: An application in Physics," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(11), pages 1513-1528, November.
    4. Mnahel Babgi & Benyamin Aghhavani-Shajari & Marion Vieux, 2023. "Evolution of Digitalization and Internationalization Literature: A Bibliometric and Content Analysis for Future Research Agenda," Post-Print hal-04398156, HAL.
    5. Ziqi Yuan & Guozhu Jia, 2022. "Systematic investigation of keywords selection and processing strategy on search engine forecasting: a case of tourist volume in Beijing," Information Technology & Tourism, Springer, vol. 24(4), pages 547-580, December.
    6. Haoye Sun & Thorsten Teichert, 2024. "Scarcity in today´s consumer markets: scoping the research landscape by author keywords," Management Review Quarterly, Springer, vol. 74(1), pages 93-120, February.
    7. Lázaro Florido-Benítez, 2023. "A Bibliometric Overview of the International Airports and Airlines ‘IAA’ Topic in Journals and Scientific Community," Logistics, MDPI, vol. 7(3), pages 1-25, June.
    8. Rabindra Lamsal & T. V. Vijay Kumar, 2023. "Twitter-Based Disaster Response Using Machine Learning," International Journal of Social Ecology and Sustainable Development (IJSESD), IGI Global, vol. 14(1), pages 1-18, January.
    9. Liang, Zhentao & Ba, Zhichao & Mao, Jin & Li, Gang, 2023. "Research complexity increases with scientists’ academic age: Evidence from library and information science," Journal of Informetrics, Elsevier, vol. 17(1).
    10. Xinyuan Zhang & Qing Xie & Chaemin Song & Min Song, 2022. "Mining the evolutionary process of knowledge through multiple relationships between keywords," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(4), pages 2023-2053, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hu, Ya-Han & Tai, Chun-Tien & Liu, Kang Ernest & Cai, Cheng-Fang, 2020. "Identification of highly-cited papers using topic-model-based and bibliometric features: the consideration of keyword popularity," Journal of Informetrics, Elsevier, vol. 14(1).
    2. Kong, Ling & Wang, Dongbo, 2020. "Comparison of citations and attention of cover and non-cover papers," Journal of Informetrics, Elsevier, vol. 14(4).
    3. Qianjin Zong & Yafen Xie & Rongchan Tuo & Jingshi Huang & Yang Yang, 2019. "The impact of video abstract on citation counts: evidence from a retrospective cohort study of New Journal of Physics," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(3), pages 1715-1727, June.
    4. Gabriele Sampagnaro, 2023. "Keyword occurrences and journal specialization," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(10), pages 5629-5645, October.
    5. T. Liskiewicz & G. Liskiewicz & J. Paczesny, 2021. "Factors affecting the citations of papers in tribology journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3321-3336, April.
    6. Qianjin Zong & Yafen Xie & Jiechun Liang, 2020. "Does open peer review improve citation count? Evidence from a propensity score matching analysis of PeerJ," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 607-623, October.
    7. Shengzhi Huang & Jiajia Qian & Yong Huang & Wei Lu & Yi Bu & Jinqing Yang & Qikai Cheng, 2022. "Disclosing the relationship between citation structure and future impact of a publication," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(7), pages 1025-1042, July.
    8. Raminta Pranckutė, 2021. "Web of Science (WoS) and Scopus: The Titans of Bibliographic Information in Today’s Academic World," Publications, MDPI, vol. 9(1), pages 1-59, March.
    9. Lu, Chao & Bu, Yi & Dong, Xianlei & Wang, Jie & Ding, Ying & Larivière, Vincent & Sugimoto, Cassidy R. & Paul, Logan & Zhang, Chengzhi, 2019. "Analyzing linguistic complexity and scientific impact," Journal of Informetrics, Elsevier, vol. 13(3), pages 817-829.
    10. Song, Ningyuan & Chen, Kejun & Zhao, Yuehua, 2023. "Understanding writing styles of scientific papers in the IS-LS domain: Evidence from abstracts over the past three decades," Journal of Informetrics, Elsevier, vol. 17(1).
    11. Amon, Julian & Hornik, Kurt, 2022. "Is it all bafflegab? – Linguistic and meta characteristics of research articles in prestigious economics journals," Journal of Informetrics, Elsevier, vol. 16(2).
    12. Ma, Chao & Li, Yiwei & Guo, Feng & Si, Kao, 2019. "The citation trap: Papers published at year-end receive systematically fewer citations," Journal of Economic Behavior & Organization, Elsevier, vol. 166(C), pages 667-687.
    13. Umar, Tarik, 2022. "Complexity aversion when SeekingAlpha," Journal of Accounting and Economics, Elsevier, vol. 73(2).
    14. Khalid Ahmed Al-Ansari & Ahmet Faruk Aysan, 2021. "More than ten years of Blockchain creation: How did we use the technology and which direction is the research heading? [Plus de dix ans de création Blockchain : Comment avons-nous utilisé la techno," Working Papers hal-03343048, HAL.
    15. Gao, Qiang & Liang, Zhentao & Wang, Ping & Hou, Jingrui & Chen, Xiuxiu & Liu, Manman, 2021. "Potential index: Revealing the future impact of research topics based on current knowledge networks," Journal of Informetrics, Elsevier, vol. 15(3).
    16. Yating Li & Ye Chen & Qiyu Wang, 2021. "Evolution and diffusion of information literacy topics," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4195-4224, May.
    17. Jianhua Hou & Xiucai Yang & Yang Zhang, 2023. "The effect of social media knowledge cascade: an analysis of scientific papers diffusion," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5169-5195, September.
    18. Weimao Ke, 2013. "A fitness model for scholarly impact analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(3), pages 981-998, March.
    19. Waqar Muhammad Ashraf & Ghulam Moeen Uddin & Syed Muhammad Arafat & Sher Afghan & Ahmad Hassan Kamal & Muhammad Asim & Muhammad Haider Khan & Muhammad Waqas Rafique & Uwe Naumann & Sajawal Gul Niazi &, 2020. "Optimization of a 660 MW e Supercritical Power Plant Performance—A Case of Industry 4.0 in the Data-Driven Operational Management Part 1. Thermal Efficiency," Energies, MDPI, vol. 13(21), pages 1-33, October.
    20. Ju Wen & Lei Lei, 2022. "Adjectives and adverbs in life sciences across 50 years: implications for emotions and readability in academic texts," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(8), pages 4731-4749, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:14:y:2020:i:4:s1751157720300134. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.