IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v114y2018i3d10.1007_s11192-017-2618-1.html
   My bibliography  Save this article

Predicting scientific impact based on h-index

Author

Listed:
  • Samreen Ayaz

    (Capital University of Science & Technology)

  • Nayyer Masood

    (Capital University of Science & Technology)

  • Muhammad Arshad Islam

    (Capital University of Science & Technology)

Abstract

Predicting the future impact of a scientist/researcher is a critical task. The objective of this work is to evaluate different h-index prediction models for the field of Computer Science. Different combinations of parameters have been identified to build the model and applied on a large data set taken from Arnetminer comprised of almost 1.8 million authors and 2.1 million publications’ record of Computer Science. Machine learning prediction technique, regression, is used to find the best set of parameters suitable for h-index prediction for the scientists from all career ages, without enforcing any constraint on their current h-index values with R 2 as a metric to measure the accuracy. Further, these parameters are evaluated for different career ages and different thresholds for h-index values. Prediction results for 1 year are really good, having R 2 0.93 but for 5 years R 2 declines to 0.82 on average. Hence inferred that prediction of h-index is difficult for longer periods. Predictions for the researchers having 1 year experience are not precise, having R 2 0.60 for 1 year and 0.33 for 5 years. Considering scientists of different career ages, average R 2 values for researchers having 20–36 years of experience were 0.99. For the researches having different h-index values, researchers having low h-index were difficult to predict. Parameters set comprising of current h-index, average citations per paper, number of coauthors, years since publishing first article, number of publications, number of impact factor publications, and number of publications in distinct journals performed better than all other combinations.

Suggested Citation

  • Samreen Ayaz & Nayyer Masood & Muhammad Arshad Islam, 2018. "Predicting scientific impact based on h-index," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(3), pages 993-1010, March.
  • Handle: RePEc:spr:scient:v:114:y:2018:i:3:d:10.1007_s11192-017-2618-1
    DOI: 10.1007/s11192-017-2618-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-017-2618-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-017-2618-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lutz Bornmann & Rüdiger Mutz & Hans‐Dieter Daniel, 2008. "Are there better indices for evaluation purposes than the h index? A comparison of nine different variants of the h index using data from biomedicine," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 59(5), pages 830-837, March.
    2. Charles Oppenheim, 2007. "Using the h‐index to rank influential British researchers in information science and librarianship," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 58(2), pages 297-301, January.
    3. Bu, Yi & Ni, Shaokang & Huang, Win-bin, 2017. "Combining multiple scholarly relationships with author cocitation analysis: A preliminary exploration on improving knowledge domain mappings," Journal of Informetrics, Elsevier, vol. 11(3), pages 810-822.
    4. ., 2017. "Standing on the shoulders of giants," Chapters, in: Endogenous Innovation, chapter 1, pages 3-24, Edward Elgar Publishing.
    5. Amjad, Tehmina & Ding, Ying & Xu, Jian & Zhang, Chenwei & Daud, Ali & Tang, Jie & Song, Min, 2017. "Standing on the shoulders of giants," Journal of Informetrics, Elsevier, vol. 11(1), pages 307-323.
    6. Daniel E. Acuna & Stefano Allesina & Konrad P. Kording, 2012. "Predicting scientific success," Nature, Nature, vol. 489(7415), pages 201-202, September.
    7. Xiangjie Kong & Huizhen Jiang & Wei Wang & Teshome Megersa Bekele & Zhenzhen Xu & Meng Wang, 2017. "Exploring dynamic research interest and academic influence for scientific collaborator recommendation," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 369-385, October.
    8. Schreiber, Michael, 2013. "How relevant is the predictive power of the h-index? A case study of the time-dependent Hirsch index," Journal of Informetrics, Elsevier, vol. 7(2), pages 325-329.
    9. Samreen Ayaz & Muhammad Tanvir Afzal, 2016. "Identification of conversion factor for completing-h index for the field of mathematics," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(3), pages 1511-1524, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Deming Lin & Tianhui Gong & Wenbin Liu & Martin Meyer, 2020. "An entropy-based measure for the evolution of h index research," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2283-2298, December.
    2. Wang, Jiang-Pan & Guo, Qiang & Zhou, Lei & Liu, Jian-Guo, 2019. "Dynamic credit allocation for researchers," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 520(C), pages 208-216.
    3. Wanjun Xia & Tianrui Li & Chongshou Li, 2023. "A review of scientific impact prediction: tasks, features and methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 543-585, January.
    4. Matthias Kuppler, 2022. "Predicting the future impact of Computer Science researchers: Is there a gender bias?," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6695-6732, November.
    5. Pilar Valderrama & Manuel Escabias & Evaristo Jiménez-Contreras & Mariano J. Valderrama & Pilar Baca, 2018. "A mixed longitudinal and cross-sectional model to forecast the journal impact factor in the field of Dentistry," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 1203-1212, August.
    6. Diana Purwitasari & Chastine Fatichah & Surya Sumpeno & Christian Steglich & Mauridhi Hery Purnomo, 2020. "Identifying collaboration dynamics of bipartite author-topic networks with the influences of interest changes," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(3), pages 1407-1443, March.
    7. Jakub Rybacki & Dobromił Serwa, 2021. "What Makes a Successful Scientist in a Central Bank? Evidence From the RePEc Database," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 13(3), pages 331-357, September.
    8. Klemiński, Rajmund & Kazienko, Przemyslaw & Kajdanowicz, Tomasz, 2021. "Where should I publish? Heterogeneous, networks-based prediction of paper’s citation success," Journal of Informetrics, Elsevier, vol. 15(3).
    9. Pilar Valderrama & Evaristo Jiménez-Contreras & Manuel Escabias & Mariano J. Valderrama, 2022. "Introducing a bibliometric index based on factor analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 509-522, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kong, Xiangjie & Mao, Mengyi & Jiang, Huizhen & Yu, Shuo & Wan, Liangtian, 2019. "How does collaboration affect researchers’ positions in co-authorship networks?," Journal of Informetrics, Elsevier, vol. 13(3), pages 887-900.
    2. Li Hou & Qiang Wu & Yundong Xie, 2022. "Does early publishing in top journals really predict long-term scientific success in the business field?," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6083-6107, November.
    3. Zhiya Zuo & Kang Zhao, 2021. "Understanding and predicting future research impact at different career stages—A social network perspective," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(4), pages 454-472, April.
    4. Cabrerizo, F.J. & Alonso, S. & Herrera-Viedma, E. & Herrera, F., 2010. "q2-Index: Quantitative and qualitative evaluation based on the number and impact of papers in the Hirsch core," Journal of Informetrics, Elsevier, vol. 4(1), pages 23-28.
    5. Mingkun Wei, 2020. "Research on impact evaluation of open access journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(2), pages 1027-1049, February.
    6. Mingers, John & Yang, Liying, 2017. "Evaluating journal quality: A review of journal citation indicators and ranking in business and management," European Journal of Operational Research, Elsevier, vol. 257(1), pages 323-337.
    7. Shen, Hongquan & Cheng, Ying & Ju, Xiufang & Xie, Juan, 2022. "Rethinking the effect of inter-gender collaboration on research performance for scholars," Journal of Informetrics, Elsevier, vol. 16(4).
    8. Jun Zhang & Yan Hu & Zhaolong Ning & Amr Tolba & Elsayed Elashkar & Feng Xia, 2018. "AIRank: Author Impact Ranking through Positions in Collaboration Networks," Complexity, Hindawi, vol. 2018, pages 1-16, June.
    9. Danielle H. Lee, 2019. "Predicting the research performance of early career scientists," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(3), pages 1481-1504, December.
    10. Wanjun Xia & Tianrui Li & Chongshou Li, 2023. "A review of scientific impact prediction: tasks, features and methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 543-585, January.
    11. Mingers, John & Leydesdorff, Loet, 2015. "A review of theory and practice in scientometrics," European Journal of Operational Research, Elsevier, vol. 246(1), pages 1-19.
    12. Madiha Ameer & Muhammad Tanvir Afzal, 2019. "Evaluation of h-index and its qualitative and quantitative variants in Neuroscience," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 653-673, November.
    13. S. Alonso & F. J. Cabrerizo & E. Herrera-Viedma & F. Herrera, 2010. "hg-index: a new index to characterize the scientific output of researchers based on the h- and g-indices," Scientometrics, Springer;Akadémiai Kiadó, vol. 82(2), pages 391-400, February.
    14. Dimitris Bertsimas & Erik Brynjolfsson & Shachar Reichman & John Silberholz, 2015. "OR Forum—Tenure Analytics: Models for Predicting Research Impact," Operations Research, INFORMS, vol. 63(6), pages 1246-1261, December.
    15. Deise Deolindo Silva & Maria Cláudia Cabrini Grácio, 2021. "Dispersion measures for h-index: a study of the Brazilian researchers in the field of mathematics," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(3), pages 1983-2011, March.
    16. Schreiber, Michael, 2015. "Restricting the h-index to a publication and citation time window: A case study of a timed Hirsch index," Journal of Informetrics, Elsevier, vol. 9(1), pages 150-155.
    17. Xie, Qing & Zhang, Xinyuan & Kim, Giyeong & Song, Min, 2022. "Exploring the influence of coauthorship with top scientists on researchers’ affiliation, research topic, productivity, and impact," Journal of Informetrics, Elsevier, vol. 16(3).
    18. Muhammad Sajid Qureshi & Ali Daud, 2021. "Fine-grained academic rankings: mapping affiliation of the influential researchers with the top ranked HEIs," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(10), pages 8331-8361, October.
    19. Miguel A. García-Pérez, 2013. "Limited validity of equations to predict the future h index," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(3), pages 901-909, September.
    20. Jianlin Zhou & An Zeng & Ying Fan & Zengru Di, 2018. "Identifying important scholars via directed scientific collaboration networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(3), pages 1327-1343, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:114:y:2018:i:3:d:10.1007_s11192-017-2618-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.