Printed from https://ideas.repec.org/a/spr/scient/v102y2015i2d10.1007_s11192-014-1455-8.html

Using machine learning techniques for rising star prediction in co-author network

Author

Listed:
  • Ali Daud

    (International Islamic University)

  • Muhammad Ahmad

    (Allama Iqbal Open University)

  • M. S. I. Malik

    (International Islamic University)

  • Dunren Che

    (Southern Illinois University)

Abstract

Online bibliographic databases are powerful resources for research in data mining and social network analysis, especially for co-author networks. Predicting future rising stars means finding brilliant scholars/researchers in co-author networks. In this paper, we propose a solution for rising star prediction by applying machine learning techniques. For the classification task, discriminative and generative modeling techniques are considered, and two algorithms are chosen from each category. Author, co-authorship, and venue-based information is incorporated, resulting in eleven features with their mathematical formulations. Extensive experiments are performed to analyze the impact of individual features, of feature categories, and of their combination on classification accuracy. Then, two ranking lists of the top 30 scholars are presented from the predicted rising stars. In addition, the concept is demonstrated for the prediction of rising stars in the database domain. Data from the DBLP and Arnetminer databases (1996–2000, covering a wide range of disciplines) are used for the experimental analysis of the algorithms.

Suggested Citation

  • Ali Daud & Muhammad Ahmad & M. S. I. Malik & Dunren Che, 2015. "Using machine learning techniques for rising star prediction in co-author network," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(2), pages 1687-1711, February.
  • Handle: RePEc:spr:scient:v:102:y:2015:i:2:d:10.1007_s11192-014-1455-8
    DOI: 10.1007/s11192-014-1455-8

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-014-1455-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    References listed on IDEAS

    1. Pascal Cuxac & Jean-Charles Lamirel & Valerie Bonvallot, 2013. "Efficient supervised and semi-supervised approaches for affiliations disambiguation," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(1), pages 47-58, October.
    2. Raf Guns & Ronald Rousseau, 2014. "Recommending research collaborations using link prediction and random forest classifiers," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1461-1473, November.
    3. N. Speybroeck, 2012. "Classification and regression trees," International Journal of Public Health, Springer;Swiss School of Public Health (SSPH+), vol. 57(1), pages 243-246, February.
    4. Guo Zhang & Ying Ding & Staša Milojević, 2013. "Citation content analysis (CCA): A framework for syntactic and semantic analysis of citation content," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 64(7), pages 1490-1503, July.
    5. Zongyang Ma & Aixin Sun & Gao Cong, 2013. "On predicting the popularity of newly emerging hashtags in Twitter," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 64(7), pages 1399-1410, July.

    Citations



    Cited by:

    1. Ali Daud & Min Song & Malik Khizar Hayat & Tehmina Amjad & Rabeeh Ayaz Abbasi & Hassan Dawood & Anwar Ghani, 2020. "Finding rising stars in bibliometric networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 633-661, July.
    2. Tehmina Amjad & Nafeesa Shahid & Ali Daud & Asma Khatoon, 2022. "Citation burst prediction in a bibliometric network," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2773-2790, May.
    3. Tehmina Amjad & Ying Ding & Jian Xu & Chenwei Zhang & Ali Daud & Jie Tang & Min Song, 2017. "Standing on the shoulders of giants," Journal of Informetrics, Elsevier, vol. 11(1), pages 307-323.
    4. Jorge A. V. Tohalino & Laura V. C. Quispe & Diego R. Amancio, 2021. "Analyzing the relationship between text features and grants productivity," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4255-4275, May.
    5. Yoo Kyung Jeong & Qing Xie & Erjia Yan & Min Song, 2020. "Examining drug and side effect relation using author–entity pair bipartite networks," Journal of Informetrics, Elsevier, vol. 14(1).
    6. Wanjun Xia & Tianrui Li & Chongshou Li, 2023. "A review of scientific impact prediction: tasks, features and methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 543-585, January.
    7. Tehmina Amjad & Javeria Munir, 2021. "Investigating the impact of collaboration with authority authors: a case study of bibliographic data in field of philosophy," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4333-4353, May.
    8. Gordon T. Kraft-Todd & David G. Rand, 2021. "Practice what you preach: Credibility-enhancing displays and the growth of open science," Organizational Behavior and Human Decision Processes, Elsevier, vol. 164(C), pages 1-10.
    9. Mirka Saarela & Tommi Kärkkäinen, 2020. "Can we automate expert-based journal rankings? Analysis of the Finnish publication indicator," Journal of Informetrics, Elsevier, vol. 14(2).
    10. Aftab Nawaz & MSI Malik, 2022. "Rising stars prediction in reviewer network," Electronic Commerce Research, Springer, vol. 22(1), pages 53-75, March.
    11. Yubing Nie & Yifan Zhu & Qika Lin & Sifan Zhang & Pengfei Shi & Zhendong Niu, 2019. "Academic rising star prediction via scholar’s evaluation model and machine learning techniques," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(2), pages 461-476, August.
    12. Lin Zhu & Junjie Zhang & Scott W. Cunningham, 2022. "Domain expertise extraction for finding rising stars," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5475-5495, September.
    13. Malik Khizar Hayat & Ali Daud, 2017. "Anomaly detection in heterogeneous bibliographic information networks using co-evolution pattern mining," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 149-175, October.
    14. Dhananjay Kumar & Plaban Kumar Bhowmick & Jiaul H Paik, 2023. "Researcher influence prediction (ResIP) using academic genealogy network," Journal of Informetrics, Elsevier, vol. 17(2).
    15. Jaemin Chung & Namuk Ko & Hyeonsu Kim & Janghyeok Yoon, 2021. "Inventor profile mining approach for prospective human resource scouting," Journal of Informetrics, Elsevier, vol. 15(1).
    16. George Panagopoulos & George Tsatsaronis & Iraklis Varlamis, 2017. "Detecting rising stars in dynamic collaborative networks," Journal of Informetrics, Elsevier, vol. 11(1), pages 198-222.
    17. Lin Zhu & Donghua Zhu & Xuefeng Wang & Scott W. Cunningham & Zhinan Wang, 2019. "An integrated solution for detecting rising technology stars in co-inventor networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 137-172, October.
    18. Xi Zhang & Xianhai Wang & Hongke Zhao & Patricia Ordóñez de Pablos & Yongqiang Sun & Hui Xiong, 2019. "An effectiveness analysis of altmetrics indices for different levels of artificial intelligence publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(3), pages 1311-1344, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matylda Jabłońska-Sabuka & Robert Sitarz & Andrzej Kraslawski, 2014. "Forecasting research trends using population dynamics model with Burgers’ type interaction," Journal of Informetrics, Elsevier, vol. 8(1), pages 111-122.
    2. Qihang Zhao & Xiaodong Feng, 2022. "Utilizing citation network structure to predict paper citation counts: A Deep learning approach," Journal of Informetrics, Elsevier, vol. 16(1).
    3. Sharad Goel & Ashton Anderson & Jake Hofman & Duncan J. Watts, 2016. "The Structural Virality of Online Diffusion," Management Science, INFORMS, vol. 62(1), pages 180-196, January.
    4. Hao Cui & János Kertész, 2023. "“Born in Rome” or “Sleeping Beauty”: Emergence of hashtag popularity on the Chinese microblog Sina Weibo," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 619(C).
    5. Jaebong Son & Jintae Lee & Kai R. Larsen & Jiyoung Woo, 2020. "Understanding the uncertainty of disaster tweets and its effect on retweeting: The perspectives of uncertainty reduction theory and information entropy," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 71(10), pages 1145-1161, October.
    6. Paige Brown Jarreau & Imogene A Cancellare & Becky J Carmichael & Lance Porter & Daniel Toker & Samantha Z Yammine, 2019. "Using selfies to challenge public stereotypes of scientists," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-23, May.
    7. Ali Daud & Min Song & Malik Khizar Hayat & Tehmina Amjad & Rabeeh Ayaz Abbasi & Hassan Dawood & Anwar Ghani, 2020. "Finding rising stars in bibliometric networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 633-661, July.
    8. Wai Hong Tan & Feng Chen, 2021. "Predicting the popularity of tweets using internal and external knowledge: an empirical Bayes type approach," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 105(2), pages 335-352, June.
    9. Anuja Arora & Shivam Bansal & Chandrashekhar Kandpal & Reema Aswani & Yogesh Dwivedi, 2019. "Measuring social media influencer index- insights from facebook, Twitter and Instagram," Journal of Retailing and Consumer Services, Elsevier, vol. 49(C), pages 86-101.
    10. Jaebong Son & Hyung Koo Lee & Sung Jin & Jintae Lee, 2019. "Content features of tweets for effective communication during disasters: A media synchronicity theory perspective," International Journal of Information Management, Elsevier, vol. 45(C), pages 56-68.
    11. António Fonseca & Jorge Louçã, 2018. "Explaining the emergence of online popularity through a model of information diffusion," Computational and Mathematical Organization Theory, Springer, vol. 24(2), pages 169-187, June.
    12. Lutz Bornmann & Robin Haunschild & Sven E. Hug, 2018. "Visualizing the context of citations referencing papers published by Eugene Garfield: a new type of keyword co-occurrence analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(2), pages 427-437, February.
    13. Eustache Mêgnigbêto, 2018. "Correlation Between Transmission Power and Some Indicators Used to Measure the Knowledge-Based Economy: Case of Six OECD Countries," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 9(4), pages 1168-1183, December.
    14. Xiaojuan Liu & Chenlin Wang & Dar-Zen Chen & Mu-Hsuan Huang, 2022. "Exploring perception of retraction based on mentioned status in post-retraction citations," Journal of Informetrics, Elsevier, vol. 16(3).
    15. Deyun Yin & Kazuyuki Motohashi & Jianwei Dang, 2020. "Large-scale name disambiguation of Chinese patent inventors (1985–2016)," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(2), pages 765-790, February.
    16. Emmanouil Tranos & Andre Carrascal Incera & George Willis, 2022. "Using the web to predict regional trade flows: data extraction, modelling, and validation," OSF Preprints 9bu5z, Center for Open Science.
    17. Andrea Ancona & Roy Cerqueti & Gianluca Vagnani, 2023. "A novel methodology to disambiguate organization names: an application to EU Framework Programmes data," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(8), pages 4447-4474, August.
    18. Xiaowen Xi & Jiaqi Wei & Ying Guo & Weiyu Duan, 2022. "Academic collaborations: a recommender framework spanning research interests and network topology," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6787-6808, November.
    19. Ha Jin Kim & Yoo Kyung Jeong & Min Song, 2016. "Content- and proximity-based author co-citation analysis using citation sentences," Journal of Informetrics, Elsevier, vol. 10(4), pages 954-966.
    20. Lisa Dandolo & Christina Hartig & Klaus Telkmann & Sophie Horstmann & Lars Schwettmann & Peter Selsam & Alexandra Schneider & Gabriele Bolte & on behalf of the INGER Study Group, 2022. "Decision Tree Analyses to Explore the Relevance of Multiple Sex/Gender Dimensions for the Exposure to Green Spaces: Results from the KORA INGER Study," IJERPH, MDPI, vol. 19(12), pages 1-25, June.
