IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v117y2018i1d10.1007_s11192-018-2840-5.html
   My bibliography  Save this article

The next generation (plus one): an analysis of doctoral students’ academic fecundity based on a novel approach to advisor identification

Author

Listed:
  • Dominik P. Heinisch

    (University of Kassel)

  • Guido Buenstorf

    (University of Kassel
    University of Gothenburg
    IWH Leibniz Institute of Economics Halle)

Abstract

Scientific communities reproduce themselves by allowing senior scientists to educate young researchers, in particular through the training of doctoral students. This process of reproduction is imperfectly understood, in part because there are few large-scale datasets linking doctoral students to their advisors. We present a novel approach employing machine learning techniques to identify advisors among (frequent) co-authors in doctoral students’ publications. This approach enabled us to construct an original dataset encompassing more than 20,000 doctoral student-advisor pairs in applied physics and electrical engineering from German universities, 1975–2005. We employ this dataset to analyze the “fecundity” of doctoral students, i.e. their probability to become advisors themselves.

Suggested Citation

  • Dominik P. Heinisch & Guido Buenstorf, 2018. "The next generation (plus one): an analysis of doctoral students’ academic fecundity based on a novel approach to advisor identification," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 351-380, October.
  • Handle: RePEc:spr:scient:v:117:y:2018:i:1:d:10.1007_s11192-018-2840-5
    DOI: 10.1007/s11192-018-2840-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-018-2840-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-018-2840-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jian Wang & Kaspars Berzins & Diana Hicks & Julia Melkers & Fang Xiao & Diogo Pinheiro, 2012. "A boosted-trees method for name disambiguation," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(2), pages 391-411, November.
    2. Linda Reijnhoudt & Rodrigo Costas & Ed Noyons & Katy Börner & Andrea Scharnhorst, 2014. "‘Seed + expand’: a general methodology for detecting publication oeuvres of individual researchers," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1403-1417, November.
    3. Noriyuki Morichika & Sotaro Shibayama, 2016. "Use of dissertation data in science policy research," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(1), pages 221-241, July.
    4. Fabian Waldinger, 2016. "Bombs, Brains, and Science: The Role of Human and Physical Capital for the Creation of Scientific Knowledge," The Review of Economics and Statistics, MIT Press, vol. 98(5), pages 811-831, December.
    5. Coralio Ballester & Antoni Calvó-Armengol & Yves Zenou, 2006. "Who's Who in Networks. Wanted: The Key Player," Econometrica, Econometric Society, vol. 74(5), pages 1403-1417, September.
    6. Anja Schoen & Dominik Heinisch & Guido Buenstorf, 2014. "Playing the ‘Name Game’ to identify academic patents in Germany," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 527-545, October.
    7. Tartari, Valentina & Perkmann, Markus & Salter, Ammon, 2014. "In good company: The influence of peers on industry engagement by academic scientists," Research Policy, Elsevier, vol. 43(7), pages 1189-1203.
    8. John P. Conley & Ali Sina Onder, 2014. "The Research Productivity of New PhDs in Economics: The Surprisingly High Non-success of the Successful," Journal of Economic Perspectives, American Economic Association, vol. 28(3), pages 205-216, Summer.
    9. Ciriaco Andrea D'Angelo & Cristiano Giuffrida & Giovanni Abramo, 2011. "A heuristic approach to author name disambiguation in bibliometrics databases for large‐scale research assessments," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 62(2), pages 257-269, February.
    10. Levin, Sharon G & Stephan, Paula E, 1991. "Research Productivity over the Life Cycle: Evidence for Academic Scientists," American Economic Review, American Economic Association, vol. 81(1), pages 114-132, March.
    11. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    12. Baruffaldi, Stefano & Visentin, Fabiana & Conti, Annamaria, 2016. "The productivity of science & engineering PhD students hired from supervisors’ networks," Research Policy, Elsevier, vol. 45(4), pages 785-796.
    13. Ciriaco Andrea D'Angelo & Cristiano Giuffrida & Giovanni Abramo, 2011. "A heuristic approach to author name disambiguation in bibliometrics databases for large-scale research assessments," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 62(2), pages 257-269, February.
    14. Balsmeier, Benjamin & Pellens, Maikel, 2014. "Who makes, who breaks: Which scientists stay in academe?," Economics Letters, Elsevier, vol. 122(2), pages 229-232.
    15. Hugo Horta & Francisco M. Veloso & Rócio Grediaga, 2010. "Navel Gazing: Academic Inbreeding and Scientific Productivity," Management Science, INFORMS, vol. 56(3), pages 414-429, March.
    16. Rossi, Luciano & Freire, Igor L. & Mena-Chalco, Jesús P., 2017. "Genealogical index: A metric to analyze advisor–advisee relationships," Journal of Informetrics, Elsevier, vol. 11(2), pages 564-582.
    17. Stephen V David & Benjamin Y Hayden, 2012. "Neurotree: A Collaborative, Graphical Database of the Academic Genealogy of Neuroscience," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-12, October.
    18. Culp, Mark & Johnson, Kjell & Michailides, George, 2006. "ada: An R Package for Stochastic Boosting," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 17(i02).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dhananjay Kumar & Plaban Kumar Bhowmick & Sumana Dey & Debarshi Kumar Sanyal, 2023. "On the banks of Shodhganga: analysis of the academic genealogy graph of an Indian ETD repository," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(7), pages 3879-3914, July.
    2. Rafael J. P. Damaceno & Luciano Rossi & Rogério Mugnaini & Jesús P. Mena-Chalco, 2019. "The Brazilian academic genealogy: evidence of advisor–advisee relationships through quantitative analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(1), pages 303-333, April.
    3. Heinisch, Dominik & Koenig, Johannes & Otto, Anne, 2019. "The IAB-INCHER project of earned doctorates (IIPED): A supervised machine learning approach to identify doctorate recipients in the German integrated employment biography data," IAB-Discussion Paper 201913, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    4. Tobias Koopmann & Maximilian Stubbemann & Matthias Kapa & Michael Paris & Guido Buenstorf & Tom Hanika & Andreas Hotho & Robert Jäschke & Gerd Stumme, 2021. "Proximity dimensions and the emergence of collaboration: a HypTrails study on German AI research," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9847-9868, December.
    5. Saarela, Mirka & Kärkkäinen, Tommi, 2020. "Can we automate expert-based journal rankings? Analysis of the Finnish publication indicator," Journal of Informetrics, Elsevier, vol. 14(2).
    6. Buenstorf, Guido & Heinisch, Dominik P., 2020. "When do firms get ideas from hiring PhDs?," Research Policy, Elsevier, vol. 49(3).
    7. Paul Donner, 2021. "Citation analysis of Ph.D. theses with data from Scopus and Google Books," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9431-9456, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Quentin Plantec & Benjamin Cabanes & Pascal Le Masson & Benoit Weil, 2021. "Market-Pull Or Research Push? Effects Of Research Orientations On University-Industry Collaborative Ph.D. Projects' Performances," Post-Print halshs-03190142, HAL.
    2. Jinseok Kim & Jinmo Kim & Jason Owen-Smith, 2019. "Generating automatically labeled data for author name disambiguation: an iterative clustering method," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(1), pages 253-280, January.
    3. Giovanni Abramo & Ciriaco Andrea D’Angelo, 2022. "Drivers of academic engagement in public–private research collaboration: an empirical study," The Journal of Technology Transfer, Springer, vol. 47(6), pages 1861-1884, December.
    4. Corsini, Alberto & Pezzoni, Michele & Visentin, Fabiana, 2022. "What makes a productive Ph.D. student?," Research Policy, Elsevier, vol. 51(10).
    5. Hsieh, Chih-Sheng & König, Michael D. & Liu, Xiaodong & Zimmermann, Christian, 2018. "Superstar Economists: Coauthorship Networks and Research Output," IZA Discussion Papers 11916, Institute of Labor Economics (IZA).
    6. Fernanda Morillo & Ignacio Santabárbara & Javier Aparicio, 2013. "The automatic normalisation challenge: detailed addresses identification," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(3), pages 953-966, June.
    7. Giovanni Abramo & Ciriaco Andrea D’Angelo & Anastasiia Soldatenkova, 2017. "How long do top scientists maintain their stardom? An analysis by region, gender and discipline: evidence from Italy," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(2), pages 867-877, February.
    8. Myra Mohnen, 2022. "Stars and Brokers: Knowledge Spillovers Among Medical Scientists," Management Science, INFORMS, vol. 68(4), pages 2513-2532, April.
    9. Broström, Anders, 2019. "Academic breeding grounds: Home department conditions and early career performance of academic researchers," Research Policy, Elsevier, vol. 48(7), pages 1647-1665.
    10. Plantec, Quentin & Cabanes, Benjamin & le Masson, Pascal & Weil, Benoit, 2023. "Early-career academic engagement in university–industry collaborative PhDs: Research orientation and project performance," Research Policy, Elsevier, vol. 52(9).
    11. Abramo, Giovanni & D'Angelo, Ciriaco Andrea & Di Costa, Flavia, 2019. "Diversification versus specialization in scientific research: Which strategy pays off?," Technovation, Elsevier, vol. 82, pages 51-57.
    12. Hottenrott, Hanna & Lawson, Cornelia, 2014. "Flying the nest: How the home department shapes researchers’ career paths," Department of Economics and Statistics Cognetti de Martiis LEI & BRICK - Laboratory of Economics of Innovation "Franco Momigliano", Bureau of Research in Innovation, Complexity and Knowledge, Collegio 201409, University of Turin.
    13. Alison M. J. Buchan & Eva Jurczyk & Ruth Isserlin & Gary D. Bader, 2016. "Global neuroscience and mental health research: a bibliometrics case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(1), pages 515-531, October.
    14. Gorodnichenko, Yuriy & Pham, Tho & Talavera, Oleksandr, 2021. "Conference presentations and academic publishing," Economic Modelling, Elsevier, vol. 95(C), pages 228-254.
    15. Ciriaco Andrea D’Angelo & Nees Jan Eck, 2020. "Collecting large-scale publication data at the level of individual researchers: a practical proposal for author name disambiguation," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(2), pages 883-907, May.
    16. Gianluca Fabiano & Andrea Marcellusi & Giampiero Favato, 2020. "Public–private contribution to biopharmaceutical discoveries: a bibliometric analysis of biomedical research in UK," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 153-168, July.
    17. Chih-Sheng Hsieh & Michael König & Xiaodong Liu & Christian Zimmermann, 2020. "Collaboration in Bipartite Networks, with an Application to Coauthorship Networks," Tinbergen Institute Discussion Papers 20-056/VIII, Tinbergen Institute.
    18. Lutz Bornmann & Werner Marx, 2014. "How to evaluate individual researchers working in the natural and life sciences meaningfully? A proposal of methods based on percentiles of citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 487-509, January.
    19. Abramo, Giovanni & D'Angelo, Ciriaco Andrea & Grilli, Leonardo, 2021. "The effects of citation-based research evaluation schemes on self-citation behavior," Journal of Informetrics, Elsevier, vol. 15(4).
    20. Abramo, Giovanni & D’Angelo, Ciriaco Andrea, 2014. "Assessing national strengths and weaknesses in research fields," Journal of Informetrics, Elsevier, vol. 8(3), pages 766-775.

    More about this item

    Keywords

    Advisor identification; Fecundity; Ph.D. training; Advisor affects; Academic careers; Machine learning;
    All these keywords.

    JEL classification:

    • O30 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - General
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • D85 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Network Formation

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:117:y:2018:i:1:d:10.1007_s11192-018-2840-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.