Printed from https://ideas.repec.org/p/iab/iabdpa/201913.html

The IAB-INCHER project of earned doctorates (IIPED): A supervised machine learning approach to identify doctorate recipients in the German integrated employment biography data

Author

Listed:
  • Heinisch, Dominik

    (University of Kassel and INCHER-Kassel (Germany))

  • Koenig, Johannes
  • Otto, Anne

    (Institute for Employment Research (IAB), Nuremberg, Germany)

Abstract

"Only scarce information is available on doctorate recipients' career outcomes in Germany (BuWiN 2013). With the current information base, graduate students cannot make an informed decision whether to start a doctorate (Benderly 2018, Blank 2017). Administrative labour market data could provide the necessary information, is however incomplete in this respect. In this paper, we describe the record linkage of two datasets to close this information gap: data on doctorate recipients collected in the catalogue of the German National Library (DNB), and the German labour market biographies (IEB) from the German Institute of Employment Research. We use a machine learning based methodology, which 1) improves the record linkage of datasets without unique identifiers, and 2) evaluates the quality of the record linkage. The machine learning algorithms are trained on a synthetic training and evaluation dataset. In an exemplary analysis we compare the employment status of female and male doctorate recipients in Germany." (Author's abstract, IAB-Doku)
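
The general idea of supervised record linkage without unique identifiers, as outlined in the abstract, can be illustrated with a minimal sketch. It is not the authors' implementation: the record fields (name, birth year), the two comparison features, the typo-based synthetic noise, and the plain logistic regression are all assumptions chosen for illustration, and every name and function below is hypothetical. The sketch builds labelled synthetic pairs, trains a classifier on them, and then measures linkage quality with precision and recall.

```python
import math
import random
from difflib import SequenceMatcher


def features(a, b):
    """Comparison features for a candidate pair of (name, birth_year) records."""
    name_sim = SequenceMatcher(None, a[0].lower(), b[0].lower()).ratio()
    year_gap = min(abs(a[1] - b[1]), 10) / 10.0
    return [1.0, name_sim, year_gap]  # leading 1.0 is the intercept term


def perturb(name, rng):
    """Simulate a typo by dropping one character (assumed noise model)."""
    i = rng.randrange(len(name))
    return name[:i] + name[i + 1:]


def make_synthetic_pairs(records, rng):
    """Noisy copies of a record are matches (1); neighbouring records are non-matches (0)."""
    pairs = []
    for k, rec in enumerate(records):
        noisy = (perturb(rec[0], rng), rec[1])
        pairs.append((features(rec, noisy), 1))
        other = records[(k + 1) % len(records)]
        pairs.append((features(rec, other), 0))
    return pairs


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(pairs, epochs=2000, lr=0.5):
    """Fit logistic regression weights with batch gradient descent."""
    w = [0.0, 0.0, 0.0]
    for _ in range(epochs):
        grad = [0.0, 0.0, 0.0]
        for x, y in pairs:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
            for j in range(3):
                grad[j] += (p - y) * x[j]
        w = [wi - lr * g / len(pairs) for wi, g in zip(w, grad)]
    return w


def match_probability(w, a, b):
    """Predicted probability that records a and b refer to the same person."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, features(a, b))))


def precision_recall(w, pairs, threshold=0.5):
    """Linkage quality on labelled pairs; returns (precision, recall)."""
    tp = fp = fn = 0
    for x, y in pairs:
        pred = sigmoid(sum(wi * xi for wi, xi in zip(w, x))) >= threshold
        tp += pred and y == 1
        fp += pred and y == 0
        fn += (not pred) and y == 1
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical records; real inputs would be DNB and IEB entries.
records = [
    ("Anna Schmidt", 1980), ("Peter Müller", 1975), ("Julia Weber", 1983),
    ("Thomas Becker", 1978), ("Maria Hoffmann", 1981), ("Stefan Koch", 1972),
]
weights = train_logistic(make_synthetic_pairs(records, random.Random(0)))
quality = precision_recall(weights, make_synthetic_pairs(records, random.Random(1)))
```

Training on synthetic pairs sidesteps the lack of hand-labelled matches, and evaluating on a second, independently perturbed set gives a rough estimate of linkage quality. Real data would call for richer features and blocking to limit the number of candidate pairs.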

Suggested Citation

  • Heinisch, Dominik & Koenig, Johannes & Otto, Anne, 2019. "The IAB-INCHER project of earned doctorates (IIPED): A supervised machine learning approach to identify doctorate recipients in the German integrated employment biography data," IAB-Discussion Paper 201913, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
  • Handle: RePEc:iab:iabdpa:201913

    Download full text from publisher

    File URL: https://doku.iab.de/discussionpapers/2019/dp1319.pdf
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lucas Hafner & Benjamin Lochner, 2022. "Do minimum wages improve self-rated health? Evidence from a natural experiment," Empirical Economics, Springer, vol. 62(6), pages 2989-3014, June.
    2. Valentin Schiele, 2022. "Labor market spillover effects of a compulsory schooling reform in Germany," Working Papers Dissertations 84, Paderborn University, Faculty of Business Administration and Economics.
    3. Alfred Garloff & Carsten Pohl & Norbert Schanne, 2013. "Do small labor market entry cohorts reduce unemployment?," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 29(15), pages 379-406.
    4. Charlotte Senftleben-König, "undated". "Public Sector Employment and Local Multipliers," BDPEMS Working Papers 2014010, Berlin School of Economics.
    5. Kohlbrecher, Britta & Merkl, Christian & Nordmeier, Daniela, 2016. "Revisiting the matching function," Journal of Economic Dynamics and Control, Elsevier, vol. 69(C), pages 350-374.
    6. repec:iab:iabfda:201109(de is not listed on IDEAS
    7. Stefan Bender & Nicholas Bloom & David Card & John Van Reenen & Stefanie Wolter, 2018. "Management Practices, Workforce Selection, and Productivity," Journal of Labor Economics, University of Chicago Press, vol. 36(S1), pages 371-409.
    8. repec:iab:iabfda:201307(en is not listed on IDEAS
    9. Oberfichtner, Michael, 2019. "Arbeitslosenversicherung für Existenzgründer: Unterschiedliche Leistungen trotz gleicher Beiträge (The unemployment insurance for business founders: Different benefits despite equal contributions)," IAB-Kurzbericht 201901, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    10. Alfred Garloff & Rüdiger Wapler, 2013. "Are the Number of Skilled Workers Running Out in Germany? The (Non)-Consequences of Demographic Change," ERSA conference papers ersa13p854, European Regional Science Association.
    11. Angela Rauch & Anja Burghardt & Johannes Eggs & Anita Tisch & Silke Tophoven, 2015. "lidA–leben in der Arbeit. German cohort study on work, age and health [lidA–leben in der Arbeit. Kohortenstudie zu Gesundheit und Älterwerden in der Arbeit]," Journal for Labour Market Research, Springer;Institute for Employment Research/ Institut für Arbeitsmarkt- und Berufsforschung (IAB), vol. 48(3), pages 195-202, October.
    12. Dorner, Matthias & Fryges, Helmut & Schopen, Kathrin, 2017. "Wages in high-tech start-ups – Do academic spin-offs pay a wage premium?," Research Policy, Elsevier, vol. 46(1), pages 1-18.
    13. Philipp Berge & Hanna Frings, 2020. "High-impact minimum wages and heterogeneous regions," Empirical Economics, Springer, vol. 59(2), pages 701-729, August.
    14. Hochfellner, Daniela & Müller, Dana & Wurdack, Anja, 2014. "BASiD - Biografiedaten ausgewählter Sozialversicherungsträger in Deutschland," FDZ Datenreport. Documentation on Labour Market Data 201109_de, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    15. Jahn, Elke & Pozzoli, Dario, 2011. "Does the Sector Experience Affect the Pay Gap for Temporary Agency Workers?," Working Papers 11-9, University of Aarhus, Aarhus School of Business, Department of Economics.
    16. Zabel, Cordula, 2013. "Effects of participating in skill training and workfare on employment entries for lone mothers receiving means-tested benefits in Germany," IAB-Discussion Paper 201303, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    17. Elisabeth Bublitz, 2018. "Matching skills of individuals and firms along the career path," Oxford Economic Papers, Oxford University Press, vol. 70(2), pages 509-537.
    18. Jahn, Elke & Hirsch, Boris, 2012. "Is there monopsonistic discrimination against immigrants? First evidence from linked employer employee data," VfS Annual Conference 2012 (Goettingen): New Approaches and Challenges for the Labor Market of the 21st Century 65417, Verein für Socialpolitik / German Economic Association.
    19. Bossler, Mario & Schank, Thorsten, 2020. "Wage Inequality in Germany after the Minimum Wage Introduction," IZA Discussion Papers 13003, Institute of Labor Economics (IZA).
    20. Charlotte Senftleben-Koenig & Hanna Wielandt, 2014. "Spatial Wage Inequality and Technological Change," SFB 649 Discussion Papers SFB649DP2014-038, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    21. Thomas Kruppe & Julia Lang, 2018. "Labour market effects of retraining for the unemployed: the role of occupations," Applied Economics, Taylor & Francis Journals, vol. 50(14), pages 1578-1600, March.
    22. repec:iab:iabfda:201109(en is not listed on IDEAS
    23. Diego Montano & Richard Peter, 2022. "Informal care-giving and the intention to give up employment: the role of perceived supervisor behaviour in a cohort of German employees," European Journal of Ageing, Springer, vol. 19(3), pages 575-585, September.

    More about this item

    Keywords

    Federal Republic of Germany; career destination; career success; career path; library; data collection; dissertation; women; university graduates; data fusion; Integrated Employment Biographies; artificial intelligence; learning; men; doctorate; 1975-2015

    JEL classification:

    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • E24 - Macroeconomics and Monetary Economics - - Consumption, Saving, Production, Employment, and Investment - - - Employment; Unemployment; Wages; Intergenerational Income Distribution; Aggregate Human Capital; Aggregate Labor Productivity
    • I20 - Health, Education, and Welfare - - Education - - - General
