IDEAS home Printed from https://ideas.repec.org/p/zbw/zewdip/22065.html
   My bibliography  Save this paper

Measuring the digitalisation of firms: A novel text mining approach

Author

Listed:
  • Axenbeck, Janna
  • Breithaupt, Patrick

Abstract

Due to the omnipresence of digital technologies in the economy, measuring firm digitalisation is of high importance. However, current indicators show several shortcomings, e.g., they lack timeliness and regional granularity. In this study, we show that advances in text mining and comprehensive firm website content can be leveraged to generate real-time and large-scale estimates of firm digitalisation. We use a transfer learning approach to capture the latent definition of digitalisation. For this purpose, we train a random forest regression model on labeled German newspaper articles and apply it on firm's website content. The predictions are used as a continuous indicator for firm digitalisation. Plausibility checks confirm the link to established digitalisation indicators at the firm and sectoral level as well as for firm size classes and regions. Lastly, we illustrate the indicator's potential for giving timely answers to pressing economic issues by analysing the link between digitalisation and firm resilience during the Covid-19 shock.

Suggested Citation

  • Axenbeck, Janna & Breithaupt, Patrick, 2022. "Measuring the digitalisation of firms: A novel text mining approach," ZEW Discussion Papers 22-065, ZEW - Leibniz Centre for European Economic Research.
  • Handle: RePEc:zbw:zewdip:22065
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/268244/1/1830443747.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Rammer, Christian & Es-Sadki, Nordine, 2023. "Using big data for generating firm-level innovation indicators - a literature review," Technological Forecasting and Social Change, Elsevier, vol. 197(C).
    2. David Lenz & Peter Winker, 2020. "Measuring the diffusion of innovations with paragraph vector topic models," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-18, January.
    3. Janna Axenbeck & Patrick Breithaupt, 2021. "Innovation indicators based on firm websites—Which website characteristics predict firm-level innovation activity?," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-23, April.
    4. Nicholas Bloom & Raffaella Sadun & John Van Reenen, 2012. "Americans Do IT Better: US Multinationals and the Productivity Miracle," American Economic Review, American Economic Association, vol. 102(1), pages 167-201, February.
    5. Julian Oliver Dörr & Jan Kinne & David Lenz & Georg Licht & Peter Winker, 2022. "An integrated data framework for policy guidance during the coronavirus pandemic: Towards real-time decision support for economic policymakers," PLOS ONE, Public Library of Science, vol. 17(2), pages 1-30, February.
    6. Chris Forman & Avi Goldfarb & Shane Greenstein, 2009. "The Internet and Local Wages: Convergence or Divergence?," NBER Working Papers 14750, National Bureau of Economic Research, Inc.
    7. Bronwyn H. Hall & Francesca Lotti & Jacques Mairesse, 2013. "Evidence on the impact of R&D and ICT investments on innovation and productivity in Italian firms," Economics of Innovation and New Technology, Taylor & Francis Journals, vol. 22(3), pages 300-328, April.
    8. Erik Brynjolfsson & Lorin M. Hitt, 2003. "Computing Productivity: Firm-Level Evidence," The Review of Economics and Statistics, MIT Press, vol. 85(4), pages 793-808, November.
    9. Niebel, Thomas, 2018. "ICT and economic growth – Comparing developing, emerging and developed countries," World Development, Elsevier, vol. 104(C), pages 197-211.
    10. Irene Bertschek & Michael Polder & Patrick Schulte, 2019. "ICT and resilience in times of crisis: evidence from cross-country micro moments data," Economics of Innovation and New Technology, Taylor & Francis Journals, vol. 28(8), pages 759-774, November.
    11. Bertschek Irene & Briglauer Wolfgang & Hüschelrath Kai & Kauf Benedikt & Niebel Thomas, 2015. "The Economic Impacts of Broadband Internet: A Survey," Review of Network Economics, De Gruyter, vol. 14(4), pages 201-227, December.
    12. Jacques Mairesse & Nathalie Greenan & Agnes Topiol-Bensaid, 2001. "Information Technology and Research and Development Impacts on Productivity and Skills: Looking for Correlations on French Firm Level Data," NBER Working Papers 8075, National Bureau of Economic Research, Inc.
    13. Jan Kinne & Janna Axenbeck, 2020. "Web mining for innovation ecosystem mapping: a framework and a large-scale pilot study," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2011-2041, December.
    14. Bertschek, Irene & Niebel, Thomas, 2016. "Mobile and more productive? Firm-level evidence on the productivity effects of mobile internet use," Telecommunications Policy, Elsevier, vol. 40(9), pages 888-898.
    15. Joseph E. Engelberg & Christopher A. Parsons, 2011. "The Causal Impact of Media in Financial Markets," Journal of Finance, American Finance Association, vol. 66(1), pages 67-97, February.
    16. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    17. Larsen, Vegard H. & Thorsrud, Leif A., 2019. "The value of news for economic developments," Journal of Econometrics, Elsevier, vol. 210(1), pages 203-218.
    18. Abdullah Gök & Alec Waterworth & Philip Shapira, 2015. "Use of web mining in studying innovation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 653-671, January.
    19. Stefan Schweikl & Robert Obermaier, 2020. "Lessons from three decades of IT productivity research: towards a better understanding of IT-induced productivity effects," Management Review Quarterly, Springer, vol. 70(4), pages 461-507, November.
    20. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    21. Jan Kinne & David Lenz, 2021. "Predicting innovative firms using web mining and deep learning," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-18, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Schubert, Torben & Ashouri, Sajad & Deschryvere, Matthias & Jäger, Angela & Visentin, Fabiana & Cunningham, Scott & Hajikhani, Arash & Pukelis, Lukas & Suominen, Arho, 2023. "The role of product digitization for productivity," MERIT Working Papers 2023-004, United Nations University - Maastricht Economic and Social Research Institute on Innovation and Technology (MERIT).
    2. Janna Axenbeck & Patrick Breithaupt, 2021. "Innovation indicators based on firm websites—Which website characteristics predict firm-level innovation activity?," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-23, April.
    3. Viete, Steffen & Erdsiek, Daniel, 2020. "Mobile Information Technologies and Firm Performance: The Role of Employee Autonomy," Information Economics and Policy, Elsevier, vol. 51(C).
    4. Nicholas Bloom & Raffaella Sadun & John Van Reenen, 2012. "Americans Do IT Better: US Multinationals and the Productivity Miracle," American Economic Review, American Economic Association, vol. 102(1), pages 167-201, February.
    5. Vegard Høghaug Larsen & Leif Anders Thorsrud, 2022. "Asset returns, news topics, and media effects," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(3), pages 838-868, July.
    6. Chen, Wen & Niebel, Thomas & Saam, Marianne, 2016. "Are intangibles more productive in ICT-intensive industries? Evidence from EU countries," Telecommunications Policy, Elsevier, vol. 40(5), pages 471-484.
    7. Dörr, Julian Oliver & Kinne, Jan & Lenz, David & Licht, Georg & Winker, Peter, 2021. "An integrated data framework for policy guidance in times of dynamic economic shocks," ZEW Discussion Papers 21-062, ZEW - Leibniz Centre for European Economic Research.
    8. Viete, Steffen & Erdsiek, Daniel, 2018. "Trust-based work time and the productivity effects of mobile information technologies in the workplace," ZEW Discussion Papers 18-013, ZEW - Leibniz Centre for European Economic Research.
    9. Michelle Connolly & James Prieger, 2009. "Economics at the FCC, 2008–2009: Broadband and Merger Review," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 35(4), pages 387-417, December.
    10. Mariana Viollaz, 2017. "ICT Adoption in Micro and Small Firms: Can Internet Access Improve Labor Productivity?," CESifo Working Paper Series 6839, CESifo.
    11. Koutroumpis, Pantelis & Leiponen, Aija & Thomas, Llewellyn D.W., 2020. "Small is big in ICT: The impact of R&D on productivity," Telecommunications Policy, Elsevier, vol. 44(1).
    12. Jorge Antonio Rodríguez-Moreno & María Engracia Rochina-Barrachina, 2019. "ICT Use, Investments in R&D and Workers’ Training, Firms’ Productivity and Markups: The Case of Ecuadorian Manufacturing," The European Journal of Development Research, Palgrave Macmillan;European Association of Development Research and Training Institutes (EADI), vol. 31(4), pages 1063-1106, September.
    13. Rammer, Christian & Es-Sadki, Nordine, 2023. "Using big data for generating firm-level innovation indicators - a literature review," Technological Forecasting and Social Change, Elsevier, vol. 197(C).
    14. Schmidt, Sebastian & Kinne, Jan & Lautenbach, Sven & Blaschke, Thomas & Lenz, David & Resch, Bernd, 2022. "Greenwashing in the US metal industry? A novel approach combining SO2 concentrations from satellite data, a plant-level firm database and web text mining," ZEW Discussion Papers 22-006, ZEW - Leibniz Centre for European Economic Research.
    15. Forman, Chris & van Zeebroeck, Nicolas, 2019. "Digital technology adoption and knowledge flows within firms: Can the Internet overcome geographic and technological distance?," Research Policy, Elsevier, vol. 48(8), pages 1-1.
    16. Aboal D. & Tacsir E., 2015. "Innovation and productivity in services and manufacturing : The role of ICT investment," MERIT Working Papers 2015-012, United Nations University - Maastricht Economic and Social Research Institute on Innovation and Technology (MERIT).
    17. Piñeiro-Chousa, Juan & López-Cabarcos, M.Ángeles & Ribeiro-Soriano, Domingo, 2020. "Does investor attention influence water companies’ stock returns?," Technological Forecasting and Social Change, Elsevier, vol. 158(C).
    18. Kaus, Wolfhard & Slavtchev, Viktor & Zimmermann, Markus, 2020. "Intangible capital and productivity: Firm-level evidence from German manufacturing," IWH Discussion Papers 1/2020, Halle Institute for Economic Research (IWH).
    19. Nucci, Francesco & Puccioni, Chiara & Ricchi, Ottavio, 2023. "Digital technologies and productivity: A firm-level investigation," Economic Modelling, Elsevier, vol. 128(C).
    20. Ariel Herbert FAMBEU, 2016. "Déterminants De L’Adoption Des Tic Dans Un Pays En Développement : Une Analyse Économétrique Sur Les Entreprises Industrielles Au Cameroun," Region et Developpement, Region et Developpement, LEAD, Universite du Sud - Toulon Var, vol. 43, pages 159-186.

    More about this item

    Keywords

    web-mining; text as data; machine learning; digitalisation;
    All these keywords.

    JEL classification:

    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • O30 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:zewdip:22065. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/zemande.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.