Printed from https://ideas.repec.org/p/nbr/nberwo/21558.html

How to Construct Nationally Representative Firm Level data from the ORBIS Global Database

Authors

  • Sebnem Kalemli-Ozcan
  • Bent Sorensen
  • Carolina Villegas-Sanchez
  • Vadym Volosovych
  • Sevcan Yesiltas

Abstract

Firm-level data on productivity, financial activity, and firms' international linkages have become essential for research in macroeconomics, international finance, and growth. This paper describes the development of a global firm-level panel dataset for public and private companies based on the administrative micro-dataset ORBIS, provided commercially by Bureau van Dijk Electronic Publishing (BvD). The ORBIS database provides data on firms' financial and productive activities from balance sheets and income statements, together with detailed information on firms' domestic and international ownership structure, for over 130 million companies across the world. Researchers need to overcome several challenges before the database is usable for research. First, the database is not designed for the large downloads that are essential for econometric analysis. Second, several inherent biases in the database affect the download process and lead to missing information. Third, the raw data may contain a number of irregularities which, if not dealt with, result in data loss during a standard cleaning procedure. In combination, these issues cause minimal coverage of small firms, extensive missing data, and poor national representation. We give detailed instructions on gathering data from ORBIS, covering both the download methodology and the cleaning procedure, so that a researcher can construct a database that is nationally representative with minimal missing information. We use examples from several European countries to present the process and discuss the resulting dataset in detail.

Suggested Citation

  • Sebnem Kalemli-Ozcan & Bent Sorensen & Carolina Villegas-Sanchez & Vadym Volosovych & Sevcan Yesiltas, 2015. "How to Construct Nationally Representative Firm Level data from the ORBIS Global Database," NBER Working Papers 21558, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:21558
    Note: EFG IFM PR

    Download full text from publisher

    File URL: http://www.nber.org/papers/w21558.pdf
    Download Restriction: no

    More about this item

    JEL classification:

    • E0 - Macroeconomics and Monetary Economics - - General
    • F0 - International Economics - - General
    • O0 - Economic Development, Innovation, Technological Change, and Growth - - General
