IDEAS home Printed from https://ideas.repec.org/p/nsr/escoet/escoe-tr-14.html
   My bibliography  Save this paper

Matching UK Business Microdata – A Study Using ONS and CBI Business Surveys

Author

Listed:
  • Michael J Mahony
  • Josh Martin

Abstract

Business data linkage is a powerful tool to unlock new insights, that are often not possible using data from one source alone. However, it can be challenging and often requires a number of decisions to be made on how the linking should be conducted. Such decisions can affect the match rates and conclusions drawn from the linked data. To provide some useful information to researchers on the common pitfalls when doing data linkage, and some potential solutions, we provide an account of a business data linkage exercise. We link three sources: a survey of businesses conducted by the Confederation of British Industry (CBI), the FAME dataset of business financial data from Bureau van Dijk, and the Inter-Departmental Business Register (IDBR). This requires the use of business names and addresses as linking 'keys' which are subject to error and imprecision, resulting in less than complete matches. We detail a novel solution to choose among ‘multiple matches’ when a propensity-score matching approach is unable to select a definitive match, which we implemented when linking the CBI data with the IDBR. We report match results, which are around 50 per cent when linking the CBI survey with FAME, and around 90 per cent when linking the CBI survey with IDBR. We also report variation by geography, size and time-period. We then use the IDBR-linked CBI data to match on data from various ONS business surveys, which typically have match rates of less than 50 per cent, and in some cases far lower. We conclude with some recommendations for researchers when conducting data linkage.

Suggested Citation

  • Michael J Mahony & Josh Martin, 2022. "Matching UK Business Microdata – A Study Using ONS and CBI Business Surveys," Economic Statistics Centre of Excellence (ESCoE) Technical Reports ESCOE-TR-14, Economic Statistics Centre of Excellence (ESCoE).
  • Handle: RePEc:nsr:escoet:escoe-tr-14
    as

    Download full text from publisher

    File URL: https://escoe-website.s3.amazonaws.com/wp-content/uploads/2022/01/18094535/ESCoE-TR-14.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kevin Lee & Michael Mahony & Paul Mizen, 2020. "The CBI Suite of Business Surveys," Economic Statistics Centre of Excellence (ESCoE) Technical Reports ESCOE-TR-08, Economic Statistics Centre of Excellence (ESCoE).
    2. Raffo, Julio & Lhuillery, Stéphane, 2009. "How to play the "Names Game": Patent retrieval comparing different heuristics," Research Policy, Elsevier, vol. 38(10), pages 1617-1627, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nils Grashof, 2020. "Sinking or swimming in the cluster labour pool? A firm-specific analysis of the effect of specialized labour," Jena Economics Research Papers 2020-006, Friedrich-Schiller-University Jena.
    2. Battke, Benedikt & Schmidt, Tobias S. & Stollenwerk, Stephan & Hoffmann, Volker H., 2016. "Internal or external spillovers—Which kind of knowledge is more likely to flow within or across technologies," Research Policy, Elsevier, vol. 45(1), pages 27-41.
    3. Deyun Yin & Kazuyuki Motohashi & Jianwei Dang, 2020. "Large-scale name disambiguation of Chinese patent inventors (1985–2016)," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(2), pages 765-790, February.
    4. Francesco Lissoni & Michele Pezzoni & Bianca Potì & Sandra Romagnosi, 2012. "University autonomy, IP legislation and academic patenting: Italy, 1996-2007," Post-Print hal-00779750, HAL.
    5. Stefano Breschi & Francesco Lissoni & Gianluca Tarasconi, 2014. "Inventor Data for Research on Migration and Innovation: A Survey and a Pilot," WIPO Economic Research Working Papers 17, World Intellectual Property Organization - Economics and Statistics Division.
    6. Santiago Camara, 2022. "Granular Linkages, Supplier Cost Shocks & Export Performance," Papers 2203.07282, arXiv.org.
    7. Carayol, Nicolas & Bergé, Laurent & Cassi, Lorenzo & Roux, Pascale, 2019. "Unintended triadic closure in social networks: The strategic formation of research collaborations between French inventors," Journal of Economic Behavior & Organization, Elsevier, vol. 163(C), pages 218-238.
    8. Markus Simeth & Michele Cincera, 2016. "Corporate Science, Innovation, and Firm Value," Management Science, INFORMS, vol. 62(7), pages 1970-1981, July.
    9. Jung, Taehyun & Ejermo, Olof, 2014. "Demographic patterns and trends in patenting: Gender, age, and education of inventors," Technological Forecasting and Social Change, Elsevier, vol. 86(C), pages 110-124.
    10. Favaro, Donata & Ninka, Eniel & Turvani, Margherita, 2012. "Productivity in innovation: the role of inventor connections and mobility," MPRA Paper 38950, University Library of Munich, Germany.
    11. Ufuk Akcigit & Santiago Caicedo & Ernest Miguelez & Stefanie Stantcheva & Valerio Sterzi, 2018. "Dancing with the Stars: Innovation Through Interactions," NBER Working Papers 24466, National Bureau of Economic Research, Inc.
    12. Josh Martin & Kyle Jones, 2022. "An Occupation and Asset Driven Approach to Capital Utilisation Adjustment in Productivity Statistics," Economic Statistics Centre of Excellence (ESCoE) Discussion Papers ESCoE DP-2022-11, Economic Statistics Centre of Excellence (ESCoE).
    13. Roberta Piergiovanni & Enrico Santarelli, 2013. "The more you spend, the more you get? The effects of R&D and capital expenditures on the patenting activities of biotechnology firms," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(2), pages 497-521, February.
    14. Miguélez, Ernest & Moreno, Rosina, 2015. "Knowledge flows and the absorptive capacity of regions," Research Policy, Elsevier, vol. 44(4), pages 833-848.
    15. Martin Kalthaus, 2020. "Knowledge recombination along the technology life cycle," Journal of Evolutionary Economics, Springer, vol. 30(3), pages 643-704, July.
    16. Peveri, Julieta & Sangnier, Marc, 2023. "Gender differences in re-contesting decisions: New evidence from French municipal elections," Journal of Economic Behavior & Organization, Elsevier, vol. 214(C), pages 574-594.
    17. A. Bozio & D. Irac & L. Py, 2014. "Impact of research tax credit on R&D and innovation: evidence from the 2008 French reform," Working papers 532, Banque de France.
    18. Paulo Vinícius Marcondes Cordeiro & Dario Eduardo Amaral Dergint & Kazuo Hatakeyama, 2014. "Proposal Of Method For An Automatic Complementarities Search Between Companies' R&D," International Journal of Innovation and Technology Management (IJITM), World Scientific Publishing Co. Pte. Ltd., vol. 11(02), pages 1-21.
    19. Foray, D. & Raffo, J., 2014. "The emergence of an educational tool industry: Opportunities and challenges for innovation in education," Research Policy, Elsevier, vol. 43(10), pages 1707-1715.
    20. Li, Guan-Cheng & Lai, Ronald & D’Amour, Alexander & Doolin, David M. & Sun, Ye & Torvik, Vetle I. & Yu, Amy Z. & Fleming, Lee, 2014. "Disambiguation and co-authorship networks of the U.S. patent inventor database (1975–2010)," Research Policy, Elsevier, vol. 43(6), pages 941-955.

    More about this item

    Keywords

    business surveys; data linkage; microdata analysis;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • C89 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Other

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nsr:escoet:escoe-tr-14. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ESCoE Centre Manager (email available below). General contact details of provider: https://edirc.repec.org/data/escoeuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.