IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v181y2018i4p1211-1230.html
   My bibliography  Save this article

Correlates of record linkage and estimating risks of non‐linkage biases in business data sets

Author

Listed:
  • Jamie C. Moore
  • Peter W. F. Smith
  • Gabriele B. Durrant

Abstract

Researchers often utilize data sets that link information from multiple sources, but non‐linkage biases caused by linked and non‐linked subject differences are little understood, especially in business data sets. We address these knowledge gaps by studying biases in linkable 2010 UK Small Business Survey data sets. We identify correlates of business linkage propensity, and also for the first time its components: consent to linkage and register identifier appendability. As well, we take a novel approach to evaluating non‐linkage bias risks, by computing data set representativeness indicators (comparable, decomposable sample subset similarity measures). We find that the main impacts on linkage propensities and bias risks are due to consenter–non‐consenter differences explicable given business survey response processes, and differences between subjects with and without identifiers caused by register undercoverage of very small businesses. We then discuss consequences for the analysis of linked business data sets, and implications of the evaluation methods we introduce for linked data set producers and users.

Suggested Citation

  • Jamie C. Moore & Peter W. F. Smith & Gabriele B. Durrant, 2018. "Correlates of record linkage and estimating risks of non‐linkage biases in business data sets," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1211-1230, October.
  • Handle: RePEc:bla:jorssa:v:181:y:2018:i:4:p:1211-1230
    DOI: 10.1111/rssa.12342
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12342
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12342?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Melissa Bjelland & Bruce Fallick & John Haltiwanger & Erika McEntarfer, 2011. "Employer-to-Employer Flows in the United States: Estimates Using Linked Employer-Employee Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(4), pages 493-505, October.
    2. Timothy Dunne & J. Bradford Jensen & Mark J. Roberts, 2009. "Introduction to "Producer Dynamics: New Evidence from Micro Data"," NBER Chapters, in: Producer Dynamics: New Evidence from Micro Data, pages 1-12, National Bureau of Economic Research, Inc.
    3. Abowd, John M. & Vilhuber, Lars, 2005. "The Sensitivity of Economic Statistics to Coding Errors in Personal Identifiers," Journal of Business & Economic Statistics, American Statistical Association, vol. 23, pages 133-152, April.
    4. Barry Schouten & Natalie Shlomo, 2017. "Selecting Adaptive Survey Design Strata with Partial R-indicators," International Statistical Review, International Statistical Institute, vol. 85(1), pages 143-163, April.
    5. Shlomo, Natalie & Skinner, Chris J. & Schouten, Barry, 2012. "Estimation of an indicator of the representativeness of survey response," LSE Research Online Documents on Economics 39124, London School of Economics and Political Science, LSE Library.
    6. John J. Abowd & John Haltiwanger & Julia Lane, 2004. "Integrated Longitudinal Employer-Employee Data for the United States," American Economic Review, American Economic Association, vol. 94(2), pages 224-229, May.
    7. Barry Schouten & Fannie Cobben & Peter Lundquist & James Wagner, 2016. "Does more balanced survey response imply less non-response bias?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(3), pages 727-748, June.
    8. Alexander Hijzen & Richard Upward & Peter W. Wright, 2010. "Job Creation, Job Destruction and the Role of Small Firms: Firm‐Level Evidence for the UK," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 72(5), pages 621-647, October.
    9. Robert Hayes & Catrin Omerod & Felix Ritchie, 2007. "Earnings: summary of sources and developments," Economic & Labour Market Review, Palgrave Macmillan;Office for National Statistics, vol. 1(1), pages 42-47, January.
    10. Barry Schouten & Jelke Bethlehem & Koen Beullens & Øyvin Kleven & Geert Loosveldt & Annemieke Luiten & Katja Rutar & Natalie Shlomo & Chris Skinner, 2012. "Evaluating, Comparing, Monitoring, and Improving Representativeness of Survey Response Through R-Indicators and Partial R-Indicators," International Statistical Review, International Statistical Institute, vol. 80(3), pages 382-399, December.
    11. Felix Ritchie, 2008. "Secure access to confidential microdata: four years of the Virtual Microdata Laboratory," Economic & Labour Market Review, Palgrave Macmillan;Office for National Statistics, vol. 2(5), pages 29-34, May.
    12. Florian Janik & Susanne Kohaut, 2012. "Why don’t they answer? Unit non-response in the IAB establishment panel," Quality & Quantity: International Journal of Methodology, Springer, vol. 46(3), pages 917-934, April.
    13. repec:taf:jnlbes:v:30:y:2012:i:2:p:191-201 is not listed on IDEAS
    14. Schouten, Barry & Shlomo, Natalie & Skinner, Chris J., 2011. "Indicators for monitoring and improving representativeness of response," LSE Research Online Documents on Economics 39121, London School of Economics and Political Science, LSE Library.
    15. Timothy Dunne & J. Bradford Jensen & Mark J. Roberts, 2009. "Producer Dynamics: New Evidence from Micro Data," NBER Books, National Bureau of Economic Research, Inc, number dunn05-1, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Valentin Reich, 2024. "Machine Learning Based Linkage of Company Data for Economic Research: Application to the EBDC Business Panels," ifo Working Paper Series 409, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.
    2. Serena Pattaro & Nick Bailey & Chris Dibben, 2020. "Using Linked Longitudinal Administrative Data to Identify Social Disadvantage," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 147(3), pages 865-895, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roberts Caroline & Vandenplas Caroline & Herzing Jessica M.E., 2020. "A Validation of R-Indicators as a Measure of the Risk of Bias using Data from a Nonresponse Follow-Up Survey," Journal of Official Statistics, Sciendo, vol. 36(3), pages 675-701, September.
    2. Ian M. Schmutte, 2015. "Job Referral Networks and the Determination of Earnings in Local Labor Markets," Journal of Labor Economics, University of Chicago Press, vol. 33(1), pages 1-32.
    3. Abowd, John M. & Vilhuber, Lars, 2011. "National estimates of gross employment and job flows from the Quarterly Workforce Indicators with demographic and industry detail," Journal of Econometrics, Elsevier, vol. 161(1), pages 82-99, March.
    4. Thais Paiva & Jerry Reiter, 2014. "Using Imputation Techniques To Evaluate Stopping Rules In Adaptive Survey Design," Working Papers 14-40, Center for Economic Studies, U.S. Census Bureau.
    5. Jamie C. Moore & Gabriele B. Durrant & Peter W. F. Smith, 2021. "Do coefficients of variation of response propensities approximate non‐response biases during survey data collection?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 301-323, January.
    6. Fariha Kamal & C.J. Krizan, 2012. "Decomposing Aggregate Trade Flows: New Evidence from U.S. Traders," Working Papers 12-17, Center for Economic Studies, U.S. Census Bureau.
    7. Henry Hyatt & Erika McEntarfer, 2012. "Job-to-Job Flows and the Business Cycle," Working Papers 12-04, Center for Economic Studies, U.S. Census Bureau.
    8. Barry Schouten & Natalie Shlomo, 2017. "Selecting Adaptive Survey Design Strata with Partial R-indicators," International Statistical Review, International Statistical Institute, vol. 85(1), pages 143-163, April.
    9. Barth, Erling & Davis, James C. & Freeman, Richard B. & McElheran, Kristina, 2023. "Twisting the demand curve: Digitalization and the older workforce," Journal of Econometrics, Elsevier, vol. 233(2), pages 443-467.
    10. David Card & Jesse Rothstein & Moises Yi, 2021. "Location, Location, Location," Working Papers 21-32, Center for Economic Studies, U.S. Census Bureau.
    11. Manova, Kalina & Yu, Zhihong, 2017. "Multi-product firms and product quality," Journal of International Economics, Elsevier, vol. 109(C), pages 116-137.
    12. Gerard Hoberg & S. Katie Moon, 2019. "The Offshoring Return Premium," Management Science, INFORMS, vol. 67(6), pages 2876-2899, June.
    13. Andrew B. Bernard & Stephen J. Redding & Peter K. Schott, 2011. "Multiproduct Firms and Trade Liberalization," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 126(3), pages 1271-1318.
    14. James J. Fetzer & Tina Highfill & Kassu W. Hossiso & Thomas F. Howells III & Erich H. Strassner & Jeffrey A. Young, 2021. "Accounting for Firm Heterogeneity within US Industries: Extended Supply-Use Tables and Trade in Value Added Using Enterprise and Establishment Level Data," NBER Chapters, in: Challenges of Globalization in the Measurement of National Accounts, pages 311-342, National Bureau of Economic Research, Inc.
    15. Diego Vivanco, 2019. "Assessing firm heterogeneity within industries for the Chilean economy," Economic Statistics Series 127, Central Bank of Chile.
    16. Marc-Andreas Muendler, 2014. "Export or merge? Proximity vs. concentration in product space," Asia-Pacific Journal of Accounting & Economics, Taylor & Francis Journals, vol. 21(1), pages 35-57, March.
    17. Kristian Behrens & Giordano Mion & Yasusada Murata & Jens Südekum, 2014. "Trade, Wages, And Productivity," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 55(4), pages 1305-1348, November.
    18. Alessandra Bonfiglioli & Gino Gancia, 2019. "Heterogeneity, selection and labor market disparities," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 31, pages 305-325, January.
    19. Tscheke, Jan, 2016. "Operational Hedging of Exchange Rate Risks," Discussion Papers in Economics 30227, University of Munich, Department of Economics.
    20. Gita Gopinath & Emine Boz & Camila Casas & Federico J. Díez & Pierre-Olivier Gourinchas & Mikkel Plagborg-Møller, 2020. "Dominant Currency Paradigm," American Economic Review, American Economic Association, vol. 110(3), pages 677-719, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:181:y:2018:i:4:p:1211-1230. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.