IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v181y2018i4p1211-1230.html
   My bibliography  Save this article

Correlates of record linkage and estimating risks of non‐linkage biases in business data sets

Author

Listed:
  • Jamie C. Moore
  • Peter W. F. Smith
  • Gabriele B. Durrant

Abstract

Researchers often utilize data sets that link information from multiple sources, but non‐linkage biases caused by linked and non‐linked subject differences are little understood, especially in business data sets. We address these knowledge gaps by studying biases in linkable 2010 UK Small Business Survey data sets. We identify correlates of business linkage propensity, and also for the first time its components: consent to linkage and register identifier appendability. As well, we take a novel approach to evaluating non‐linkage bias risks, by computing data set representativeness indicators (comparable, decomposable sample subset similarity measures). We find that the main impacts on linkage propensities and bias risks are due to consenter–non‐consenter differences explicable given business survey response processes, and differences between subjects with and without identifiers caused by register undercoverage of very small businesses. We then discuss consequences for the analysis of linked business data sets, and implications of the evaluation methods we introduce for linked data set producers and users.

Suggested Citation

  • Jamie C. Moore & Peter W. F. Smith & Gabriele B. Durrant, 2018. "Correlates of record linkage and estimating risks of non‐linkage biases in business data sets," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1211-1230, October.
  • Handle: RePEc:bla:jorssa:v:181:y:2018:i:4:p:1211-1230
    DOI: 10.1111/rssa.12342
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12342
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12342?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Melissa Bjelland & Bruce Fallick & John Haltiwanger & Erika McEntarfer, 2011. "Employer-to-Employer Flows in the United States: Estimates Using Linked Employer-Employee Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(4), pages 493-505, October.
    2. Schouten, Barry & Shlomo, Natalie & Skinner, Chris J., 2011. "Indicators for monitoring and improving representativeness of response," LSE Research Online Documents on Economics 39121, London School of Economics and Political Science, LSE Library.
    3. Felix Ritchie, 2008. "Secure access to confidential microdata: four years of the Virtual Microdata Laboratory," Economic & Labour Market Review, Palgrave Macmillan;Office for National Statistics, vol. 2(5), pages 29-34, May.
    4. Abowd, John M. & Vilhuber, Lars, 2005. "The Sensitivity of Economic Statistics to Coding Errors in Personal Identifiers," Journal of Business & Economic Statistics, American Statistical Association, vol. 23, pages 133-152, April.
    5. Robert Hayes & Catrin Omerod & Felix Ritchie, 2007. "Earnings: summary of sources and developments," Economic & Labour Market Review, Palgrave Macmillan;Office for National Statistics, vol. 1(1), pages 42-47, January.
    6. Alexander Hijzen & Richard Upward & Peter W. Wright, 2010. "Job Creation, Job Destruction and the Role of Small Firms: Firm‐Level Evidence for the UK," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 72(5), pages 621-647, October.
    7. Barry Schouten & Jelke Bethlehem & Koen Beullens & Øyvin Kleven & Geert Loosveldt & Annemieke Luiten & Katja Rutar & Natalie Shlomo & Chris Skinner, 2012. "Evaluating, Comparing, Monitoring, and Improving Representativeness of Survey Response Through R-Indicators and Partial R-Indicators," International Statistical Review, International Statistical Institute, vol. 80(3), pages 382-399, December.
    8. Barry Schouten & Natalie Shlomo, 2017. "Selecting Adaptive Survey Design Strata with Partial R-indicators," International Statistical Review, International Statistical Institute, vol. 85(1), pages 143-163, April.
    9. Florian Janik & Susanne Kohaut, 2012. "Why don’t they answer? Unit non-response in the IAB establishment panel," Quality & Quantity: International Journal of Methodology, Springer, vol. 46(3), pages 917-934, April.
    10. repec:taf:jnlbes:v:30:y:2012:i:2:p:191-201 is not listed on IDEAS
    11. Timothy Dunne & J. Bradford Jensen & Mark J. Roberts, 2009. "Producer Dynamics: New Evidence from Micro Data," NBER Books, National Bureau of Economic Research, Inc, number dunn05-1, March.
    12. Shlomo, Natalie & Skinner, Chris J. & Schouten, Barry, 2012. "Estimation of an indicator of the representativeness of survey response," LSE Research Online Documents on Economics 39124, London School of Economics and Political Science, LSE Library.
    13. John J. Abowd & John Haltiwanger & Julia Lane, 2004. "Integrated Longitudinal Employer-Employee Data for the United States," American Economic Review, American Economic Association, vol. 94(2), pages 224-229, May.
    14. Timothy Dunne & J. Bradford Jensen & Mark J. Roberts, 2009. "Introduction to "Producer Dynamics: New Evidence from Micro Data"," NBER Chapters, in: Producer Dynamics: New Evidence from Micro Data, pages 1-12, National Bureau of Economic Research, Inc.
    15. Barry Schouten & Fannie Cobben & Peter Lundquist & James Wagner, 2016. "Does more balanced survey response imply less non-response bias?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(3), pages 727-748, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Valentin Reich, 2024. "Machine Learning Based Linkage of Company Data for Economic Research: Application to the EBDC Business Panels," ifo Working Paper Series 409, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.
    2. Serena Pattaro & Nick Bailey & Chris Dibben, 2020. "Using Linked Longitudinal Administrative Data to Identify Social Disadvantage," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 147(3), pages 865-895, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roberts Caroline & Vandenplas Caroline & Herzing Jessica M.E., 2020. "A Validation of R-Indicators as a Measure of the Risk of Bias using Data from a Nonresponse Follow-Up Survey," Journal of Official Statistics, Sciendo, vol. 36(3), pages 675-701, September.
    2. Fariha Kamal & C.J. Krizan, 2012. "Decomposing Aggregate Trade Flows: New Evidence from U.S. Traders," Working Papers 12-17, Center for Economic Studies, U.S. Census Bureau.
    3. Henry Hyatt & Erika McEntarfer, 2012. "Job-to-Job Flows and the Business Cycle," Working Papers 12-04, Center for Economic Studies, U.S. Census Bureau.
    4. Ian M. Schmutte, 2015. "Job Referral Networks and the Determination of Earnings in Local Labor Markets," Journal of Labor Economics, University of Chicago Press, vol. 33(1), pages 1-32.
    5. Barry Schouten & Natalie Shlomo, 2017. "Selecting Adaptive Survey Design Strata with Partial R-indicators," International Statistical Review, International Statistical Institute, vol. 85(1), pages 143-163, April.
    6. Abowd, John M. & Vilhuber, Lars, 2011. "National estimates of gross employment and job flows from the Quarterly Workforce Indicators with demographic and industry detail," Journal of Econometrics, Elsevier, vol. 161(1), pages 82-99, March.
    7. Jamie C. Moore & Gabriele B. Durrant & Peter W. F. Smith, 2021. "Do coefficients of variation of response propensities approximate non‐response biases during survey data collection?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 301-323, January.
    8. Thais Paiva & Jerry Reiter, 2014. "Using Imputation Techniques To Evaluate Stopping Rules In Adaptive Survey Design," Working Papers 14-40, Center for Economic Studies, U.S. Census Bureau.
    9. Särndal Carl-Erik & Lundquist Peter, 2017. "Inconsistent Regression and Nonresponse Bias: Exploring Their Relationship as a Function of Response Imbalance," Journal of Official Statistics, Sciendo, vol. 33(3), pages 709-734, September.
    10. Ahn, JaeBin & Choi, Moon Jung, 2020. "From firm-level imports to aggregate productivity: Evidence from Korean manufacturing firm data," Japan and the World Economy, Elsevier, vol. 56(C).
    11. Barth, Erling & Davis, James C. & Freeman, Richard B. & McElheran, Kristina, 2023. "Twisting the demand curve: Digitalization and the older workforce," Journal of Econometrics, Elsevier, vol. 233(2), pages 443-467.
    12. José Fillat & Stefania Garetto & Arthur V. Smith, 2018. "What are the consequences of global banking for the international transmission of shocks?: a quantitative analysis," Working Papers 18-11, Federal Reserve Bank of Boston.
    13. Paul L. E. Grieco & Shengyu Li & Hongsong Zhang, 2022. "Input prices, productivity, and trade dynamics: long‐run effects of liberalization on Chinese paint manufacturers," RAND Journal of Economics, RAND Corporation, vol. 53(3), pages 516-560, September.
    14. Jose L. Fillat & Stefania Garetto & Arthur V. Smith, 2018. "What are the consequences of global banking for the international transmission of shocks? A quantitative analysis∗," Boston University - Department of Economics - The Institute for Economic Development Working Papers Series dp-303, Boston University - Department of Economics, revised Oct 2018.
    15. Nitish Jain & Karan Girotra & Serguei Netessine, 2014. "Managing Global Sourcing: Inventory Performance," Management Science, INFORMS, vol. 60(5), pages 1202-1222, May.
    16. Kevin L. McKinney & John M. Abowd & John Sabelhaus, 2021. "United States Earnings Dynamics: Inequality, Mobility, and Volatility," NBER Chapters, in: Measuring Distribution and Mobility of Income and Wealth, pages 69-104, National Bureau of Economic Research, Inc.
    17. Horst Raff & Nicolas Schmitt, 2016. "Manufacturers and retailers in the global economy," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 49(2), pages 685-706, May.
    18. Justin Pierce & Peter Schott, 2009. "A Concordance Between Ten-Digit U.S. Harmonized System Codes and SIC/NAICS Product Classes and Industries," Working Papers 09-41, Center for Economic Studies, U.S. Census Bureau.
    19. Alessandra Bonfiglioli & Rosario Crinò & Gino Gancia, 2021. "International Trade with Heterogeneous Firms: Theory and Evidence," CESifo Working Paper Series 9423, CESifo.
    20. Barry Schouten & Fannie Cobben & Peter Lundquist & James Wagner, 2016. "Does more balanced survey response imply less non-response bias?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(3), pages 727-748, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:181:y:2018:i:4:p:1211-1230. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.