Printed from https://ideas.repec.org/p/cen/tnotes/26-03.html

Imputing Immigrant Entrepreneurship in the 2002 SBO: Methods and Results

Author

Listed:
  • Parag Mahajan

Abstract

This memo documents efforts to understand and improve the quality of data produced through Census Bureau business surveys by developing an imputation model for foreign-born ownership of businesses that can extend backward in time, to the 2002 Survey of Business Owners. It uses a model selection procedure that ultimately selects a random forest model with good out-of-sample prediction capability (AUC = 0.86). It then estimates that foreign-born individuals founded 16% of new employer businesses that started between 1998 and 2002. This estimate extends the longest known U.S. time series on immigrant entrepreneurship that does not rely on self-employment as a proxy for entrepreneurship.
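For readers less familiar with the metric quoted above, the following is a minimal sketch of how an out-of-sample AUC can be computed and used to compare candidate models. It is not the memo's code: the labels, the predicted scores, and the candidate names (`candidate_a`, `candidate_b`) are all hypothetical, and the memo's actual selection procedure is not described in this record. AUC is computed here as the share of (foreign-born, not foreign-born) pairs in which the foreign-born case receives the higher predicted score, with ties counted as one half.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive example
    receives a higher score than a randomly chosen negative example
    (ties count as one half).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical held-out data: 1 = foreign-born owner, 0 = not.
held_out_labels = [1, 0, 1, 0, 0, 1, 0, 0]

# Hypothetical predicted probabilities from two candidate models.
candidates = {
    "candidate_a": [0.9, 0.2, 0.6, 0.7, 0.1, 0.8, 0.3, 0.4],
    "candidate_b": [0.6, 0.5, 0.4, 0.7, 0.2, 0.9, 0.3, 0.1],
}

# Score each candidate on the held-out data and keep the best one.
results = {name: auc(held_out_labels, s) for name, s in candidates.items()}
best = max(results, key=results.get)
print(results, best)
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect ranking, so a value of 0.86 means the model ranks a foreign-born-owned business above a non-foreign-born-owned one in roughly 86% of such pairs.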

Suggested Citation

  • Parag Mahajan, 2026. "Imputing Immigrant Entrepreneurship in the 2002 SBO: Methods and Results," CES Technical Notes Series 26-03, Center for Economic Studies, U.S. Census Bureau.
  • Handle: RePEc:cen:tnotes:26-03

    Download full text from publisher

    File URL: https://www2.census.gov/ces/tn/CES-TN-2026-03.pdf
    File Function: Abstract
    Download Restriction: CES Technical Notes may contain confidential data and, thereby, disclosure is prohibited. Researchers on approved projects (to apply for access, please see https://www.census.gov/ces/rdcresearch/howtoapply.html) with the correct permissions can request full text notes from CES.Technical.Notes.List@census.gov.

    File URL: https://www.census.gov/about/adrm/ced/apply-for-access.html?CES-TN-2026-03
    File Function: First version, 2026
    Download Restriction: CES Technical Notes may contain confidential data and, thereby, disclosure is prohibited. Researchers on approved projects (to apply for access, please see https://www.census.gov/ces/rdcresearch/howtoapply.html) with the correct permissions can request full text notes from CES.Technical.Notes.List@census.gov.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Danielle H. Sandler. General contact details of provider: https://edirc.repec.org/data/cesgvus.html
