IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/30735.html
   My bibliography  Save this paper

Why Transform Y? A Critical Assessment of Dependent-Variable Transformations in Regression Models for Skewed and Sometimes-Zero Outcomes

Author

Listed:
  • John Mullahy
  • Edward C. Norton

Abstract

Dependent variables that are non-negative, follow right-skewed distributions, and have large probability mass at zero arise often in empirical economics. Two classes of models that transform the dependent variable y — the natural logarithm of y plus a constant and the inverse hyperbolic sine — have been widely used in empirical work. We show that these two classes of models share several features that raise concerns about their application. The concerns are particularly prominent when dependent variables are frequently observed at zero, which in many instances is the main motivation for using them in the first place. The crux of the concern is that these models have an extra parameter that is generally not determined by theory but whose values have enormous consequences for point estimates. As these parameters go to extreme values estimated marginal effects on outcomes' natural scales approach those of either an untransformed linear regression or a normed linear probability model. Across a wide variety of simulated data, two-part models yield correct marginal effects, as do OLS on the untransformed y and Poisson regression. If researchers care about estimating marginal effects, we recommend using these simpler models that do not rely on transformations.

Suggested Citation

  • John Mullahy & Edward C. Norton, 2022. "Why Transform Y? A Critical Assessment of Dependent-Variable Transformations in Regression Models for Skewed and Sometimes-Zero Outcomes," NBER Working Papers 30735, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:30735
    Note: EH
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w30735.pdf
    Download Restriction: Access to the full text is generally limited to series subscribers, however if the top level domain of the client browser is in a developing country or transition economy free access is provided. More information about subscriptions and free access is available at http://www.nber.org/wwphelp.html. Free access is also available to older working papers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fabian Gaessler & Dietmar Harhoff & Stefan Sorg & Georg von Graevenitz, 2024. "Patents, Freedom to Operate, and Follow-on Innovation: Evidence from Post-Grant Opposition," Rationality and Competition Discussion Paper Series 494, CRC TRR 190 Rationality and Competition.
    2. Matteo Aquilina & Giulio Cornelli & Marina Sanchez del Villar, 2024. "Regulation, information asymmetries and the funding of new ventures," BIS Working Papers 1162, Bank for International Settlements.
    3. Kiss, Andrea & Garlick, Robert & Orkin, Kate & Hensel, Lukas, 2023. "Jobseekers' Beliefs about Comparative Advantage and (Mis)Directed Search," IZA Discussion Papers 16522, Institute of Labor Economics (IZA).
    4. Steinbach, Sandro, 2023. "The Russia–Ukraine war and global trade reallocations," Economics Letters, Elsevier, vol. 226(C).
    5. Böken, Johannes & Draca, Mirko & Mastrorocco, Nicola & Ornaghi, Arianna, 2023. "The Returns to Viral Media: The Case of US Campaign Contributions," CAGE Online Working Paper Series 681, Competitive Advantage in the Global Economy (CAGE).
    6. Isabelle Chort & Berk Öktem, 2024. "Agricultural shocks, coping policies and deforestation: Evidence from the coffee leaf rust epidemic in Mexico," American Journal of Agricultural Economics, John Wiley & Sons, vol. 106(3), pages 1020-1057, May.
    7. Daniel Borbely & Gennaro Rossi, 2023. "Urban regeneration projects and crime: evidence from Glasgow," Journal of Economic Geography, Oxford University Press, vol. 23(6), pages 1273-1301.
    8. Antea Barišić & Mahdi Ghodsi & Michael Landesmann, 2024. "Technological Push and Pull Factors of Bilateral Migration," wiiw Working Papers 242, The Vienna Institute for International Economic Studies, wiiw.

    More about this item

    JEL classification:

    • C18 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Methodolical Issues: General
    • C20 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - General
    • I10 - Health, Education, and Welfare - - Health - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:30735. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.