IDEAS home Printed from https://ideas.repec.org/p/crs/wpaper/2022-08.html
   My bibliography  Save this paper

Dealing with Logs and Zeros in Regression Models

Author

Listed:
  • Christophe Bellégo

    (CREST (UMR 9194), ENSAE, Institut Polytechnique de Paris)

  • David Benatia

    (HEC Montréal, Département d'Economie Appliquée)

  • Louis-Daniel Pape

    (CREST (UMR 9194), ENSAE, Institut Polytechnique de Paris)

Abstract

Log-linear models are prevalent in empirical research. Yet, how to handle zeros in the dependent variable remains an unsettled issue. This article clarifies it and addresses the “log of zero” by developing a new family of estimators called iterated Ordinary Least Squares (iOLS). This family nests standard approaches such as log-linear and Poisson regressions, offers several computational advantages, and corresponds to the correct way to perform the popular log(Y + 1) transformation. We extend it to the endogenous regressor setting (i2SLS) and overcome other common issues with Poisson models, such as controlling for many fixed-effects. We also develop specification tests to help researchers select between alternative estimators. Finally, our methods are illustrated through numerical simulations and replications of landmark publications.

Suggested Citation

  • Christophe Bellégo & David Benatia & Louis-Daniel Pape, 2022. "Dealing with Logs and Zeros in Regression Models," Working Papers 2022-08, Center for Research in Economics and Statistics.
  • Handle: RePEc:crs:wpaper:2022-08
    as

    Download full text from publisher

    File URL: http://crest.science/RePEc/wpstorage/2022-08.pdf
    File Function: CREST working paper version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Fernández-Val, Iván & Weidner, Martin, 2016. "Individual and time effects in nonlinear panel models with large N, T," Journal of Econometrics, Elsevier, vol. 192(1), pages 291-312.
    2. Davidson, Russell & MacKinnon, James G, 1981. "Several Tests for Model Specification in the Presence of Alternative Hypotheses," Econometrica, Econometric Society, vol. 49(3), pages 781-793, May.
    3. Ravallion, Martin, 2017. "A concave log-like transformation allowing non-positive values," Economics Letters, Elsevier, vol. 161(C), pages 130-132.
    4. Eaton Jonathan & Tamura Akiko, 1994. "Bilateralism and Regionalism in Japanese and U.S. Trade and Direct Foreign Investment Patterns," Journal of the Japanese and International Economies, Elsevier, vol. 8(4), pages 478-510, December.
    5. Rainer Winkelmann, 2008. "Econometric Analysis of Count Data," Springer Books, Springer, edition 0, number 978-3-540-78389-3, November.
    6. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 1-18, January.
    7. John Mullahy, 1997. "Instrumental-Variable Estimation Of Count Data Models: Applications To Models Of Cigarette Smoking Behavior," The Review of Economics and Statistics, MIT Press, vol. 79(4), pages 586-593, November.
    8. Dominitz, Jeff & Sherman, Robert P., 2005. "Some Convergence Theory For Iterative Estimation Procedures With An Application To Semiparametric Estimation," Econometric Theory, Cambridge University Press, vol. 21(4), pages 838-863, August.
    9. Manning, Willard G. & Mullahy, John, 2001. "Estimating log models: to transform or not to transform?," Journal of Health Economics, Elsevier, vol. 20(4), pages 461-494, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Congiu, Raffaele & Sabatino, Lorien & Sapi, Geza, 2022. "The Impact of Privacy Regulation on Web Traffic: Evidence From the GDPR," Information Economics and Policy, Elsevier, vol. 61(C).
    2. Gustave Kenedi & Louis Sirugue, 2021. "The Anatomy of Intergenerational Income Mobility in France and its Spatial Variations," Working Papers halshs-03455282, HAL.
    3. Qiao Wen, 2022. "Estimating Education and Labor Market Consequences of China’s Higher Education Expansion," Sustainability, MDPI, vol. 14(13), pages 1-25, June.
    4. Cosimo Beverelli & Rohit Ticku, 2023. "Global Livestock Trade and Infectious Diseases," RSCAS Working Papers 2023/09, European University Institute.
    5. Mengzhen Wang & Xingong Ding & Baekryul Choi, 2023. "FDI or International-Trade-Driven Green Growth of 24 Korean Manufacturing Industries? Evidence from Heterogeneous Panel Based on Non-Causality Test," Sustainability, MDPI, vol. 15(7), pages 1-20, March.
    6. Carlos Alberto Piscarreta Pinto Ferreira, 2023. "Drivers of Sovereign Bond Demand – The Case of Japans," Working Papers REM 2023/0264, ISEG - Lisbon School of Economics and Management, REM, Universidade de Lisboa.
    7. Reda Cherif & Christoph Grimpe & Fuad Hasanov & Wolfgang Sofka, 2023. "Promoting Innovation: The Differential Impact of R&D Subsidies," Journal of Industry, Competition and Trade, Springer, vol. 23(3), pages 187-241, December.
    8. Gustave Kenedi & Louis Sirugue, 2021. "The Anatomy of Intergenerational Income Mobility in France and its Spatial Variations," PSE Working Papers halshs-03455282, HAL.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. J. M. C. Santos Silva & Silvana Tenreyro, 2022. "The Log of Gravity at 15," Portuguese Economic Journal, Springer;Instituto Superior de Economia e Gestao, vol. 21(3), pages 423-437, September.
    2. Elina Bryngemark & Patrik Söderholm, 2022. "Green industrial policies and domestic production of biofuels: an econometric analysis of OECD countries," Environmental Economics and Policy Studies, Springer;Society for Environmental Economics and Policy Studies - SEEPS, vol. 24(2), pages 225-261, April.
    3. Theresa M. Greaney & Kozo Kiyota, 2020. "Japan’s Outward FDI Potential," Working Papers 202005, University of Hawaii at Manoa, Department of Economics.
    4. Santos Silva, J.M.C. & Tenreyro, Silvana, 2011. "Further simulation evidence on the performance of the Poisson pseudo-maximum likelihood estimator," Economics Letters, Elsevier, vol. 112(2), pages 220-222, August.
    5. Greaney, Theresa M. & Kiyota, Kozo, 2020. "Japan's outward FDI potential," Journal of the Japanese and International Economies, Elsevier, vol. 57(C).
    6. Mariam Camarero & Laura Montolio & Cecilio Tamarit, 2022. "Explaining German outward FDI in the EU: a reassessment using Bayesian model averaging and GLM estimators," Empirical Economics, Springer, vol. 62(2), pages 487-511, February.
    7. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 1-18, January.
    8. Kalle Hirvonen & John Hoddinott, 2017. "Agricultural production and children's diets: evidence from rural Ethiopia," Agricultural Economics, International Association of Agricultural Economists, vol. 48(4), pages 469-480, July.
    9. Börner, Lars & Severgnini, Battista, 2011. "Epidemic trade," Discussion Papers 2011/12, Free University Berlin, School of Business & Economics.
    10. Head, Keith & Mayer, Thierry, 2014. "Gravity Equations: Workhorse,Toolkit, and Cookbook," Handbook of International Economics, in: Gopinath, G. & Helpman, . & Rogoff, K. (ed.), Handbook of International Economics, edition 1, volume 4, chapter 0, pages 131-195, Elsevier.
    11. Mariam Camarero & Sergi Moliner & Cecilio Tamarit, 2021. "Is there a euro effect in the drivers of US FDI? New evidence using Bayesian model averaging techniques," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 157(4), pages 881-926, November.
    12. Santos Silva, J.M.C. & Tenreyro, Silvana, 2010. "On the existence of the maximum likelihood estimates in Poisson regression," Economics Letters, Elsevier, vol. 107(2), pages 310-312, May.
    13. Hoshi, Takeo & Kiyota, Kozo, 2019. "Potential for inward foreign direct investment in Japan," Journal of the Japanese and International Economies, Elsevier, vol. 52(C), pages 32-52.
    14. Juliana D. Araujo & Povilas Lastauskas & Chris Papageorgiou, 2017. "Evolution of Bilateral Capital Flows to Developing Countries at Intensive and Extensive Margins," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 49(7), pages 1517-1554, October.
    15. repec:hal:spmain:info:hdl:2441/75dbbb2hc596np6q8flqf6i79k is not listed on IDEAS
    16. J. M. C. Santos Silva & Silvana Tenreyro, 2011. "poisson: Some convergence issues," Stata Journal, StataCorp LP, vol. 11(2), pages 215-225, June.
    17. Hiroaki Kaido & Kaspar Wüthrich, 2021. "Decentralization estimators for instrumental variable quantile regression models," Quantitative Economics, Econometric Society, vol. 12(2), pages 443-475, May.
    18. Kareem, Fatima Olanike & Martinez-Zarzoso, Inmaculada & Brümmer, Bernhard, 2016. "Fitting the Gravity Model when Zero Trade Flows are Frequent: a Comparison of Estimation Techniques using Africa's Trade Data," GlobalFood Discussion Papers 230588, Georg-August-Universitaet Goettingen, GlobalFood, Department of Agricultural Economics and Rural Development.
    19. Weidner, Martin & Zylkin, Thomas, 2021. "Bias and consistency in three-way gravity models," Journal of International Economics, Elsevier, vol. 132(C).
    20. Koen Jochmans, 2017. "Two-Way Models for Gravity," The Review of Economics and Statistics, MIT Press, vol. 99(3), pages 478-485, July.
    21. Ciani Emanuele & Fisher Paul, 2019. "Dif-in-Dif Estimators of Multiplicative Treatment Effects," Journal of Econometric Methods, De Gruyter, vol. 8(1), pages 1-10, January.

    More about this item

    Keywords

    Contraction mapping; Elasticity; Gravity model; Iterative estimator; Log-linear; Selection bias;
    All these keywords.

    JEL classification:

    • C26 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Instrumental Variables (IV) Estimation
    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:crs:wpaper:2022-08. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Secretariat General (email available below). General contact details of provider: https://edirc.repec.org/data/crestfr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.