Estimating Log Models: To Transform or Not to Transform?

My bibliography Save this paper

Estimating Log Models: To Transform or Not to Transform?

Author

Listed:

Willard G. Manning
John Mullahy

Registered:

Willard G. Manning †
John Mullahy

Abstract

Data on health care expenditures, length of stay, utilization of health services, consumption of unhealthy commodities, etc. are typically characterized by: (a) nonnegative outcomes; (b) nontrivial fractions of zero outcomes in the population (and sample); and (c) positively-skewed distributions of the nonzero realizations. Similar data structures are encountered in labor economics as well. This paper provides simulation-based evidence on the finite-sample behavior of two sets of estimators designed to look at the effect of a set of covariates x on the expected outcome, E(y|x), under a range of data problems encountered in every day practice: generalized linear models (GLM), a subset of which can simply be viewed as differentially weighted nonlinear least-squares estimators, and those derived from least-squares estimators for the ln(y). We consider the first- and second- order behavior of these candidate estimators under alternative assumptions on the data generating processes. Our results indicate that the choice of estimator for models of ln(E(x|y)) can have major implications for empirical results if the estimator is not designed to deal with the specific data generating mechanism. Garden-variety statistical problems - skewness, kurtosis, and heteroscedasticity - can lead to an appreciable bias for some estimators or appreciable losses in precision for others.

Suggested Citation

Willard G. Manning & John Mullahy, 1999. "Estimating Log Models: To Transform or Not to Transform?," NBER Technical Working Papers 0246, National Bureau of Economic Research, Inc.

Handle: RePEc:nbr:nberte:0246
Note: TWP

Download full text from publisher

Other versions of this item:

Manning, Willard G. & Mullahy, John, 2001. "Estimating log models: to transform or not to transform?," Journal of Health Economics, Elsevier, vol. 20(4), pages 461-494, July.

References listed on IDEAS

Manning, Willard G, et al, 1987. "Health Insurance and the Demand for Medical Care: Evidence from a Randomized Experiment," American Economic Review, American Economic Association, vol. 77(3), pages 251-277, June.
Mullahy, John, 1998. "Much ado about two: reconsidering retransformation and the two-part model in health econometrics," Journal of Health Economics, Elsevier, vol. 17(3), pages 247-281, June.
Manning, W. G. & Duan, N. & Rogers, W. H., 1987. "Monte Carlo evidence on the choice between sample selection and two-part models," Journal of Econometrics, Elsevier, vol. 35(1), pages 59-82, May.
John Mullahy, 1998. "Much Ado About Two: Reconsidering Retransformation and the Two-Part Model in Health Economics," NBER Technical Working Papers 0228, National Bureau of Economic Research, Inc.
Wooldridge, Jeffrey M., 1991. "On the application of robust, regression- based diagnostics to models of conditional means and conditional variances," Journal of Econometrics, Elsevier, vol. 47(1), pages 5-46, January.
Kennedy, Peter E, 1981. "Estimation with Correctly Interpreted Dummy Variables in Semilogarithmic Equations [The Interpretation of Dummy Variables in Semilogarithmic Equations]," American Economic Review, American Economic Association, vol. 71(4), pages 801-801, September.
Andrew M. Jones, 2012. "health econometrics," The New Palgrave Dictionary of Economics,, Palgrave Macmillan.
- Jones, Andrew M., 2000. "Health econometrics," Handbook of Health Economics, in: A. J. Culyer & J. P. Newhouse (ed.), Handbook of Health Economics, edition 1, volume 1, chapter 6, pages 265-344, Elsevier.
Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Applications to Poisson Models," Econometrica, Econometric Society, vol. 52(3), pages 701-720, May.
- Gourieroux Christian & Monfort Alain & Trognon A, 1982. "Pseudo maximum lilelihood methods : applications to poisson models," CEPREMAP Working Papers (Couverture Orange) 8203, CEPREMAP.
Manning, Willard G., 1998. "The logged dependent variable, heteroscedasticity, and the retransformation problem," Journal of Health Economics, Elsevier, vol. 17(3), pages 283-295, June.
Kennedy, Peter, 1983. "Logarithmic Dependent Variables and Prediction Bias," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 45(4), pages 389-392, November.
Duan, Naihua, et al, 1983. "A Comparison of Alternative Models for the Demand for Medical Care," Journal of Business & Economic Statistics, American Statistical Association, vol. 1(2), pages 115-126, April.
Blough, David K. & Madden, Carolyn W. & Hornbrook, Mark C., 1999. "Modeling risk using generalized linear models," Journal of Health Economics, Elsevier, vol. 18(2), pages 153-171, April.
A. J. Culyer & J. P. Newhouse (ed.), 2000. "Handbook of Health Economics," Handbook of Health Economics, Elsevier, edition 1, volume 1, number 1.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Cantoni, Eva & Ronchetti, Elvezio, 2006. "A robust approach for skewed and heavy-tailed outcomes in the analysis of health care expenditures," Journal of Health Economics, Elsevier, vol. 25(2), pages 198-213, March.
Manos Matsaganis & Theodore Mitrakos & Panos Tsakloglou, 2009. "Modelling health expenditure at the household level in Greece," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 10(3), pages 329-336, July.
Jones, A.M, 2010. "Models For Health Care," Health, Econometrics and Data Group (HEDG) Working Papers 10/01, HEDG, c/o Department of Economics, University of York.
Jay Dev Dubey, 2021. "Measuring Income Elasticity of Healthcare-Seeking Behavior in India: A Conditional Quantile Regression Approach," Journal of Quantitative Economics, Springer;The Indian Econometric Society (TIES), vol. 19(4), pages 767-793, December.
Borislava Mihaylova & Andrew Briggs & Anthony O'Hagan & Simon G. Thompson, 2011. "Review of statistical methods for analysing healthcare resources and costs," Health Economics, John Wiley & Sons, Ltd., vol. 20(8), pages 897-916, August.
Jeonghoon Ahn, 2004. "Panel Data Sample Selection Model: an Application to Employee Choice of Health Plan Type and Medical Cost Estimation," Econometric Society 2004 Far Eastern Meetings 560, Econometric Society.
Buntin, Melinda Beeuwkes & Zaslavsky, Alan M., 2004. "Too much ado about two-part models and transformation?: Comparing methods of modeling Medicare expenditures," Journal of Health Economics, Elsevier, vol. 23(3), pages 525-542, May.
Liu, Lei & Strawderman, Robert L. & Cowen, Mark E. & Shih, Ya-Chen T., 2010. "A flexible two-part random effects model for correlated medical costs," Journal of Health Economics, Elsevier, vol. 29(1), pages 110-123, January.
Keane, Michael & Stavrunova, Olena, 2016. "Adverse selection, moral hazard and the demand for Medigap insurance," Journal of Econometrics, Elsevier, vol. 190(1), pages 62-78.
- Keane, M. & Stavrunova, O., 2010. "Adverse Selection, Moral Hazard and the Demand for Medigap Insurance," Health, Econometrics and Data Group (HEDG) Working Papers 10/14, HEDG, c/o Department of Economics, University of York.
- Michael Keane & Olena Stavrunova, 2011. "Adverse Selection, Moral Hazard and the Demand for Medigap Insurance," Working Paper Series 167, Finance Discipline Group, UTS Business School, University of Technology, Sydney.
- Michael P. Keane & Olean Stavrunova, 2014. "Adverse Selection, Moral Hazard and the Demand for Medigap Insurance," Economics Papers 2014-W02, Economics Group, Nuffield College, University of Oxford.
- Michael Keane & Olena Stavrunova, 2011. "Adverse Selection, Moral Hazard and the Demand for Medigap Insurance," Working Papers 201119, ARC Centre of Excellence in Population Ageing Research (CEPAR), Australian School of Business, University of New South Wales.
- Michael P. Keane & Olena Stavrunova, 2012. "Adverse Selection, Moral Hazard and the Demand for Medigap Insurance," Economics Papers 2012-W10, Economics Group, Nuffield College, University of Oxford.
van Doorslaer, Eddy & Wagstaff, Adam & van der Burg, Hattem & Christiansen, Terkel & De Graeve, Diana & Duchesne, Inge & Gerdtham, Ulf-G & Gerfin, Michael & Geurts, Jose & Gross, Lorna, 2000. "Equity in the delivery of health care in Europe and the US," Journal of Health Economics, Elsevier, vol. 19(5), pages 553-583, September.
Galina Besstremyannaya, 2012. "Estimating income equity in social health insurance system," Working Papers w0172, New Economic School (NES).
Anirban Basu & Willard G. Manning, 2010. "Estimating lifetime or episode‐of‐illness costs under censoring," Health Economics, John Wiley & Sons, Ltd., vol. 19(9), pages 1010-1028, September.
Hao Yu, 2017. "China’s medical savings accounts: an analysis of the price elasticity of demand for health care," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 18(6), pages 773-785, July.
Liu, Lei & Conaway, Mark R. & Knaus, William A. & Bergin, James D., 2008. "A random effects four-part model, with application to correlated medical costs," Computational Statistics & Data Analysis, Elsevier, vol. 52(9), pages 4458-4473, May.
Basu, A & Polsky, D & Manning, W G, 2008. "Use of propensity scores in non-linear response models: The case for health care expenditures," Health, Econometrics and Data Group (HEDG) Working Papers 08/11, HEDG, c/o Department of Economics, University of York.
- Anirban Basu & Daniel Polsky & Willard G. Manning, 2008. "Use of Propensity Scores in Non-Linear Response Models: The Case for Health Care Expenditures," NBER Working Papers 14086, National Bureau of Economic Research, Inc.
Andreas Bayerstadler & Franz Benstetter & Christian Heumann & Fabian Winter, 2014. "A predictive modeling approach to increasing the economic effectiveness of disease management programs," Health Care Management Science, Springer, vol. 17(3), pages 284-301, September.
Manos Matsaganis & Theodore Mitrakos & Panos Tsakloglou, 2008. "Modelling Household Expenditure on Health Care in Greece," Working Papers 68, Bank of Greece.
Farrell, Susan & Manning, Willard G. & Finch, Michael D., 2003. "Alcohol dependence and the price of alcoholic beverages," Journal of Health Economics, Elsevier, vol. 22(1), pages 117-147, January.
Kathleen Carey & Theodore Stefos, 2011. "Measuring the cost of hospital adverse patient safety events," Health Economics, John Wiley & Sons, Ltd., vol. 20(12), pages 1417-1430, December.
Besstremyannaya, Galina, 2017. "Measuring income equity in the demand for healthcare with finite mixture models," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 46, pages 5-29.

More about this item

JEL classification:

C2 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables
I1 - Health, Education, and Welfare - - Health

NEP fields

This paper has been announced in the following NEP Reports:

NEP-ECM-1999-11-28 (Econometrics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberte:0246. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Estimating Log Models: To Transform or Not to Transform?

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Most related items

More about this item

JEL classification:

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data