
To Smooth or Not to Smooth? The Case of Discrete Variables in Nonparametric Regressions

Abstract

In a seminal paper, Racine and Li (Journal of Econometrics, 2004) introduce a tool that admits discrete and categorical variables as regressors in nonparametric regressions. The method is similar to the smoothing techniques for continuous regressors but uses discrete kernels. In the literature, it is generally accepted that it is always better to smooth the discrete variables. In this paper we investigate a potential problem with bandwidth selection for the continuous variable that arises from the presence of the discrete variables. We find that in some cases the performance of the resulting regression estimates may deteriorate when the discrete variables are smoothed in the way addressed so far in the literature, and that a fully separate estimation (without any smoothing of the discrete variable) may provide significantly better results; we explain why this may happen. Having posed the problem, we then suggest how to use the Racine and Li approach to overcome these difficulties and to provide estimates with better performance. We investigate, through some simulated data sets and more extensive Monte Carlo experiments, the performance of all the proposed approaches, and we find that, as expected, our suggested approach performs best. We also briefly illustrate the consequences of these issues for the estimation of the derivatives of the regression. Finally, we exemplify the phenomenon with an empirical illustration. Our main objective is to warn practitioners of the potential problems posed by smoothing discrete variables with the software available so far and to suggest a safer approach to implementing the procedure.
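
The contrast between smoothing and not smoothing a discrete regressor can be made concrete with a minimal sketch. It assumes a local-constant (Nadaraya-Watson) estimator with a Gaussian kernel for the continuous regressor and the unordered discrete kernel that gives weight 1 to observations in the same category and weight lam to all others, so that lam = 0 corresponds to fully separate estimation and lam = 1 to pooling the groups. The function name nw_mixed, the simulated data, and the fixed values of h and lam are hypothetical choices made for illustration only; this is not the authors' simulation design, their suggested approach, or a data-driven bandwidth selection.

```python
import numpy as np

def nw_mixed(x0, z0, x, z, y, h, lam):
    """Local-constant estimate of E[y | x = x0, z = z0] with one
    continuous regressor x and one unordered discrete regressor z."""
    # Gaussian kernel weights for the continuous regressor (assumed kernel)
    k_cont = np.exp(-0.5 * ((x - x0) / h) ** 2)
    # Discrete kernel: weight 1 in the same category, lam in any other
    k_disc = np.where(z == z0, 1.0, lam)
    w = k_cont * k_disc
    return np.sum(w * y) / np.sum(w)

# Hypothetical data: two groups with very different regression functions
rng = np.random.default_rng(0)
n = 400
x = rng.uniform(0.0, 1.0, n)
z = rng.integers(0, 2, n)
y = np.where(z == 0, np.sin(2 * np.pi * x), 3.0 + x) + rng.normal(0.0, 0.1, n)

# Estimate the regression for group z = 1 at x = 0.5 (true value 3.5)
separate = nw_mixed(0.5, 1, x, z, y, h=0.05, lam=0.0)  # no discrete smoothing
smoothed = nw_mixed(0.5, 1, x, z, y, h=0.05, lam=0.5)  # partial smoothing
pooled = nw_mixed(0.5, 1, x, z, y, h=0.05, lam=1.0)    # z ignored entirely

print(separate, smoothed, pooled)
```

In this constructed example the two groups differ sharply, so any positive lam pulls the estimate for group 1 toward the regression function of group 0. When the groups are similar, borrowing observations across categories can instead reduce variance, which is the usual argument for smoothing.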

Suggested Citation

  • Valentin Zelenyuk & Leopold Simar, 2011. "To Smooth or Not to Smooth? The Case of Discrete Variables in Nonparametric Regressions," CEPA Working Papers Series WP102011, School of Economics, University of Queensland, Australia.
  • Handle: RePEc:qld:uqcepa:76

    Download full text from publisher

    File URL: https://economics.uq.edu.au/files/5190/WP102011.pdf
    Download Restriction: no

    References listed on IDEAS

    1. Maasoumi, Esfandiar & Racine, Jeff & Stengos, Thanasis, 2007. "Growth and convergence: A profile of distribution dynamics and mobility," Journal of Econometrics, Elsevier, vol. 136(2), pages 483-508, February.
    2. Leopold Simar & Valentin Zelenyuk, 2006. "On Testing Equality of Distributions of Technical Efficiency Scores," Econometric Reviews, Taylor & Francis Journals, vol. 25(4), pages 497-522.
    3. Pagan, Adrian & Ullah, Aman, 1999. "Nonparametric Econometrics," Cambridge Books, Cambridge University Press, number 9780521355643.
    4. Peter Hall & Qi Li & Jeffrey S. Racine, 2007. "Nonparametric Estimation of Regression Functions in the Presence of Irrelevant Regressors," The Review of Economics and Statistics, MIT Press, vol. 89(4), pages 784-789, November.
    5. W. Walls, 2009. "Screen wars, star wars, and sequels," Empirical Economics, Springer, vol. 37(2), pages 447-461, October.
    6. T. Stengos & E. Zacharias, 2006. "Intertemporal pricing and price discrimination: a semiparametric hedonic analysis of the personal computer market," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(3), pages 371-386, April.
    7. Racine, Jeff & Li, Qi, 2004. "Nonparametric estimation of regression functions with both categorical and continuous data," Journal of Econometrics, Elsevier, vol. 119(1), pages 99-130, March.
    8. Park, Byeong U. & Simar, Leopold & Zelenyuk, Valentin, 2010. "Local maximum likelihood techniques with categorical data," LIDAM Discussion Papers ISBA 2010052, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    9. Li, Qi & Racine, Jeff, 2003. "Nonparametric estimation of distributions with categorical and continuous data," Journal of Multivariate Analysis, Elsevier, vol. 86(2), pages 266-292, August.
    10. Park, Byeong U. & Simar, Léopold & Zelenyuk, Valentin, 2008. "Local likelihood estimation of truncated regression and its partial derivatives: Theory and application," Journal of Econometrics, Elsevier, vol. 146(1), pages 185-198, September.
    11. Daniel J. Henderson & Valentin Zelenyuk, 2007. "Testing for (Efficiency) Catching-up," Southern Economic Journal, John Wiley & Sons, vol. 73(4), pages 1003-1019, April.
    12. Hsiao, Cheng & Li, Qi & Racine, Jeffrey S., 2007. "A consistent model specification test with mixed discrete and continuous data," Journal of Econometrics, Elsevier, vol. 140(2), pages 802-826, October.
    13. Jeffrey Racine & Jeffrey Hart & Qi Li, 2006. "Testing the Significance of Categorical Predictor Variables in Nonparametric Regression Models," Econometric Reviews, Taylor & Francis Journals, vol. 25(4), pages 523-544.
    14. Li, Qi & Racine, Jeffrey S. & Wooldridge, Jeffrey M., 2009. "Efficient Estimation of Average Treatment Effects with Mixed Categorical and Continuous Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 27(2), pages 206-223.
    15. Ozkan Eren & Daniel J. Henderson, 2008. "The impact of homework on student achievement," Econometrics Journal, Royal Economic Society, vol. 11(2), pages 326-348, July.
    16. Daniel J. Henderson, 2010. "A test for multimodality of regression derivatives with application to nonparametric growth regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(3), pages 458-480.
    17. Peter Hall & Jeff Racine & Qi Li, 2004. "Cross-Validation and the Estimation of Conditional Probability Densities," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 1015-1026, December.
    18. Daniel J. Henderson & Christopher F. Parmeter & Subal C. Kumbhakar, 2007. "Nonparametric estimation of a hedonic price function," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(3), pages 695-699.
    19. Oleg Badunenko & Daniel J. Henderson & Valentin Zelenyuk, 2008. "Technological Change and Transition: Relative Contributions to Worldwide Growth During the 1990s," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 70(4), pages 461-492, August.
    20. Li, Qi & Racine, Jeffrey S, 2008. "Nonparametric Estimation of Conditional CDF and Quantile Functions With Mixed Categorical and Continuous Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 26, pages 423-434.
    21. Subodh Kumar & R. Robert Russell, 2002. "Technological Change, Technological Catch-up, and Capital Deepening: Relative Contributions to Growth and Convergence," American Economic Review, American Economic Association, vol. 92(3), pages 527-548, June.
    22. Daniel J. Henderson & R. Robert Russell, 2005. "Human Capital And Convergence: A Production-Frontier Approach," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 46(4), pages 1167-1205, November.
    23. Valentina Hartarska & Christopher F. Parmeter & Denis Nadolnyak, 2010. "Economies of Scope of Lending and Mobilizing Deposits in Microfinance Institutions: A Semiparametric Analysis," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 93(2), pages 389-398.
    24. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    25. Gozalo, Pedro & Linton, Oliver, 2000. "Local nonlinear least squares: Using parametric information in nonparametric regression," Journal of Econometrics, Elsevier, vol. 99(1), pages 63-106, November.

    Citations


    Cited by:

    1. Li, Degui & Simar, Léopold & Zelenyuk, Valentin, 2016. "Generalized nonparametric smoothing with mixed discrete and continuous data," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 424-444.
    2. Jeffrey S. Racine, 2016. "A Correction to "Generalized Nonparametric Smoothing with Mixed Discrete and Continuous Data" by Li, Simar & Zelenyuk (2014, CSDA)," Department of Economics Working Papers 2016-01, McMaster University.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Degui & Simar, Léopold & Zelenyuk, Valentin, 2016. "Generalized nonparametric smoothing with mixed discrete and continuous data," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 424-444.
    2. Léopold Simar & Ingrid Keilegom & Valentin Zelenyuk, 2017. "Nonparametric least squares methods for stochastic frontier models," Journal of Productivity Analysis, Springer, vol. 47(3), pages 189-204, June.
    3. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    4. Michael S. Delgado & Daniel J. Henderson & Christopher F. Parmeter, 2014. "Does Education Matter for Economic Growth?," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 76(3), pages 334-359, June.
    5. Haupt, Harry & Schnurbus, Joachim & Semmler, Willi, 2018. "Estimation of grouped, time-varying convergence in economic growth," Econometrics and Statistics, Elsevier, vol. 8(C), pages 141-158.
    6. Daniel J. Henderson, 2010. "A test for multimodality of regression derivatives with application to nonparametric growth regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(3), pages 458-480.
    7. Byeong Park & Léopold Simar & Valentin Zelenyuk, 2015. "Categorical data in local maximum likelihood: theory and applications to productivity analysis," Journal of Productivity Analysis, Springer, vol. 43(2), pages 199-214, April.
    8. Hayfield, Tristen & Racine, Jeffrey S., 2008. "Nonparametric Econometrics: The np Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 27(i05).
    9. Henderson, Daniel J. & Papageorgiou, Chris & Parmeter, Christopher F., 2013. "Who benefits from financial development? New methods, new evidence," European Economic Review, Elsevier, vol. 63(C), pages 47-67.
    10. Daniel J. Henderson, 2009. "A Non‐parametric Examination of Capital–Skill Complementarity," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 71(4), pages 519-538, August.
    11. Nuno Baetas da Silva & João Sousa Andrade, 2016. "The relationship between social transfers and poverty reduction: A nonparametric approach for the EU-27," GEMF Working Papers 2016-09, GEMF, Faculty of Economics, University of Coimbra.
    12. Camilla Mastromarco & Léopold Simar, 2015. "Effect of FDI and Time on Catching Up: New Insights from a Conditional Nonparametric Frontier Analysis," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(5), pages 826-847, August.
    13. Haupt, Harry & Meier, Verena, 2016. "Dealing with heterogeneity, nonlinearity and club misclassification in growth convergence: A nonparametric two-step approach," Center for Mathematical Economics Working Papers 455, Center for Mathematical Economics, Bielefeld University.
    14. Valentin Zelenyuk, 2014. "Testing Significance of Contributions in Growth Accounting, with Application to Testing ICT Impact on Labor Productivity of Developed Countries," International Journal of Business and Economics, School of Management Development, Feng Chia University, Taichung, Taiwan, vol. 13(2), pages 115-126, December.
    15. Zongwu Cai & Qi Li, 2013. "Some Recent Developments on Nonparametric Econometrics," Working Papers 2013-10-14, Wang Yanan Institute for Studies in Economics (WISE), Xiamen University.
    16. Jeffrey Racine, 2008. "Nonparametric econometrics: a primer (in Russian)," Quantile, Quantile, issue 4, pages 7-56, March.
    17. Nolwenn Roudaut & Anne Vanhems, 2012. "Explaining firms efficiency in the Ivorian manufacturing sector: a robust nonparametric approach," Journal of Productivity Analysis, Springer, vol. 37(2), pages 155-169, April.
    18. Arribas Ivan & Perez Francisco & Tortosa-Ausina Emili, 2010. "The Determinants of International Financial Integration Revisited: The Role of Networks and Geographic Neutrality," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 15(1), pages 1-55, December.
    19. Daniel J. Henderson & Alexandre Olbrecht & Solomon W. Polachek, 2006. "Do Former College Athletes Earn More at Work?: A Nonparametric Assessment," Journal of Human Resources, University of Wisconsin Press, vol. 41(3).
