IDEAS home Printed from https://ideas.repec.org/p/iza/izadps/dp7003.html
   My bibliography  Save this paper

A Flexible Sample Selection Model: A GTL-Copula Approach

Author

Listed:
  • Hasebe, Takuya

    (Sophia University)

  • Vijverberg, Wim P.

    (CUNY Graduate Center)

Abstract

In this paper, we propose a new approach to estimating sample selection models that combines Generalized Tukey Lambda (GTL) distributions with copulas. The GTL distribution is a versatile univariate distribution that permits a wide range of skewness and thick- or thin-tailed behavior in the data that it represents. Copulas help create versatile representations of bivariate distribution. The versatility arising from inserting GTL marginal distributions into copula-constructed bivariate distributions reduces the dependence of estimated parameters on distributional assumptions in applied research. A thorough Monte Carlo study illustrates that our proposed estimator performs well under normal and nonnormal settings, both with and without an instrument in the selection equation that fulfills the exclusion restriction that is often considered to be a requisite for implementation of sample selection models in empirical research. Five applications ranging from wages and health expenditures to speeding tickets and international disputes illustrate the value of the proposed GTL-copula estimator.

Suggested Citation

  • Hasebe, Takuya & Vijverberg, Wim P., 2012. "A Flexible Sample Selection Model: A GTL-Copula Approach," IZA Discussion Papers 7003, Institute of Labor Economics (IZA).
  • Handle: RePEc:iza:izadps:dp7003
    as

    Download full text from publisher

    File URL: https://docs.iza.org/dp7003.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Amos Golan & Enrico Moretti & Jeffrey M.Perloff, 2004. "A Small-Sample Estimator for the Sample-Selection Model," Econometric Reviews, Taylor & Francis Journals, vol. 23(1), pages 71-91.
    2. Gianna Boero & Jeremy Smith & Kenneth Wallis, 2005. "The Sensitivity of Chi-Squared Goodness-of-Fit Tests to the Partitioning of Data," Econometric Reviews, Taylor & Francis Journals, vol. 23(4), pages 341-370.
    3. Trivedi, Pravin K. & Zimmer, David M., 2007. "Copula Modeling: An Introduction for Practitioners," Foundations and Trends(R) in Econometrics, now publishers, vol. 1(1), pages 1-111, April.
    4. Deb, Partha & Trivedi, Pravin K., 2002. "The structure of demand for health care: latent class versus two-part models," Journal of Health Economics, Elsevier, vol. 21(4), pages 601-625, July.
    5. Diane Dancer & Anu Rammohan & Murray D. Smith, 2008. "Infant mortality and child nutrition in Bangladesh," Health Economics, John Wiley & Sons, Ltd., vol. 17(9), pages 1015-1035, September.
    6. Orazio P. Attanasio & Costas Meghir & Ana Santiago, 2012. "Education Choices in Mexico: Using a Structural Model and a Randomized Experiment to Evaluate PROGRESA," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(1), pages 37-66.
    7. Patrick Puhani, 2000. "The Heckman Correction for Sample Selection and Its Critique," Journal of Economic Surveys, Wiley Blackwell, vol. 14(1), pages 53-68, February.
    8. Lee, Lung-Fei, 1983. "Generalized Econometric Models with Selectivity," Econometrica, Econometric Society, vol. 51(2), pages 507-512, March.
    9. Prokhorov, Artem & Schmidt, Peter, 2009. "Likelihood-based estimation in a panel setting: Robustness, redundancy and validity of copulas," Journal of Econometrics, Elsevier, vol. 153(1), pages 93-104, November.
    10. Lung-Fei Lee, 1982. "Some Approaches to the Correction of Selectivity Bias," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 49(3), pages 355-372.
    11. Jose-Mari Sarabia, 1997. "A hierarchy of lorenz curves based on the generalized tukey's lambda distribution," Econometric Reviews, Taylor & Francis Journals, vol. 16(3), pages 305-320.
    12. Su, Steve, 2007. "Numerical maximum log likelihood estimation for generalized lambda distributions," Computational Statistics & Data Analysis, Elsevier, vol. 51(8), pages 3983-3998, May.
    13. A. D. Roy, 1951. "Some Thoughts On The Distribution Of Earnings," Oxford Economic Papers, Oxford University Press, vol. 3(2), pages 135-146.
    14. Zimmer, David M. & Trivedi, Pravin K., 2006. "Using Trivariate Copulas to Model Sample Selection and Treatment Effects: Application to Family Health Care Demand," Journal of Business & Economic Statistics, American Statistical Association, vol. 24, pages 63-76, January.
    15. Daryl Pregibon, 1980. "Goodness of Link Tests for Generalized Linear Models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(1), pages 15-24, March.
    16. Erik Meijer & Tom Wansbeek, 2007. "The Sample Selection Model from a Method of Moments Perspective," Econometric Reviews, Taylor & Francis Journals, vol. 26(1), pages 25-51.
    17. Rainer Winkelmann, 2012. "Copula Bivariate Probit Models: With An Application To Medical Expenditures," Health Economics, John Wiley & Sons, Ltd., vol. 21(12), pages 1444-1455, December.
    18. Vijverberg, Chu-Ping C. & Vijverberg, Wim P., 2012. "Pregibit: A Family of Discrete Choice Models," IZA Discussion Papers 6359, Institute of Labor Economics (IZA).
    19. Michael D. Makowsky & Thomas Stratmann, 2009. "Political Economy at Any Speed: What Determines Traffic Citations?," American Economic Review, American Economic Association, vol. 99(1), pages 509-527, March.
    20. Olsen, Randall J, 1980. "A Least Squares Correction for Selectivity Bias," Econometrica, Econometric Society, vol. 48(7), pages 1815-1820, November.
    21. Murray D. Smith, 2003. "Modelling sample selection using Archimedean copulas," Econometrics Journal, Royal Economic Society, vol. 6(1), pages 99-123, June.
    22. Vuong, Quang H, 1989. "Likelihood Ratio Tests for Model Selection and Non-nested Hypotheses," Econometrica, Econometric Society, vol. 57(2), pages 307-333, March.
    23. Whitney K. Newey, 2009. "Two-step series estimation of sample selection models," Econometrics Journal, Royal Economic Society, vol. 12(s1), pages 217-229, January.
    24. A. Colin Cameron & Tong Li & Pravin K. Trivedi & David M. Zimmer, 2004. "Modelling the differences in counted outcomes using bivariate copula models with application to mismeasured counts," Econometrics Journal, Royal Economic Society, vol. 7(2), pages 566-584, December.
    25. Koenker, Roger & Yoon, Jungmo, 2009. "Parametric links for binary choice models: A Fisherian-Bayesian colloquy," Journal of Econometrics, Elsevier, vol. 152(2), pages 120-130, October.
    26. Francis Vella, 1998. "Estimating Models with Sample Selection Bias: A Survey," Journal of Human Resources, University of Wisconsin Press, vol. 33(1), pages 127-169.
    27. Yen, Steven T. & Yuan, Yan & Liu, Xiaowen, 2009. "Alcohol consumption by men in China: A non-Gaussian censored system approach," China Economic Review, Elsevier, vol. 20(2), pages 162-173, June.
    28. James Heckman & Justin L. Tobias & Edward Vytlacil, 2003. "Simple Estimators for Treatment Parameters in a Latent-Variable Framework," The Review of Economics and Statistics, MIT Press, vol. 85(3), pages 748-755, August.
    29. Cameron,A. Colin & Trivedi,Pravin K., 2005. "Microeconometrics," Cambridge Books, Cambridge University Press, number 9780521848053.
    30. Mitali Das & Whitney K. Newey & Francis Vella, 2003. "Nonparametric Estimation of Sample Selection Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 70(1), pages 33-58.
    31. Lee, Lung-Fei, 1978. "Unionism and Wage Rates: A Simultaneous Equations Model with Qualitative and Limited Dependent Variables," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 19(2), pages 415-433, June.
    32. Maria Fraga O. Martins, 2001. "Parametric and semiparametric estimation of sample selection models: an empirical application to the female labour force in Portugal," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 16(1), pages 23-39.
    33. Gourieroux, Christian & Holly, Alberto & Monfort, Alain, 1982. "Likelihood Ratio Test, Wald Test, and Kuhn-Tucker Test in Linear Models with Inequality Constraints on the Regression Parameters," Econometrica, Econometric Society, vol. 50(1), pages 63-80, January.
    34. Ahn, Hyungtaik & Powell, James L., 1993. "Semiparametric estimation of censored selection models with a nonparametric selection mechanism," Journal of Econometrics, Elsevier, vol. 58(1-2), pages 3-29, July.
    35. Vijverberg, Wim P. & Hasebe, Takuya, 2015. "GTL Regression: A Linear Model with Skewed and Thick-Tailed Disturbances," IZA Discussion Papers 8898, Institute of Labor Economics (IZA).
    36. Heckman, James J, 1974. "Shadow Prices, Market Wages, and Labor Supply," Econometrica, Econometric Society, vol. 42(4), pages 679-694, July.
    37. Zuehlke, Thomas W & Zeman, Allen R, 1991. "A Comparison of Two-Stage Estimators of Censored Regression Models," The Review of Economics and Statistics, MIT Press, vol. 73(1), pages 185-188, February.
    38. Margarita Genius & Elisabetta Strazzera, 2008. "Applying the copula approach to sample selection modelling," Applied Economics, Taylor & Francis Journals, vol. 40(11), pages 1443-1455.
    39. Murray D. Smith, 2005. "Using Copulas to Model Switching Regimes with an Application to Child Labour," The Economic Record, The Economic Society of Australia, vol. 81(s1), pages 47-57, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Juliana D. Araujo & Povilas Lastauskas & Chris Papageorgiou, 2017. "Evolution of Bilateral Capital Flows to Developing Countries at Intensive and Extensive Margins," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 49(7), pages 1517-1554, October.
    2. Pablo Mitnik & David Grusky, 2018. "The Intergenerational Elasticity of What? The Case for Redefining the Workhorse Measure of Economic Mobility," Working Papers 2018-043, Human Capital and Economic Opportunity Working Group.
    3. Marra, Giampiero & Wyszynski, Karol, 2016. "Semi-parametric copula sample selection models for count responses," Computational Statistics & Data Analysis, Elsevier, vol. 104(C), pages 110-129.
    4. Pigini Claudia, 2015. "Bivariate Non-Normality in the Sample Selection Model," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 1-22, January.
    5. Karol Wyszynski & Giampiero Marra, 2018. "Sample selection models for count data in R," Computational Statistics, Springer, vol. 33(3), pages 1385-1412, September.
    6. Wojtyś, Magorzata & Marra, Giampiero & Radice, Rosalba, 2016. "Copula Regression Spline Sample Selection Models: The R Package SemiParSampleSel," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 71(i06).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bhat, Chandra R. & Eluru, Naveen, 2009. "A copula-based approach to accommodate residential self-selection effects in travel behavior modeling," Transportation Research Part B: Methodological, Elsevier, vol. 43(7), pages 749-765, August.
    2. Claudia PIGINI, 2012. "Of Butterflies and Caterpillars: Bivariate Normality in the Sample Selection Model," Working Papers 377, Universita' Politecnica delle Marche (I), Dipartimento di Scienze Economiche e Sociali.
    3. Schwiebert, Jörg, 2012. "Analyzing the Composition of the Female Workforce - A Semiparametric Copula Approach," Hannover Economic Papers (HEP) dp-503, Leibniz Universität Hannover, Wirtschaftswissenschaftliche Fakultät.
    4. Liu, Ruixuan & Yu, Zhengfei, 2022. "Sample selection models with monotone control functions," Journal of Econometrics, Elsevier, vol. 226(2), pages 321-342.
    5. Yulia V. Marchenko & Marc G. Genton, 2012. "A Heckman Selection- t Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 304-317, March.
    6. Jörg Schwiebert, 2016. "Multinomial choice models based on Archimedean copulas," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 100(3), pages 333-354, July.
    7. Jacopo Mazza, 2012. "Does Risk Matter? A Semi-parametric Model for Educational Choices in the Presence of Uncertainty," Economics Discussion Paper Series 1225, Economics, The University of Manchester.
    8. Chen, Heng & Fan, Yanqin & Wu, Jisong, 2014. "A flexible parametric approach for estimating switching regime models and treatment effect parameters," Journal of Econometrics, Elsevier, vol. 181(2), pages 77-91.
    9. Margarita Genius & Elisabetta Strazzera, 2008. "Applying the copula approach to sample selection modelling," Applied Economics, Taylor & Francis Journals, vol. 40(11), pages 1443-1455.
    10. Pigini Claudia, 2015. "Bivariate Non-Normality in the Sample Selection Model," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 1-22, January.
    11. Joo, Joonhwi & LaLonde, Robert J., 2014. "Testing for Selection Bias," IZA Discussion Papers 8455, Institute of Labor Economics (IZA).
    12. Wiemann, Paul F.V. & Klein, Nadja & Kneib, Thomas, 2022. "Correcting for sample selection bias in Bayesian distributional regression models," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    13. Mikhail Zhelonkin & Marc G. Genton & Elvezio Ronchetti, 2016. "Robust inference in sample selection models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(4), pages 805-827, September.
    14. James J. Heckman, 2005. "Micro Data, Heterogeneity and the Evaluation of Public Policy Part 2," The American Economist, Sage Publications, vol. 49(1), pages 16-44, March.
    15. Rainer Winkelmann, 2012. "Copula Bivariate Probit Models: With An Application To Medical Expenditures," Health Economics, John Wiley & Sons, Ltd., vol. 21(12), pages 1444-1455, December.
    16. Victor Chernozhukov & Ivan Fernandez-Val & Siyi Luo, 2018. "Distribution regression with sample selection, with an application to wage decompositions in the UK," CeMMAP working papers CWP68/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    17. Elisabetta Strazzera & Margarita Genius, 2004. "The Copula Approach to Sample Selection Modelling: An Application to the Recreational Value of Forests," Working Papers 2004.73, Fondazione Eni Enrico Mattei.
    18. M. Genius & E. Strazzera, 2003. "The copula approach of sampling selection modelling: an application to the recreational value of forests," Working Paper CRENoS 200308, Centre for North South Economic Research, University of Cagliari and Sassari, Sardinia.
    19. Rainer Winkelmann, 2009. "Copula-based bivariate binary response models," SOI - Working Papers 0913, Socioeconomic Institute - University of Zurich.
    20. Victor Chernozhukov & Ivan Fernandez-Val & Siyi Luo, 2023. "Distribution regression with sample selection and UK wage decomposition," CeMMAP working papers 09/23, Institute for Fiscal Studies.

    More about this item

    Keywords

    sample selection; copula; Generalized Tukey Lambda distribution;
    All these keywords.

    JEL classification:

    • C24 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Truncated and Censored Models; Switching Regression Models; Threshold Regression Models
    • C35 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iza:izadps:dp7003. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Holger Hinte (email available below). General contact details of provider: https://edirc.repec.org/data/izaaade.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.