IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v027i07.html
   My bibliography  Save this article

Sample Selection Models in R: Package sampleSelection

Author

Listed:
  • Toomet, Ott
  • Henningsen, Arne

Abstract

This paper describes the implementation of Heckman-type sample selection models in R. We discuss the sample selection problem as well as the Heckman solution to it, and argue that although modern econometrics has non- and semiparametric estimation methods in its toolbox, Heckman models are an integral part of the modern applied analysis and econometrics syllabus. We describe the implementation of these models in the package sampleSelection and illustrate the usage of the package on several simulation and real data examples. Our examples demonstrate the effect of exclusion restrictions, identification at infinity and misspecification. We argue that the package can be used both in applied research and teaching.

Suggested Citation

  • Toomet, Ott & Henningsen, Arne, 2008. "Sample Selection Models in R: Package sampleSelection," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 27(i07).
  • Handle: RePEc:jss:jstsof:v:027:i07
    DOI: http://hdl.handle.net/10.18637/jss.v027.i07
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v027i07/v27i07.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v027i07/sampleSelection_0.6-0.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v027i07/v27i07.R.zip
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v027.i07?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. James J. Heckman, 1976. "The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models," NBER Chapters, in: Annals of Economic and Social Measurement, Volume 5, number 4, pages 475-492, National Bureau of Economic Research, Inc.
    2. Pagan,Adrian & Ullah,Aman, 1999. "Nonparametric Econometrics," Cambridge Books, Cambridge University Press, number 9780521355643.
    3. Leung, Siu Fai & Yu, Shihti, 1996. "On the choice between sample selection and two-part models," Journal of Econometrics, Elsevier, vol. 72(1-2), pages 197-229.
    4. Calzolari, Giorgio & Fiorentini, Gabriele, 1993. "Alternative covariance estimators of the standard Tobit model," Economics Letters, Elsevier, vol. 42(1), pages 5-13.
    5. Ernst R. Berndt & Bronwyn H. Hall & Robert E. Hall & Jerry A. Hausman, 1974. "Estimation and Inference in Nonlinear Structural Models," NBER Chapters, in: Annals of Economic and Social Measurement, Volume 3, number 4, pages 653-665, National Bureau of Economic Research, Inc.
    6. Heckman, James J. & Macurdy, Thomas E., 1986. "Labor econometrics," Handbook of Econometrics, in: Z. Griliches† & M. D. Intriligator (ed.), Handbook of Econometrics, edition 1, volume 3, chapter 32, pages 1917-1977, Elsevier.
    7. Cameron,A. Colin & Trivedi,Pravin K., 2005. "Microeconometrics," Cambridge Books, Cambridge University Press, number 9780521848053.
    8. Amemiya, Takeshi, 1984. "Tobit models: A survey," Journal of Econometrics, Elsevier, vol. 24(1-2), pages 3-61.
    9. Chamberlain, Gary, 1986. "Asymptotic efficiency in semi-parametric models with censoring," Journal of Econometrics, Elsevier, vol. 32(2), pages 189-218, July.
    10. Henningsen, Arne & Hamann, Jeff D., 2007. "systemfit: A Package for Estimating Systems of Simultaneous Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i04).
    11. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
    12. Mroz, Thomas A, 1987. "The Sensitivity of an Empirical Model of Married Women's Hours of Work to Economic and Statistical Assumptions," Econometrica, Econometric Society, vol. 55(4), pages 765-799, July.
    13. Greene, William H, 1981. "Sample Selection Bias as a Specification Error: Comment," Econometrica, Econometric Society, vol. 49(3), pages 795-798, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. William E. Allen & Han Altae-Tran & James Briggs & Xin Jin & Glen McGee & Andy Shi & Rumya Raghavan & Mireille Kamariza & Nicole Nova & Albert Pereta & Chris Danford & Amine Kamel & Patrik Gothe & Evr, 2020. "Population-scale longitudinal mapping of COVID-19 symptoms, behaviour and testing," Nature Human Behaviour, Nature, vol. 4(9), pages 972-982, September.
    2. Marra Giampiero & Radice Rosalba, 2017. "A joint regression modeling framework for analyzing bivariate binary data in R," Dependence Modeling, De Gruyter, vol. 5(1), pages 268-294, December.
    3. Monica Novackova & Richard S.J. Tol, 2018. "Climate Change Awareness and Willingness to Pay for its Mitigation: Evidence from the UK," Working Paper Series 0318, Department of Economics, University of Sussex Business School.
    4. Tibamanya, Felister Y. & Henningsen, Arne & Milanzi, Mursali A., 2021. "Adoption of Improved Sunflower Varieties Among Smallholder Farmers in Singida, Tanzania: The Double-Hurdle Approach," 2021 Conference, August 17-31, 2021, Virtual 315262, International Association of Agricultural Economists.
    5. Marra, Giampiero & Radice, Rosalba, 2013. "Estimation of a regression spline sample selection model," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 158-173.
    6. Bocart, Fabian & Oosterlinck, Kim, 2011. "Discoveries of fakes: Their impact on the art market," Economics Letters, Elsevier, vol. 113(2), pages 124-126.
    7. Romain Aeberhardt & Denis Fougère & Julien Pouget & Roland Rathelot, 2010. "Wages and employment of French workers with African origin," Journal of Population Economics, Springer;European Society for Population Economics, vol. 23(3), pages 881-905, June.
    8. Nicolai Fink Simonsen & Anne Sophie Oxholm & Søren Rud Kristensen & Luigi Siciliani, 2020. "What explains differences in waiting times for health care across socioeconomic status?," Health Economics, John Wiley & Sons, Ltd., vol. 29(12), pages 1764-1785, December.
    9. Lim-Wavde, Kustini & Kauffman, Robert J. & Dawson, Gregory S., 2017. "Household informedness and policy analytics for the collection and recycling of household hazardous waste in California," Resources, Conservation & Recycling, Elsevier, vol. 120(C), pages 88-107.
    10. Mikhail Zhelonkin & Marc G. Genton & Elvezio Ronchetti, 2016. "Robust inference in sample selection models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(4), pages 805-827, September.
    11. Bastian, Anne & Börjesson, Maria, 2015. "Peak Car for urban Swedish men?," Working papers in Transport Economics 2015:9, CTS - Centre for Transport Studies Stockholm (KTH and VTI).
    12. Hatice Parlakçı Doğan & Mustafa Hakkı Aydoğdu & Mehmet Reşit Sevinç & Mehmet Cançelik, 2020. "Farmers’ Willingness to Pay for Services to Ensure Sustainable Agricultural Income in the GAP-Harran Plain, Şanlıurfa, Turkey," Agriculture, MDPI, vol. 10(5), pages 1-16, May.
    13. Meyer, Maximilian & Hulke, Carolin & Kamwi, Jonathan & Kolem, Hannah & Börner, Jan, 2021. "Spatial Determinants of Collective Resource Management and Environmental Dependency: Evidence from Namibia," 2021 Conference, August 17-31, 2021, Virtual 315018, International Association of Agricultural Economists.
    14. Perry, Agnieszka Dobrowolska & Brown, Scott, 2021. "Does Dairy and Meat Demand Change over Time? Comparison of Aids Demand System from Two Time Periods," 2021 Conference, August 17-31, 2021, Virtual 315921, International Association of Agricultural Economists.
    15. Varghese, Varun & Jana, Arnab, 2019. "Interrelationships between ICT, social disadvantage, and activity participation behaviour: A case of Mumbai, India," Transportation Research Part A: Policy and Practice, Elsevier, vol. 125(C), pages 248-267.
    16. Claire Burch & Michelle Busch & Edward Higgins & Steven Bittner & Nuwanthika Perera & Kevin Neal & Lawrence Burkett & Antonio J. Castro & Christopher Anderson, 2020. "Revisiting a Water Conflict in Southeastern Oklahoma 6 Years Later: A New Valuation of the Willingness to Pay for Ecosystem Services," Sustainability, MDPI, vol. 12(3), pages 1-30, January.
    17. Serge Atherwood & Corey S Sparks, 2019. "Early-career trajectories of young workers in the U.S. in the context of the 2008–09 recession: The effect of labor market entry timing," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-30, March.
    18. Zhao, Jun & Kim, Hea-Jung & Kim, Hyoung-Moon, 2020. "New EM-type algorithms for the Heckman selection model," Computational Statistics & Data Analysis, Elsevier, vol. 146(C).
    19. Natale, Fabrizio & Borrello, Alessandra & Motova, Arina, 2015. "Analysis of the determinants of international seafood trade using a gravity model," Marine Policy, Elsevier, vol. 60(C), pages 98-106.
    20. Matthew C. Chang & Chih‐Ling Tsai & Rebecca Chung‐Fern Wu & Ning Zhu, 2018. "Market uncertainty and market orders in futures markets," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 38(8), pages 865-880, August.
    21. Steven R. Gehrke & Timothy F. Welch, 2017. "The built environment determinants of activity participation and walking near the workplace," Transportation, Springer, vol. 44(5), pages 941-956, September.
    22. Mengistu Assefa Wendimu & Peter Gibbon, 2014. "Labour markets for irrigated agriculture in central Ethiopia: Wage premiums and segmentation," IFRO Working Paper 2014/06, University of Copenhagen, Department of Food and Resource Economics.
    23. Biørn, Erik & Wangen, Knut R., 2012. "New Taxonomies for Limited Dependent Variables Models," MPRA Paper 41461, University Library of Munich, Germany.
    24. Burli, Pralhad & Lal, Pankaj & Wolde, Bernabas & Jose, Shibu & Bardhan, Sougata, 2021. "Perceptions about switchgrass and land allocation decisions: Evidence from a farmer survey in Missouri," Land Use Policy, Elsevier, vol. 109(C).
    25. Johan A Elkink & Sarah Parlane & Thomas Sattler, 2020. "When one side stays home: A joint model of turnout and vote choice," Working Papers 202012, Geary Institute, University College Dublin.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Miguel Santolino & Catalina Bolancé & Manuela Alcañiz, 2011. "Factors affecting hospital admission and recovery stay duration of in-patient motor victims in Spain," IREA Working Papers 201119, University of Barcelona, Research Institute of Applied Economics, revised Oct 2011.
    2. Leung, Siu Fai & Yu, Shihti, 1996. "On the choice between sample selection and two-part models," Journal of Econometrics, Elsevier, vol. 72(1-2), pages 197-229.
    3. Takashi Yamagata & Chris Orme, 2005. "On Testing Sample Selection Bias Under the Multicollinearity Problem," Econometric Reviews, Taylor & Francis Journals, vol. 24(4), pages 467-481.
    4. Goic, Marcel & Rojas, Andrea & Saavedra, Ignacio, 2021. "The Effectiveness of Triggered Email Marketing in Addressing Browse Abandonments," Journal of Interactive Marketing, Elsevier, vol. 55(C), pages 118-145.
    5. Angrist, Joshua D., 1997. "Conditional independence in sample selection models," Economics Letters, Elsevier, vol. 54(2), pages 103-112, February.
    6. Miguel Santolino & Magnus Söderberg, 2014. "Modelling appellate courts’ responses in motor injury disputes," European Journal of Law and Economics, Springer, vol. 38(3), pages 393-407, December.
    7. Mikhail Zhelonkin & Marc G. Genton & Elvezio Ronchetti, 2016. "Robust inference in sample selection models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(4), pages 805-827, September.
    8. Campbell, Randall C. & Nagel, Gregory L., 2016. "Private information and limitations of Heckman's estimator in banking and corporate finance research," Journal of Empirical Finance, Elsevier, vol. 37(C), pages 186-195.
    9. Myck, Michal & Nicinska, Anna & Morawski, Leszek, 2009. "Count Your Hours: Returns to Education in Poland," IZA Discussion Papers 4332, Institute of Labor Economics (IZA).
    10. P.W. Miller & S. Rummery, 1989. "Gender Wage Discrimination in Australia: A reassessment," Economics Discussion / Working Papers 89-21, The University of Western Australia, Department of Economics.
    11. Brannlund, Runar & Nordstrom, Jonas, 2004. "Carbon tax simulations using a household demand model," European Economic Review, Elsevier, vol. 48(1), pages 211-233, February.
    12. Verbeek, M.J.C.M. & Nijman, T.E., 1992. "Incomplete panels and selection bias : A survey," Discussion Paper 1992-7, Tilburg University, Center for Economic Research.
    13. Geweke, J. & Joel Horowitz & Pesaran, M.H., 2006. "Econometrics: A Bird’s Eye View," Cambridge Working Papers in Economics 0655, Faculty of Economics, University of Cambridge.
    14. Yan Zhang, 2013. "Fair Lending Analysis of Mortgage Pricing: Does Underwriting Matter?," The Journal of Real Estate Finance and Economics, Springer, vol. 46(1), pages 131-151, January.
    15. Hirschauer, Norbert & Grüner, Sven & Mußhoff, Oliver & Becker, Claudia & Jantsch, Antje, 2020. "Can p-values be meaningfully interpreted without random sampling?," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, pages 71-91.
    16. Casey B. Mulligan & Yona Rubinstein, 2004. "The Closing of the Gender Gap as a Roy Model Illusion," NBER Working Papers 10892, National Bureau of Economic Research, Inc.
    17. Peter Hartley & Chris Trengove, 1990. "East‐West Airlines' Sydney‐Melbourne Service: Market Expansion or Demand Diversion?," The Economic Record, The Economic Society of Australia, vol. 66(3), pages 203-220, September.
    18. Kenneth Y. Chay & James L. Powell, 2001. "Semiparametric Censored Regression Models," Journal of Economic Perspectives, American Economic Association, vol. 15(4), pages 29-42, Fall.
    19. Gilpin, Gregory A., 2011. "Reevaluating the effect of non-teaching wages on teacher attrition," Economics of Education Review, Elsevier, vol. 30(4), pages 598-616, August.
    20. Chiang, Eric P. & Assane, Djeto, 2008. "Music piracy among students on the university campus: Do males and females react differently?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 37(4), pages 1371-1380, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:027:i07. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: . General contact details of provider: http://www.jstatsoft.org/ .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.