IDEAS home Printed from https://ideas.repec.org/p/nbr/nberte/0275.html
   My bibliography  Save this paper

Using Weights to Adjust for Sample Selection When Auxiliary Information is Available

Author

Listed:
  • Aviv Nevo

Abstract

In this paper I analyze GMM estimation when the sample is not a random draw from the population of interest. I exploit auxiliary information, in the form of moments from the population of interest, in order to compute weights that are proportional to the inverse probability of selection. The essential idea is to construct weights, for each observation in the primary data, such that the moments of the weighted data are set equal to the additional moments. The estimator is applied to the Dutch Transportation Panel, in which refreshment draws were taken from the population of interest in order to deal with heavy attrition of the original panel. I show how these additional samples can be used to adjust for sample selection.

Suggested Citation

  • Aviv Nevo, 2001. "Using Weights to Adjust for Sample Selection When Auxiliary Information is Available," NBER Technical Working Papers 0275, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberte:0275
    Note: TWP
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/t0275.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Hausman, Jerry A & Wise, David A, 1979. "Attrition Bias in Experimental and Panel Data: The Gary Income Maintenance Experiment," Econometrica, Econometric Society, vol. 47(2), pages 455-473, March.
    2. Hansen, Lars Peter, 1982. "Large Sample Properties of Generalized Method of Moments Estimators," Econometrica, Econometric Society, vol. 50(4), pages 1029-1054, July.
    3. Zvi Griliches & Jacques Mairesse, 1995. "Production Functions: The Search for Identification," NBER Working Papers 5067, National Bureau of Economic Research, Inc.
    4. Adrian Pagan, 1986. "Two Stage and Related Estimators and Their Applications," Review of Economic Studies, Oxford University Press, vol. 53(4), pages 517-538.
    5. Guido W. Imbens & Tony Lancaster, 1994. "Combining Micro and Macro Data in Microeconometric Models," Review of Economic Studies, Oxford University Press, vol. 61(4), pages 655-680.
    6. Thomas MaCurdy & Thomas Mroz & R. Mark Gritz, 1998. "An Evaluation of the National Longitudinal Survey on Youth," Journal of Human Resources, University of Wisconsin Press, vol. 33(2), pages 345-436.
    7. Ahn, Hyungtaik & Powell, James L., 1993. "Semiparametric estimation of censored selection models with a nonparametric selection mechanism," Journal of Econometrics, Elsevier, vol. 58(1-2), pages 3-29, July.
    8. Sims,Christopher A. (ed.), 1994. "Advances in Econometrics," Cambridge Books, Cambridge University Press, number 9780521444606, October.
    9. Newey, Whitney K & Powell, James L & Walker, James R, 1990. "Semiparametric Estimation of Selection Models: Some Empirical Results," American Economic Review, American Economic Association, vol. 80(2), pages 324-328, May.
    10. Judith K. Hellerstein & Guido W. Imbens, 1999. "Imposing Moment Restrictions From Auxiliary Data By Weighting," The Review of Economics and Statistics, MIT Press, vol. 81(1), pages 1-14, February.
    11. Keisuke Hirano & Guido W. Imbens & Geert Ridder & Donald B. Rubin, 2001. "Combining Panel Data Sets with Attrition and Refreshment Samples," Econometrica, Econometric Society, vol. 69(6), pages 1645-1659, November.
    12. Cosslett, Stephen R, 1981. "Maximum Likelihood Estimator for Choice-Based Samples," Econometrica, Econometric Society, vol. 49(5), pages 1289-1316, September.
    13. Manski, C.F., 1990. "The Selection Problem," Working papers 90-12, Wisconsin Madison - Social Systems.
    14. Olley, G Steven & Pakes, Ariel, 1996. "The Dynamics of Productivity in the Telecommunications Equipment Industry," Econometrica, Econometric Society, vol. 64(6), pages 1263-1297, November.
    15. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
    16. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    17. Ridder, Geert, 1992. "An empirical evaluation of some models for non-random attrition in panel data," Structural Change and Economic Dynamics, Elsevier, vol. 3(2), pages 337-355, December.
    18. Guido W. Imbens & Richard H. Spady & Phillip Johnson, 1998. "Information Theoretic Approaches to Inference in Moment Condition Models," Econometrica, Econometric Society, vol. 66(2), pages 333-358, March.
    19. Yuichi Kitamura & Michael Stutzer, 1997. "An Information-Theoretic Alternative to Generalized Method of Moments Estimation," Econometrica, Econometric Society, vol. 65(4), pages 861-874, July.
    20. James Heckman & Hidehiko Ichimura & Jeffrey Smith & Petra Todd, 1998. "Characterizing Selection Bias Using Experimental Data," Econometrica, Econometric Society, vol. 66(5), pages 1017-1098, September.
    21. Heckman, James J, 1974. "Shadow Prices, Market Wages, and Labor Supply," Econometrica, Econometric Society, vol. 42(4), pages 679-694, July.
    22. Ekaterini Kyriazidou, 1997. "Estimation of a Panel Data Sample Selection Model," Econometrica, Econometric Society, vol. 65(6), pages 1335-1364, November.
    23. Back, Kerry & Brown, David P, 1993. "Implied Probabilities in GMM Estimators," Econometrica, Econometric Society, vol. 61(4), pages 971-975, July.
    24. Manski, Charles F & Lerman, Steven R, 1977. "The Estimation of Choice Probabilities from Choice Based Samples," Econometrica, Econometric Society, vol. 45(8), pages 1977-1988, November.
    25. Joshua D. Angrist, 1995. "Conditioning on the Probability of Selection to Control Selection Bias," NBER Technical Working Papers 0181, National Bureau of Economic Research, Inc.
    26. Heckman, James J, 1990. "Varieties of Selection Bias," American Economic Review, American Economic Association, vol. 80(2), pages 313-318, May.
    27. Guido W. Imbens, 1997. "One-Step Estimators for Over-Identified Generalized Method of Moments Models," Review of Economic Studies, Oxford University Press, vol. 64(3), pages 359-383.
    28. Sims,Christopher A. (ed.), 1994. "Advances in Econometrics," Cambridge Books, Cambridge University Press, number 9780521444590, October.
    29. Jeffrey M. Wooldridge, 1999. "Asymptotic Properties of Weighted M-Estimators for Variable Probability Samples," Econometrica, Econometric Society, vol. 67(6), pages 1385-1406, November.
    30. Nevo, Aviv, 2002. "Sample selection and information-theoretic alternatives to GMM," Journal of Econometrics, Elsevier, vol. 107(1-2), pages 149-157, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bhattacharya, Debopam, 2008. "Inference in panel data models under attrition caused by unobservables," Journal of Econometrics, Elsevier, vol. 144(2), pages 430-446, June.
    2. Rene Segers & Philip Hans Franses, 2014. "Panel design effects on response rates and response quality," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 1-24, February.
    3. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    4. Heng Chen & Marie-Hélène Felt & Kim P. Huynh, 2017. "Retail payment innovations and cash usage: accounting for attrition by using refreshment samples," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(2), pages 503-530, February.
    5. Nail Kashaev, 2022. "Estimation of Parametric Binary Outcome Models with Degenerate Pure Choice-Based Data with Application to COVID-19-Positive Tests from British Columbia," University of Western Ontario, Departmental Research Report Series 20225, University of Western Ontario, Department of Economics.
    6. Harding, Matthew & Lamarche, Carlos, 2019. "A panel quantile approach to attrition bias in Big Data: Evidence from a randomized experiment," Journal of Econometrics, Elsevier, vol. 211(1), pages 61-82.
    7. Marcel Das & Vera Toepoel & Arthur van Soest, 2011. "Nonparametric Tests of Panel Conditioning and Attrition Bias in Panel Surveys," Sociological Methods & Research, , vol. 40(1), pages 32-56, February.
    8. Sizhong Sun, 2023. "Firm heterogeneity, worker training and labor productivity: the role of endogenous self-selection," Journal of Productivity Analysis, Springer, vol. 59(2), pages 121-133, April.
    9. Takahiro Hoshino & Yuya Shimizu, 2019. "Doubly Robust-type Estimation of Population Moments and Parameters in Biased Sampling," Keio-IES Discussion Paper Series 2019-006, Institute for Economics Studies, Keio University.
    10. Denis Heng Yan Leung & Ken Yamada & Biao Zhang, 2015. "Enriching Surveys with Supplementary Data and its Application to Studying Wage Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 155-179, March.
    11. Takahiro Hoshino & Keisuke Takahata, 2018. "Identification of heterogeneous treatment effects as a function of potential untreated outcome under the nonignorable assignment condition," Keio-IES Discussion Paper Series 2018-005, Institute for Economics Studies, Keio University.
    12. Lechner, Michael, 2004. "Sequential Matching Estimation of Dynamic Causal Models," IZA Discussion Papers 1042, Institute of Labor Economics (IZA).
    13. Devereux, Paul J. & Tripathi, Gautam, 2009. "Optimally combining censored and uncensored datasets," Journal of Econometrics, Elsevier, vol. 151(1), pages 17-32, July.
    14. Zhong Guan & Jing Qin, 2017. "Empirical likelihood method for non-ignorable missing data problems," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 113-135, January.
    15. Emre Ekinci & Insan Tunah & Berk Yavuzoglu, 2017. "Rescaled Additivity Non-Ignorable (RAN) Model of Generalized Attrition," Working Papers 1702, Nazarbayev University, Department of Economics, revised Mar 2017.
    16. Darren Lubotsky, 2007. "Chutes or Ladders? A Longitudinal Analysis of Immigrant Earnings," Journal of Political Economy, University of Chicago Press, vol. 115(5), pages 820-867, October.
    17. Olanrewaju Akande & Gabriel Madson & D. Sunshine Hillygus & Jerome P. Reiter, 2021. "Leveraging auxiliary information on marginal distributions in nonignorable models for item and unit nonresponse," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(2), pages 643-662, April.
    18. Hindsley, Paul & Landry, Craig E. & Gentner, Brad, 2011. "Addressing onsite sampling in recreation site choice models," Journal of Environmental Economics and Management, Elsevier, vol. 62(1), pages 95-110, July.
    19. repec:rre:publsh:v:35:y:2005:i:2:p:187-205 is not listed on IDEAS
    20. Takahiro Hoshino & Ryosuke Igari, 2017. "Quasi-Bayesian Inference for Latent Variable Models with External Information: Application to generalized linear mixed models for biased data," Keio-IES Discussion Paper Series 2017-014, Institute for Economics Studies, Keio University.
    21. Joachim Inkmann, 2010. "Estimating Firm Size Elasticities of Product and Process R&D," Economica, London School of Economics and Political Science, vol. 77(306), pages 384-402, April.
    22. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," Review of Economic Studies, Oxford University Press, vol. 79(3), pages 1053-1079.
    23. Ryo Kato & Takahiro Hoshino, 2020. "Semiparametric Bayesian Instrumental Variables Estimation for Nonignorable Missing Instruments," Discussion Paper Series DP2020-06, Research Institute for Economics & Business Administration, Kobe University.
    24. Prokhorov, Artem & Schmidt, Peter, 2009. "GMM redundancy results for general missing data problems," Journal of Econometrics, Elsevier, vol. 151(1), pages 47-55, July.
    25. Emre Ekinci, 2009. "Dealing with Attrition When Refreshment Samples are Available: An Application to the Turkish Household Labor Force Survey," 2009 Meeting Papers 353, Society for Economic Dynamics.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Joachim Inkmann, 2010. "Estimating Firm Size Elasticities of Product and Process R&D," Economica, London School of Economics and Political Science, vol. 77(306), pages 384-402, April.
    2. Inkmann, J., 2005. "Inverse Probability Weighted Generalised Empirical Likelihood Estimators : Firm Size and R&D Revisited," Other publications TiSEM c39cff1f-16c1-4446-a83f-c, Tilburg University, School of Economics and Management.
    3. John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1998. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," Journal of Human Resources, University of Wisconsin Press, vol. 33(2), pages 251-299.
    4. Lewbel, Arthur, 2007. "Endogenous selection or treatment model estimation," Journal of Econometrics, Elsevier, vol. 141(2), pages 777-806, December.
    5. Esmeralda A. Ramalho & Richard J. Smith, 2013. "Discrete Choice Non-Response," Review of Economic Studies, Oxford University Press, vol. 80(1), pages 343-364.
    6. Martin Huber, 2014. "Treatment Evaluation in the Presence of Sample Selection," Econometric Reviews, Taylor & Francis Journals, vol. 33(8), pages 869-905, November.
    7. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," Review of Economic Studies, Oxford University Press, vol. 79(3), pages 1053-1079.
    8. James J. Heckman, 2005. "Micro Data, Heterogeneity and the Evaluation of Public Policy Part 2," The American Economist, Sage Publications, vol. 49(1), pages 16-44, March.
    9. Prokhorov, Artem & Schmidt, Peter, 2009. "GMM redundancy results for general missing data problems," Journal of Econometrics, Elsevier, vol. 151(1), pages 47-55, July.
    10. Richard Blundell & Amanda Gosling & Hidehiko Ichimura & Costas Meghir, 2007. "Changes in the Distribution of Male and Female Wages Accounting for Employment Composition Using Bounds," Econometrica, Econometric Society, vol. 75(2), pages 323-363, March.
    11. d'Haultfoeuille, Xavier, 2010. "A new instrumental method for dealing with endogenous selection," Journal of Econometrics, Elsevier, vol. 154(1), pages 1-15, January.
    12. Vazquez-Alvarez, R. & Melenberg, B. & van Soest, A.H.O., 1999. "Nonparametric Bounds on the Income Distribution in the Presence of Item Nonresponse," Other publications TiSEM d37fb6a5-2075-42b2-b0b4-5, Tilburg University, School of Economics and Management.
    13. Nevo, Aviv, 2002. "Sample selection and information-theoretic alternatives to GMM," Journal of Econometrics, Elsevier, vol. 107(1-2), pages 149-157, March.
    14. Gordon B. Dahl, 2002. "Mobility and the Return to Education: Testing a Roy Model with Multiple Markets," Econometrica, Econometric Society, vol. 70(6), pages 2367-2420, November.
    15. Angrist, Joshua D., 1997. "Conditional independence in sample selection models," Economics Letters, Elsevier, vol. 54(2), pages 103-112, February.
    16. Blundell, Richard & Macurdy, Thomas, 1999. "Labor supply: A review of alternative approaches," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 3, chapter 27, pages 1559-1695, Elsevier.
    17. Sizhong Sun, 2023. "Firm heterogeneity, worker training and labor productivity: the role of endogenous self-selection," Journal of Productivity Analysis, Springer, vol. 59(2), pages 121-133, April.
    18. Manuel Arellano & Stéphane Bonhomme, 2017. "Sample Selection in Quantile Regression: A Survey," Working Papers wp2018_1702, CEMFI.
    19. Manuel Arellano & Stéphane Bonhomme, 2017. "Sample Selection in Quantile Regression: A Survey," Working Papers wp2017_1702, CEMFI.
    20. Hainmueller, Jens, 2012. "Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies," Political Analysis, Cambridge University Press, vol. 20(1), pages 25-46, January.

    More about this item

    JEL classification:

    • C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberte:0275. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.