IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v69y2021i3p985-1013.html
   My bibliography  Save this article

Sample Out-of-Sample Inference Based on Wasserstein Distance

Author

Listed:
  • Jose Blanchet

    (Department of Management Science and Engineering, Stanford University, Stanford, California 94305)

  • Yang Kang

    (Department of Statistics, Columbia University, New York, New York 10027)

Abstract

We present a novel inference approach that we call sample out-of-sample inference. The approach can be used widely, ranging from semisupervised learning to stress testing, and it is fundamental in the application of data-driven distributionally robust optimization. Our method enables measuring the impact of plausible out-of-sample scenarios in a given performance measure of interest, such as a financial loss. The methodology is inspired by empirical likelihood (EL), but we optimize the empirical Wasserstein distance (instead of the empirical likelihood) induced by observations. From a methodological standpoint, our analysis of the asymptotic behavior of the induced Wasserstein-distance profile function shows dramatic qualitative differences relative to EL. For instance, in contrast to EL, which typically yields chi-squared weak convergence limits, our asymptotic distributions are often not chi-squared. Also, the rates of convergence that we obtain have some dependence on the dimension in a nontrivial way but remain controlled as the dimension increases.

Suggested Citation

  • Jose Blanchet & Yang Kang, 2021. "Sample Out-of-Sample Inference Based on Wasserstein Distance," Operations Research, INFORMS, vol. 69(3), pages 985-1013, May.
  • Handle: RePEc:inm:oropre:v:69:y:2021:i:3:p:985-1013
    DOI: 10.1287/opre.2020.2028
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.2020.2028
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.2020.2028?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Aharon Ben-Tal & Dick den Hertog & Anja De Waegenaere & Bertrand Melenberg & Gijs Rennen, 2013. "Robust Solutions of Optimization Problems Affected by Uncertain Probabilities," Management Science, INFORMS, vol. 59(2), pages 341-357, April.
    2. Whitney K. Newey & Richard J. Smith, 2004. "Higher Order Properties of Gmm and Generalized Empirical Likelihood Estimators," Econometrica, Econometric Society, vol. 72(1), pages 219-255, January.
    3. Einmahl, J.H.J. & McKeague, I.W., 1999. "Confidence tubes for multiple quantile plots via empirical likelihood," Other publications TiSEM b64493f8-1c01-40fd-b16d-7, Tilburg University, School of Economics and Management.
    4. Antoine, Bertille & Bonnal, Helene & Renault, Eric, 2007. "On the efficient use of the informational content of estimating equations: Implied probabilities and Euclidean empirical likelihood," Journal of Econometrics, Elsevier, vol. 138(2), pages 461-487, June.
    5. Song Chen & Ingrid Van Keilegom, 2009. "A review on empirical likelihood methods for regression," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 18(3), pages 415-447, November.
    6. Bravo, Francesco, 2004. "Empirical Likelihood Based Inference With Applications To Some Econometric Models," Econometric Theory, Cambridge University Press, vol. 20(2), pages 231-264, April.
    7. Patrik Guggenberger, 2008. "Finite Sample Evidence Suggesting a Heavy Tail Problem of the Generalized Empirical Likelihood Estimator," Econometric Reviews, Taylor & Francis Journals, vol. 27(4-6), pages 526-541.
    8. Song Chen, 1993. "On the accuracy of empirical likelihood confidence regions for linear regression model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 45(4), pages 621-637, December.
    9. Jose Blanchet & Karthyek Murthy, 2019. "Quantifying Distributional Model Risk via Optimal Transport," Mathematics of Operations Research, INFORMS, vol. 44(2), pages 565-600, May.
    10. Song Chen & Ingrid Van Keilegom, 2009. "Rejoinder on: A review on empirical likelihood methods for regression," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 18(3), pages 468-474, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chang, Jinyuan & Chen, Song Xi & Chen, Xiaohong, 2015. "High dimensional generalized empirical likelihood for moment restrictions with dependent data," Journal of Econometrics, Elsevier, vol. 185(1), pages 283-304.
    2. Zhang, Jia & Shi, Haoming & Tian, Lemeng & Xiao, Fengjun, 2019. "Penalized generalized empirical likelihood in high-dimensional weakly dependent data," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 270-283.
    3. Otsu, Taisuke & Xu, Ke-Li & Matsushita, Yukitoshi, 2015. "Empirical likelihood for regression discontinuity design," Journal of Econometrics, Elsevier, vol. 186(1), pages 94-112.
    4. Jin, Fei & Lee, Lung-fei, 2019. "GEL estimation and tests of spatial autoregressive models," Journal of Econometrics, Elsevier, vol. 208(2), pages 585-612.
    5. Hill, Jonathan B. & Prokhorov, Artem, 2016. "GEL estimation for heavy-tailed GARCH models with robust empirical likelihood inference," Journal of Econometrics, Elsevier, vol. 190(1), pages 18-45.
    6. Xuexin Wang, 2020. "A new class of tests for overidentifying restrictions in moment condition models," Econometric Reviews, Taylor & Francis Journals, vol. 39(5), pages 495-509, May.
    7. Antoine, Bertille & Dovonon, Prosper, 2021. "Robust estimation with exponentially tilted Hellinger distance," Journal of Econometrics, Elsevier, vol. 224(2), pages 330-344.
    8. Qinqin Hu & Lu Lin, 2017. "Conditional sure independence screening by conditional marginal empirical likelihood," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 63-96, February.
    9. Siddharta Chib & Minchul Shin & Anna Simoni, 2016. "Bayesian Empirical Likelihood Estimation and Comparison of Moment Condition Models," Working Papers 2016-21, Center for Research in Economics and Statistics.
    10. Mardi Dungey & Vitali Alexeev & Jing Tian & Alastair R. Hall, 2015. "Econometricians Have Their Moments: GMM at 32," The Economic Record, The Economic Society of Australia, vol. 91, pages 1-24, June.
    11. Fan, Yanqin & Gentry, Matthew & Li, Tong, 2011. "A new class of asymptotically efficient estimators for moment condition models," Journal of Econometrics, Elsevier, vol. 162(2), pages 268-277, June.
    12. Alastair R. Hall, 2015. "Econometricians Have Their Moments: GMM at 32," The Economic Record, The Economic Society of Australia, vol. 91(S1), pages 1-24, June.
    13. Pierre Chaussé, 2011. "Generalized empirical likelihood for a continuum of moment conditions," Working Papers 1104, University of Waterloo, Department of Economics, revised Oct 2011.
    14. Seojeong Lee, 2018. "Asymptotic Refinements of a Misspecification-Robust Bootstrap for Generalized Empirical Likelihood Estimators," Papers 1806.00953, arXiv.org, revised Jun 2018.
    15. Prosper Dovonon, 2016. "Large Sample Properties of the Three-Step Euclidean Likelihood Estimators under Model Misspecification," Econometric Reviews, Taylor & Francis Journals, vol. 35(4), pages 465-514, April.
    16. Hong Guo & Changliang Zou & Zhaojun Wang & Bin Chen, 2014. "Empirical likelihood for high-dimensional linear regression models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 77(7), pages 921-945, October.
    17. Lee, Seojeong, 2016. "Asymptotic refinements of a misspecification-robust bootstrap for GEL estimators," Journal of Econometrics, Elsevier, vol. 192(1), pages 86-104.
    18. Whang, Yoon-Jae, 2006. "Smoothed Empirical Likelihood Methods For Quantile Regression Models," Econometric Theory, Cambridge University Press, vol. 22(2), pages 173-205, April.
    19. Hansen, Lars Peter, 2013. "Uncertainty Outside and Inside Economic Models," Nobel Prize in Economics documents 2013-7, Nobel Prize Committee.
    20. repec:cep:stiecm:/2014/572 is not listed on IDEAS
    21. Weihua Zhao & Riquan Zhang & Yukun Liu & Jicai Liu, 2015. "Empirical likelihood based modal regression," Statistical Papers, Springer, vol. 56(2), pages 411-430, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:69:y:2021:i:3:p:985-1013. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.