IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v337y2024i1d10.1007_s10479-023-05726-3.html
   My bibliography  Save this article

Regularized distributionally robust optimization with application to the index tracking problem

Author

Listed:
  • Leyang Zhao

    (UNSW Sydney)

  • Guoyin Li

    (UNSW Sydney)

  • Spiridon Penev

    (UNSW Sydney)

Abstract

In recent years, distributionally robust optimization (DRO) has received a lot of interest due to its ability to reduce the worst-case risk when there is a perturbation to the data-generating distribution. A good statistical model is expected to perform well on both the normal and the perturbed data. On the other hand, variable selection and regularization is a research area that aims to identify the important features and remove the redundant ones. It helps to improve the prediction accuracy as well as the interpretability of the model. In this paper, we propose an optimization model that is a regularized version of the canonical distributionally robust optimization (DRO) problem where the ambiguity set is described by a general class of divergence measures that admit a suitable conic structure. The divergence measures we examined include several popular divergence measures used in the literature such as the Kullback–Leibler divergence, total variation, and the Chi-divergence. By exploiting the conic representability of the divergence measure, we show that the regularized DRO problem can be equivalently reformulated as a nonlinear conic programming problem. In the case where the regularization is convex and semi-definite programming representable, the reformulation can be further simplified as a tractable linear conic program and hence can be efficiently solved via existing software. More generally, if the regularization can be written as a difference of convex functions, we demonstrate that a solution for the regularized DRO problem can be found by solving a sequence of conic linear programming problems. Finally, we apply the proposed regularized DRO model to both simulated and real financial data and demonstrate its superior performance in comparison with some non-robust models.

Suggested Citation

  • Leyang Zhao & Guoyin Li & Spiridon Penev, 2024. "Regularized distributionally robust optimization with application to the index tracking problem," Annals of Operations Research, Springer, vol. 337(1), pages 397-424, June.
  • Handle: RePEc:spr:annopr:v:337:y:2024:i:1:d:10.1007_s10479-023-05726-3
    DOI: 10.1007/s10479-023-05726-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-023-05726-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-023-05726-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    2. David Wozabal, 2012. "A framework for optimization under ambiguity," Annals of Operations Research, Springer, vol. 193(1), pages 21-47, March.
    3. Wu, Lan & Yang, Yuehan & Liu, Hanzhong, 2014. "Nonnegative-lasso and application in index tracking," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 116-126.
    4. Darinka Dentcheva & Spiridon Penev & Andrzej Ruszczyński, 2010. "Kusuoka representation of higher order dual risk measures," Annals of Operations Research, Springer, vol. 181(1), pages 325-335, December.
    5. Hoai An Le Thi & Van Ngai Huynh & Tao Pham Dinh, 2018. "Convergence Analysis of Difference-of-Convex Algorithm with Subanalytic Data," Journal of Optimization Theory and Applications, Springer, vol. 179(1), pages 103-126, October.
    6. Le Thi, H.A. & Pham Dinh, T. & Le, H.M. & Vo, X.T., 2015. "DC approximation approaches for sparse optimization," European Journal of Operational Research, Elsevier, vol. 244(1), pages 26-46.
    7. Aharon Ben-Tal & Dick den Hertog & Anja De Waegenaere & Bertrand Melenberg & Gijs Rennen, 2013. "Robust Solutions of Optimization Problems Affected by Uncertain Probabilities," Management Science, INFORMS, vol. 59(2), pages 341-357, April.
    8. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    9. Georg Pflug & David Wozabal, 2007. "Ambiguity in portfolio selection," Quantitative Finance, Taylor & Francis Journals, vol. 7(4), pages 435-442.
    10. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xia, Siwei & Yang, Yuehan & Yang, Hu, 2023. "High-dimensional sparse portfolio selection with nonnegative constraint," Applied Mathematics and Computation, Elsevier, vol. 443(C).
    2. Xiaofei Wu & Rongmei Liang & Hu Yang, 2022. "Penalized and constrained LAD estimation in fixed and high dimension," Statistical Papers, Springer, vol. 63(1), pages 53-95, February.
    3. Siwei Xia & Yuehan Yang & Hu Yang, 2022. "Sparse Laplacian Shrinkage with the Graphical Lasso Estimator for Regression Problems," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 255-277, March.
    4. Ning Li & Hu Yang, 2021. "Nonnegative estimation and variable selection under minimax concave penalty for sparse high-dimensional linear regression models," Statistical Papers, Springer, vol. 62(2), pages 661-680, April.
    5. Shanshan Qin & Hao Ding & Yuehua Wu & Feng Liu, 2021. "High-dimensional sign-constrained feature selection and grouping," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(4), pages 787-819, August.
    6. Yang, Yuan & McMahan, Christopher S. & Wang, Yu-Bo & Ouyang, Yuyuan, 2024. "Estimation of l0 norm penalized models: A statistical treatment," Computational Statistics & Data Analysis, Elsevier, vol. 192(C).
    7. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    8. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    9. Qianyun Li & Runmin Shi & Faming Liang, 2019. "Drug sensitivity prediction with high-dimensional mixture regression," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-18, February.
    10. Soave, David & Lawless, Jerald F., 2023. "Regularized regression for two phase failure time studies," Computational Statistics & Data Analysis, Elsevier, vol. 182(C).
    11. Zemin Zheng & Jie Zhang & Yang Li, 2022. "L 0 -Regularized Learning for High-Dimensional Additive Hazards Regression," INFORMS Journal on Computing, INFORMS, vol. 34(5), pages 2762-2775, September.
    12. Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.
    13. Zichen Zhang & Ye Eun Bae & Jonathan R. Bradley & Lang Wu & Chong Wu, 2022. "SUMMIT: An integrative approach for better transcriptomic data imputation improves causal gene identification," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    14. Wang, Christina Dan & Chen, Zhao & Lian, Yimin & Chen, Min, 2022. "Asset selection based on high frequency Sharpe ratio," Journal of Econometrics, Elsevier, vol. 227(1), pages 168-188.
    15. Bartosz Uniejewski, 2024. "Regularization for electricity price forecasting," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 34(3), pages 267-286.
    16. Peter Bühlmann & Jacopo Mandozzi, 2014. "High-dimensional variable screening and bias in subsequent inference, with an empirical comparison," Computational Statistics, Springer, vol. 29(3), pages 407-430, June.
    17. Capanu, Marinela & Giurcanu, Mihai & Begg, Colin B. & Gönen, Mithat, 2023. "Subsampling based variable selection for generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 184(C).
    18. Ander Wilson & Brian J. Reich, 2014. "Confounder selection via penalized credible regions," Biometrics, The International Biometric Society, vol. 70(4), pages 852-861, December.
    19. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    20. Zeyu Bian & Erica E. M. Moodie & Susan M. Shortreed & Sahir Bhatnagar, 2023. "Variable selection in regression‐based estimation of dynamic treatment regimes," Biometrics, The International Biometric Society, vol. 79(2), pages 988-999, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:337:y:2024:i:1:d:10.1007_s10479-023-05726-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.