IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v33y2021i1p262-279.html
   My bibliography  Save this article

Sparse Convex Regression

Author

Listed:
  • Dimitris Bertsimas

    (Sloan School of Management and Operations Research Center, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139;)

  • Nishanth Mundru

    (Kenan-Flagler Business School, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599)

Abstract

We consider the problem of best k -subset convex regression using n observations in d variables. For the case without sparsity, we develop a scalable algorithm for obtaining high quality solutions in practical times that compare favorably with other state of the art methods. We show that by using a cutting plane method, the least squares convex regression problem can be solved for sizes ( n , d ) = ( 10 4 , 10 ) in minutes and ( n , d ) = ( 10 5 , 1 0 2 ) in hours. Our algorithm can be adapted to solve variants such as finding the best convex or concave functions with coordinate-wise monotonicity, norm-bounded subgradients, and minimize the ℓ 1 loss—all with similar scalability to the least squares convex regression problem. Under sparsity, we propose algorithms which iteratively solve for the best subset of features based on first order and cutting plane methods. We show that our methods scale for sizes ( n , d , k = 10 4 , 1 0 2 , 10 ) in minutes and ( n , d , k = 10 5 , 1 0 2 , 10 ) in hours. We demonstrate that these methods control for the false discovery rate effectively.

Suggested Citation

  • Dimitris Bertsimas & Nishanth Mundru, 2021. "Sparse Convex Regression," INFORMS Journal on Computing, INFORMS, vol. 33(1), pages 262-279, January.
  • Handle: RePEc:inm:orijoc:v:33:y:2021:i:1:p:262-279
    DOI: 10.1287/ijoc.2020.0954
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/ijoc.2020.0954
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2020.0954?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Varian, Hal R, 1982. "The Nonparametric Approach to Demand Analysis," Econometrica, Econometric Society, vol. 50(4), pages 945-973, July.
    2. Eunji Lim & Peter W. Glynn, 2012. "Consistency of Multidimensional Convex Regression," Operations Research, INFORMS, vol. 60(1), pages 196-208, February.
    3. Varian, Hal R, 1984. "The Nonparametric Approach to Production Analysis," Econometrica, Econometric Society, vol. 52(3), pages 579-597, May.
    4. Gad Allon & Michael Beenstock & Steven Hackman & Ury Passy & Alexander Shapiro, 2007. "Nonparametric estimation of concave production technologies by entropic methods," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(4), pages 795-816.
    5. NESTEROV, Yu., 2005. "Smooth minimization of non-smooth functions," LIDAM Reprints CORE 1819, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    6. Mekaroonreung, Maethee & Johnson, Andrew L., 2012. "Estimating the shadow prices of SO2 and NOx for U.S. coal power plants: A convex nonparametric least squares approach," Energy Economics, Elsevier, vol. 34(3), pages 723-732.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alireza Olama & Eduardo Camponogara & Paulo R. C. Mendes, 2023. "Distributed primal outer approximation algorithm for sparse convex programming with separable structures," Journal of Global Optimization, Springer, vol. 86(3), pages 637-670, July.
    2. Dai, Sheng, 2023. "Variable selection in convex quantile regression: L1-norm or L0-norm regularization?," European Journal of Operational Research, Elsevier, vol. 305(1), pages 338-355.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lee, Chia-Yen & Johnson, Andrew L. & Moreno-Centeno, Erick & Kuosmanen, Timo, 2013. "A more efficient algorithm for Convex Nonparametric Least Squares," European Journal of Operational Research, Elsevier, vol. 227(2), pages 391-400.
    2. Ian Crawford, 2004. "Necessary and sufficient conditions for latent separability," CeMMAP working papers CWP02/04, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    3. Kuosmanen, Timo & Johnson, Andrew, 2017. "Modeling joint production of multiple outputs in StoNED: Directional distance function approach," European Journal of Operational Research, Elsevier, vol. 262(2), pages 792-801.
    4. Laura Blow & Martin Browning & Ian Crawford, 2004. "Nonparametric methods for the characteristic model," CeMMAP working papers 18/04, Institute for Fiscal Studies.
    5. Cherchye, Laurens & De Rock, Bram & Kerstens, Pieter Jan, 2018. "Production with storable and durable inputs: Nonparametric analysis of intertemporal efficiency," European Journal of Operational Research, Elsevier, vol. 270(2), pages 498-513.
    6. Aguiar, Victor H. & Kashaev, Nail & Allen, Roy, 2023. "Prices, profits, proxies, and production," Journal of Econometrics, Elsevier, vol. 235(2), pages 666-693.
    7. Sundström, David, 2016. "On Specification and Inference in the Econometrics of Public Procurement," Umeå Economic Studies 931, Umeå University, Department of Economics.
    8. Thomas Demuynck & John Rehbeck, 2023. "Computing revealed preference goodness-of-fit measures with integer programming," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 76(4), pages 1175-1195, November.
    9. Ait-Sahalia, Yacine & Duarte, Jefferson, 2003. "Nonparametric option pricing under shape restrictions," Journal of Econometrics, Elsevier, vol. 116(1-2), pages 9-47.
    10. Laurens Cherchye & Bram De Rock & Frederic Vermeulen & Selma Walther, 2021. "Where did it go wrong? Marriage and divorce in Malawi," Quantitative Economics, Econometric Society, vol. 12(2), pages 505-545, May.
    11. Polisson, Matthew & Renou, Ludovic, 2016. "Afriat’s Theorem and Samuelson’s ‘Eternal Darkness’," Journal of Mathematical Economics, Elsevier, vol. 65(C), pages 36-40.
    12. Gross, John, 1995. "Heterogeneity of preferences for local public goods: The case of private expenditure on public education," Journal of Public Economics, Elsevier, vol. 57(1), pages 103-127, May.
    13. Alfred Galichon & John Quah, 2013. "Symposium on revealed preference analysis," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 54(3), pages 419-423, November.
    14. Laurens Cherchye & Bram De Rock & Dieter Saelens & Marijn Verschelde, 2022. "Productive Efficiency Analysis with Incomplete Output Information," Working Papers ECARES 2022-21, ULB -- Universite Libre de Bruxelles.
    15. Carvajal, Andres & Ray, Indrajit & Snyder, Susan, 2004. "Equilibrium behavior in markets and games: testable restrictions and identification," Journal of Mathematical Economics, Elsevier, vol. 40(1-2), pages 1-40, February.
    16. H. Spencer Banzhaf & Yaqin Liu & Martin Smith & Frank Asche, 2019. "Non-Parametric Tests of the Tragedy of the Commons," NBER Working Papers 26398, National Bureau of Economic Research, Inc.
    17. Cherchye, Laurens & Demuynck, Thomas & De Rock, Bram & Hjertstrand, Per, 2015. "Revealed preference tests for weak separability: An integer programming approach," Journal of Econometrics, Elsevier, vol. 186(1), pages 129-141.
    18. Nanjing Jian & Shane G. Henderson, 2020. "Estimating the Probability that a Function Observed with Noise Is Convex," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 376-389, April.
    19. Crooker, John & Kling, Catherine L., 2000. "Nonparametric Bounds on Welfare Measures: A New Tool for Nonmarket Valuation," Journal of Environmental Economics and Management, Elsevier, vol. 39(2), pages 145-161, March.
    20. Marshall B. Reinsdorf, 2010. "Terms Of Trade Effects: Theory And Measurement," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 56(s1), pages 177-205, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:33:y:2021:i:1:p:262-279. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.