Sparse regression for large data sets with outliers
Author
Abstract
Suggested Citation
DOI: 10.1016/j.ejor.2021.05.049
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Pun, Chi Seng & Wong, Hoi Ying, 2019. "A linear programming model for selection of sparse high-dimensional multiperiod portfolios," European Journal of Operational Research, Elsevier, vol. 273(2), pages 754-771.
- Bertsimas, Dimitris & Copenhaver, Martin S., 2018. "Characterization of the equivalence of robustification and regularization in linear and matrix regression," European Journal of Operational Research, Elsevier, vol. 270(3), pages 931-942.
- Alexandre Belloni & Victor Chernozhukov, 2011. "High Dimensional Sparse Econometric Models: An Introduction," Papers 1106.5242, arXiv.org, revised Sep 2011.
- Joki, Kaisa & Bagirov, Adil M. & Karmitsa, Napsu & Mäkelä, Marko M. & Taheri, Sona, 2020. "Clusterwise support vector linear regression," European Journal of Operational Research, Elsevier, vol. 287(1), pages 19-35.
- P. Tseng, 2001. "Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization," Journal of Optimization Theory and Applications, Springer, vol. 109(3), pages 475-494, June.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015.
"Post-Selection and Post-Regularization Inference in Linear Models with Many Controls and Instruments,"
American Economic Review, American Economic Association, vol. 105(5), pages 486-490, May.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-Selection and Post-Regularization Inference in Linear Models with Many Controls and Instruments," Papers 1501.03185, arXiv.org.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-selection and post-regularization inference in linear models with many controls and instruments," CeMMAP working papers 02/15, Institute for Fiscal Studies.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-selection and post-regularization inference in linear models with many controls and instruments," CeMMAP working papers CWP02/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Ghaddar, Bissan & Naoum-Sawaya, Joe, 2018. "High dimensional data classification and feature selection using support vector machines," European Journal of Operational Research, Elsevier, vol. 265(3), pages 993-1004.
- Nazemi, Abdolreza & Heidenreich, Konstantin & Fabozzi, Frank J., 2018. "Improving corporate bond recovery rate prediction using multi-factor support vector regressions," European Journal of Operational Research, Elsevier, vol. 271(2), pages 664-675.
- Zhang, Yiyun & Li, Runze & Tsai, Chih-Ling, 2010. "Regularization Parameter Selections via Generalized Information Criterion," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 312-323.
- Sagaert, Yves R. & Aghezzaf, El-Houssaine & Kourentzes, Nikolaos & Desmet, Bram, 2018. "Tactical sales forecasting using a very large set of macroeconomic indicators," European Journal of Operational Research, Elsevier, vol. 264(2), pages 558-569.
- Grace Yoon & Raymond J Carroll & Irina Gaynanova, 2020. "Sparse semiparametric canonical correlation analysis for data of mixed types," Biometrika, Biometrika Trust, vol. 107(3), pages 609-625.
- Gür Ali, Özden & Yaman, Kübra, 2013. "Selecting rows and columns for training support vector regression models with large retail datasets," European Journal of Operational Research, Elsevier, vol. 226(3), pages 471-480.
- Martínez, Andrés & Schmuck, Claudia & Pereverzyev, Sergiy & Pirker, Clemens & Haltmeier, Markus, 2020. "A machine learning framework for customer purchase prediction in the non-contractual setting," European Journal of Operational Research, Elsevier, vol. 281(3), pages 588-596.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2011.
"Inference for high-dimensional sparse econometric models,"
CeMMAP working papers
CWP41/11, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2011. "Inference for High-Dimensional Sparse Econometric Models," Papers 1201.0220, arXiv.org.
- Christophe Croux & Catherine Dehon, 2010.
"Influence functions of the Spearman and Kendall correlation measures,"
Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 19(4), pages 497-515, November.
- Croux, C. & Dehon, C., 2010. "Influence Functions of the Spearman and Kendall Correlation Measures," Discussion Paper 2010-40, Tilburg University, Center for Economic Research.
- Croux, C. & Dehon, C., 2010. "Influence Functions of the Spearman and Kendall Correlation Measures," Other publications TiSEM 5e8ee766-ee1e-4d6e-b035-6, Tilburg University, School of Economics and Management.
- Flores, Salvador, 2015. "SOCP relaxation bounds for the optimal subset selection problem applied to robust linear regression," European Journal of Operational Research, Elsevier, vol. 246(1), pages 44-50.
- Martin-Barragan, Belen & Lillo, Rosa & Romo, Juan, 2014. "Interpretable support vector machines for functional data," European Journal of Operational Research, Elsevier, vol. 232(1), pages 146-155.
- Ballings, Michel & Van den Poel, Dirk, 2015. "CRM in social media: Predicting increases in Facebook usage frequency," European Journal of Operational Research, Elsevier, vol. 244(1), pages 248-260.
- Smucler, Ezequiel & Yohai, Victor J., 2017. "Robust and sparse estimators for linear regression models," Computational Statistics & Data Analysis, Elsevier, vol. 111(C), pages 116-130.
- Huang, Tao & Fildes, Robert & Soopramanien, Didier, 2014. "The value of competitive information in forecasting FMCG retail product sales and the variable selection problem," European Journal of Operational Research, Elsevier, vol. 237(2), pages 738-748.
- Erjie Ang & Sara Kwasnick & Mohsen Bayati & Erica L. Plambeck & Michael Aratow, 2016. "Accurate Emergency Department Wait Time Prediction," Manufacturing & Service Operations Management, INFORMS, vol. 18(1), pages 141-156, February.
- Abolhassani, Amir & James Harner, E. & Jaridi, Majid, 2019. "Empirical analysis of productivity enhancement strategies in the North American automotive industry," International Journal of Production Economics, Elsevier, vol. 208(C), pages 140-159.
- Ma, Shaohui & Fildes, Robert & Huang, Tao, 2016. "Demand forecasting with high dimensional data: The case of SKU retail sales forecasting with intra- and inter-category promotional information," European Journal of Operational Research, Elsevier, vol. 249(1), pages 245-257.
- Grznar, John & Prasad, Sameer & Tata, Jasmine, 2007. "Neural networks and organizational systems: Modeling non-linear relationships," European Journal of Operational Research, Elsevier, vol. 181(2), pages 939-955, September.
- Leung, Andy & Zhang, Hongyang & Zamar, Ruben, 2016. "Robust regression estimation and inference in the presence of cellwise and casewise contamination," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 1-11.
- Huck, Nicolas, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," European Journal of Operational Research, Elsevier, vol. 278(1), pages 330-342.
- Çetin, Meral, 2009. "Robust model selection criteria for robust Liu estimator," European Journal of Operational Research, Elsevier, vol. 199(1), pages 21-24, November.
- Masci, Chiara & Johnes, Geraint & Agasisti, Tommaso, 2018. "Student and school performance across countries: A machine learning approach," European Journal of Operational Research, Elsevier, vol. 269(3), pages 1072-1085.
- Nicolas Huck, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," Post-Print hal-02143971, HAL.
- Cui, Hailong & Rajagopalan, Sampath & Ward, Amy R., 2020. "Predicting product return volume using machine learning methods," European Journal of Operational Research, Elsevier, vol. 281(3), pages 612-627.
- Wilms, Ines & Gelper, Sarah & Croux, Christophe, 2016.
"The predictive power of the business and bank sentiment of firms: A high-dimensional Granger Causality approach,"
European Journal of Operational Research, Elsevier, vol. 254(1), pages 138-147.
- Ines Wilms & Sarah Gelper & Christophe Croux, 2015. "The predictive power of the business and bank sentiment of firms: A high-dimensional Granger Causality approach," Working Papers of Department of Decision Sciences and Information Management, Leuven 504661, KU Leuven, Faculty of Economics and Business (FEB), Department of Decision Sciences and Information Management, Leuven.
- Khan, Jafar A. & Van Aelst, Stefan & Zamar, Ruben H., 2007. "Robust Linear Model Selection Based on Least Angle Regression," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1289-1299, December.
- Landajo, Manuel & de Andres, Javier & Lorca, Pedro, 2007. "Robust neural modeling for the cross-sectional analysis of accounting information," European Journal of Operational Research, Elsevier, vol. 177(2), pages 1232-1252, March.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Fu, Saiji & Tian, Yingjie & Tang, Long, 2023. "Robust regression under the general framework of bounded loss functions," European Journal of Operational Research, Elsevier, vol. 310(3), pages 1325-1339.
- Mohd Shareduwan Mohd Kasihmuddin & Siti Zulaikha Mohd Jamaludin & Mohd. Asyraf Mansor & Habibah A. Wahab & Siti Maisharah Sheikh Ghadzi, 2022. "Supervised Learning Perspective in Logic Mining," Mathematics, MDPI, vol. 10(6), pages 1-35, March.
- Hossein Tarighi & Zeynab Nourbakhsh Hosseiny & Maryam Akbari & Elaheh Mohammadhosseini, 2023. "The Moderating Effect of the COVID-19 Pandemic on the Relation between Corporate Governance and Firm Performance," JRFM, MDPI, vol. 16(7), pages 1-43, June.
- Barbato, Michele & Ceselli, Alberto, 2024. "Mathematical programming for simultaneous feature selection and outlier detection under l1 norm," European Journal of Operational Research, Elsevier, vol. 316(3), pages 1070-1084.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- He Jiang, 2023. "Robust forecasting in spatial autoregressive model with total variation regularization," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 195-211, March.
- He Jiang, 2022. "A novel robust structural quadratic forecasting model and applications," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(6), pages 1156-1180, September.
- Van Belle, Jente & Guns, Tias & Verbeke, Wouter, 2021. "Using shared sell-through data to forecast wholesaler demand in multi-echelon supply chains," European Journal of Operational Research, Elsevier, vol. 288(2), pages 466-479.
- Wang, Shixuan & Syntetos, Aris A. & Liu, Ying & Di Cairano-Gilfedder, Carla & Naim, Mohamed M., 2023. "Improving automotive garage operations by categorical forecasts using a large number of variables," European Journal of Operational Research, Elsevier, vol. 306(2), pages 893-908.
- Guillaume Coqueret & Tony Guida, 2020. "Training trees on tails with applications to portfolio choice," Post-Print hal-04144665, HAL.
- Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2020.
"lassopack: Model selection and prediction with regularized regression in Stata,"
Stata Journal, StataCorp LP, vol. 20(1), pages 176-235, March.
- Ahrens, Achim & Hansen, Christian B. & Schaffer, Mark E, 2019. "lassopack: Model Selection and Prediction with Regularized Regression in Stata," IZA Discussion Papers 12081, Institute of Labor Economics (IZA).
- Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2019. "lassopack: Model selection and prediction with regularized regression in Stata," Papers 1901.05397, arXiv.org.
- Fildes, Robert & Ma, Shaohui & Kolassa, Stephan, 2022. "Retail forecasting: Research and practice," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1283-1318.
- Fildes, Robert & Ma, Shaohui & Kolassa, Stephan, 2019. "Retail forecasting: research and practice," MPRA Paper 89356, University Library of Munich, Germany.
- Guillaume Coqueret & Tony Guida, 2020. "Training trees on tails with applications to portfolio choice," Annals of Operations Research, Springer, vol. 288(1), pages 181-221, May.
- Ma, Shaohui & Fildes, Robert, 2020. "Forecasting third-party mobile payments with implications for customer flow prediction," International Journal of Forecasting, Elsevier, vol. 36(3), pages 739-760.
- Ma, Shaohui & Fildes, Robert, 2017. "A retail store SKU promotions optimization model for category multi-period profit maximization," European Journal of Operational Research, Elsevier, vol. 260(2), pages 680-692.
- Ma, Shaohui & Fildes, Robert, 2021. "Retail sales forecasting with meta-learning," European Journal of Operational Research, Elsevier, vol. 288(1), pages 111-128.
- Gür Ali, Özden & Gürlek, Ragıp, 2020. "Automatic Interpretable Retail forecasting with promotional scenarios," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1389-1406.
- Baoqiang Zhan & Shu Zhang & Helen S. Du & Xiaoguang Yang, 2022. "Exploring Statistical Arbitrage Opportunities Using Machine Learning Strategy," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 861-882, October.
- Nikola Štefelová & Andreas Alfons & Javier Palarea-Albaladejo & Peter Filzmoser & Karel Hron, 2021. "Robust regression with compositional covariates including cellwise outliers," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(4), pages 869-909, December.
- Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
- González, Marta Ramos & Ureña, Antonio Partal & Fernández-Aguado, Pilar Gómez, 2023. "Forecasting for regulatory credit loss derived from the COVID-19 pandemic: A machine learning approach," Research in International Business and Finance, Elsevier, vol. 64(C).
- Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.
- Ning Xu & Jian Hong & Timothy C. G. Fisher, 2016.
"Model selection consistency from the perspective of generalization ability and VC theory with an application to Lasso,"
Papers
1606.00142, arXiv.org.
- Xu, Ning & Hong, Jian & Fisher, Timothy, 2016. "Model selection consistency from the perspective of generalization ability and VC theory with an application to Lasso," MPRA Paper 71670, University Library of Munich, Germany.
- Sander Gerritsen & Mark Kattenberg & Sonny Kuijpers, 2019. "The impact of age at arrival on education and mental health," CPB Discussion Paper 389.rdf, CPB Netherlands Bureau for Economic Policy Analysis.
More about this item
Keywords
Data science; Lasso; Outliers; Robust regression; Variable selection;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:297:y:2022:i:2:p:782-794. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.