IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v54y2010i12p3242-3248.html
   My bibliography  Save this article

An evolutionary algorithm for robust regression

Author

Listed:
  • Nunkesser, Robin
  • Morell, Oliver

Abstract

A drawback of robust statistical techniques is the increased computational effort often needed as compared to non-robust methods. Particularly, robust estimators possessing the exact fit property are NP-hard to compute. This means that--under the widely believed assumption that the computational complexity classes NP and P are not equal--there is no hope to compute exact solutions for large high dimensional data sets. To tackle this problem, search heuristics are used to compute NP-hard estimators in high dimensions. A new evolutionary algorithm that is applicable to different robust estimators is presented. Further, variants of this evolutionary algorithm for selected estimators--most prominently least trimmed squares and least median of squares--are introduced and shown to outperform existing popular search heuristics in difficult data situations. The results increase the applicability of robust methods and underline the usefulness of evolutionary algorithms for computational statistics.

Suggested Citation

  • Nunkesser, Robin & Morell, Oliver, 2010. "An evolutionary algorithm for robust regression," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3242-3248, December.
  • Handle: RePEc:eee:csdana:v:54:y:2010:i:12:p:3242-3248
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00162-3
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hawkins, Douglas M., 1993. "The feasible set algorithm for least median of squares regression," Computational Statistics & Data Analysis, Elsevier, vol. 16(1), pages 81-101, June.
    2. Nunkesser, Robin, 2008. "RFreak-An R-package for evolutionary computation," Technical Reports 2008,12, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    3. Todorov, Valentin, 1992. "Computing the minimum covariance determinant estimator (MCD) by simulated annealing," Computational Statistics & Data Analysis, Elsevier, vol. 14(4), pages 515-525, November.
    4. Hossjer, O. & Croux, C. & Rousseeuw, P. J., 1994. "Asymptotics of Generalized S-Estimators," Journal of Multivariate Analysis, Elsevier, vol. 51(1), pages 148-177, October.
    5. Agullo, Jose, 2001. "New algorithms for computing the least trimmed squares regression estimator," Computational Statistics & Data Analysis, Elsevier, vol. 36(4), pages 425-439, June.
    6. Hawkins, Douglas M. & Olive, David J., 1999. "Improved feasible solution algorithms for high breakdown estimation," Computational Statistics & Data Analysis, Elsevier, vol. 30(1), pages 1-11, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Torti, Francesca & Perrotta, Domenico & Atkinson, Anthony C. & Riani, Marco, 2012. "Benchmark testing of algorithms for very robust regression: FS, LMS and LTS," Computational Statistics & Data Analysis, Elsevier, vol. 56(8), pages 2501-2512.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nunkesser, Robin & Morell, Oliver, 2008. "Evolutionary algorithms for robust methods," Technical Reports 2008,29, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    2. Nguyen, T.D. & Welsch, R., 2010. "Outlier detection and least trimmed squares approximation using semi-definite programming," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3212-3226, December.
    3. Hawkins, Douglas M. & Olive, David, 1999. "Applications and algorithms for least trimmed sum of absolute deviations regression," Computational Statistics & Data Analysis, Elsevier, vol. 32(2), pages 119-134, December.
    4. Schyns, M. & Haesbroeck, G. & Critchley, F., 2010. "RelaxMCD: Smooth optimisation for the Minimum Covariance Determinant estimator," Computational Statistics & Data Analysis, Elsevier, vol. 54(4), pages 843-857, April.
    5. Agullo, Jose, 2001. "New algorithms for computing the least trimmed squares regression estimator," Computational Statistics & Data Analysis, Elsevier, vol. 36(4), pages 425-439, June.
    6. Čížek, Pavel, 2008. "General Trimmed Estimation: Robust Approach To Nonlinear And Limited Dependent Variable Models," Econometric Theory, Cambridge University Press, vol. 24(6), pages 1500-1529, December.
    7. L. Pitsoulis & G. Zioutas, 2010. "A fast algorithm for robust regression with penalised trimmed squares," Computational Statistics, Springer, vol. 25(4), pages 663-689, December.
    8. J. L. Alfaro & J. Fco. Ortega, 2009. "A comparison of robust alternatives to Hotelling's T2 control chart," Journal of Applied Statistics, Taylor & Francis Journals, vol. 36(12), pages 1385-1396.
    9. W. Ip & Ying Yang & P. Kwan & Y. Kwan, 2003. "Strong convergence rate of the least median absolute estimator in linear regression models," Statistical Papers, Springer, vol. 44(2), pages 183-201, April.
    10. Neath, Andrew A. & Cavanaugh, Joseph E., 2000. "A regression model selection criterion based on bootstrap bumping for use with resistant fitting," Computational Statistics & Data Analysis, Elsevier, vol. 35(2), pages 155-169, December.
    11. Vanessa Berenguer-Rico & Søren Johansen & Bent Nielsen, 2019. "Models where the Least Trimmed Squares and Least Median of Squares estimators are maximum likelihood," CREATES Research Papers 2019-15, Department of Economics and Business Economics, Aarhus University.
    12. Pokojovy, Michael & Jobe, J. Marcus, 2022. "A robust deterministic affine-equivariant algorithm for multivariate location and scatter," Computational Statistics & Data Analysis, Elsevier, vol. 172(C).
    13. Roelant, E. & Van Aelst, S. & Croux, C., 2009. "Multivariate generalized S-estimators," Journal of Multivariate Analysis, Elsevier, vol. 100(5), pages 876-887, May.
    14. Selin Ahipaşaoğlu, 2015. "Fast algorithms for the minimum volume estimator," Journal of Global Optimization, Springer, vol. 62(2), pages 351-370, June.
    15. Cerioli, Andrea & Farcomeni, Alessio & Riani, Marco, 2014. "Strong consistency and robustness of the Forward Search estimator of multivariate location and scatter," Journal of Multivariate Analysis, Elsevier, vol. 126(C), pages 167-183.
    16. Cizek, P., 2007. "General Trimmed Estimation : Robust Approach to Nonlinear and Limited Dependent Variable Models (Replaces DP 2007-1)," Other publications TiSEM eeccf622-dd18-41d4-a2f9-b, Tilburg University, School of Economics and Management.
    17. Winker, Peter & Gilli, Manfred, 2004. "Applications of optimization heuristics to estimation and modelling problems," Computational Statistics & Data Analysis, Elsevier, vol. 47(2), pages 211-223, September.
    18. Ranganai, Edmore, 2016. "Quality of fit measurement in regression quantiles: An elemental set method approach," Statistics & Probability Letters, Elsevier, vol. 111(C), pages 18-25.
    19. Hossjer, Ola, 1995. "Exact computation of the least trimmed squares estimate in simple linear regression," Computational Statistics & Data Analysis, Elsevier, vol. 19(3), pages 265-282, March.
    20. Lind, John C. & Wiens, Douglas P. & Yohai, Victor J., 2013. "Robust minimum information loss estimation," Computational Statistics & Data Analysis, Elsevier, vol. 65(C), pages 98-112.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:54:y:2010:i:12:p:3242-3248. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.