IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v10y2010i2p259-266.html
   My bibliography  Save this article

Multivariate outlier detection in Stata

Author

Listed:
  • Vincenzo Verardi

    (University of Namur)

  • Catherine Dehon

    (Universite libre de Bruxelles)

Abstract

Before implementing any multivariate statistical analysis based on em- pirical covariance matrices, it is important to check whether outliers are present because their existence could induce significant biases. In this article, we present the minimum covariance determinant estimator, which is commonly used in ro- bust statistics to estimate location parameters and multivariate scales. These estimators can be used to robustify Mahalanobis distances and to identify outliers. Verardi and Croux (1999, Stata Journal 9: 439–453; 2010, Stata Journal 10: 313) programmed this estimator in Stata and made it available with the mcd command. The implemented algorithm is relatively fast and, as we show in the simulation example section, outperforms the methods already available in Stata, such as the Hadi method. Copyright 2010 by StataCorp LP.

Suggested Citation

  • Vincenzo Verardi & Catherine Dehon, 2010. "Multivariate outlier detection in Stata," Stata Journal, StataCorp LP, vol. 10(2), pages 259-266, June.
  • Handle: RePEc:tsj:stataj:v:10:y:2010:i:2:p:259-266
    Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj10-2/st0192/
    as

    Download full text from publisher

    File URL: http://www.stata-journal.com/article.html?article=st0192
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Vincenzo Verardi & Christophe Croux, 2009. "Robust regression in Stata," Stata Journal, StataCorp LP, vol. 9(3), pages 439-453, September.
    2. Hubert, Mia & Van Driessen, Katrien, 2004. "Fast and robust discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 45(2), pages 301-320, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Jianzhou & Xiong, Shenghua, 2014. "A hybrid forecasting model based on outlier detection and fuzzy time series – A case study on Hainan wind farm of China," Energy, Elsevier, vol. 76(C), pages 526-541.
    2. Petrella, Ivan & Di Pace, Federico & Juvenal, Luciana, 2020. "Terms-of-Trade Shocks are Not all Alike," CEPR Discussion Papers 14594, C.E.P.R. Discussion Papers.
    3. Angela Cipollone & Paolo E. Giordani, 2012. "Animal Spirits in Entrepreneurial Innovation: Theory and Evidence," Working Papers CELEG 1201, Dipartimento di Economia e Finanza, LUISS Guido Carli.
    4. Adam C. Sales & Ben B. Hansen, 2020. "Limitless Regression Discontinuity," Journal of Educational and Behavioral Statistics, , vol. 45(2), pages 143-174, April.
    5. Gregory Corcos & Massimo Del Gatto & Giordano Mion & Gianmarco I.P. Ottaviano, 2012. "Productivity and Firm Selection: Quantifying the ‘New’ Gains from Trade," Economic Journal, Royal Economic Society, vol. 122(561), pages 754-798, June.
    6. Song, Zisheng, 2021. "The capitalization of school quality in rents in the Beijing housing market: A propensity score method," Working Paper Series 21/7, Royal Institute of Technology, Department of Real Estate and Construction Management & Banking and Finance.
    7. Olivier Gergaud & Victor Ginsburgh & Florine Livat, 2012. "Success of Celebrities: Talent, Intelligence or Beauty?," Economics Bulletin, AccessEcon, vol. 32(4), pages 3120-3127.
    8. Sequeira, Tiago & Morão, Hugo, 2020. "Growth accounting and regressions: New approach and results," International Economics, Elsevier, vol. 162(C), pages 67-79.
    9. Christian Pfeifer & Joachim Wagner, 2014. "Age and gender effects of workforce composition on productivity and profits: Evidence from a new type of data for German enterprises," Contemporary Economics, University of Economics and Human Sciences in Warsaw., vol. 8(1), March.
    10. Luigi Moretti & Paola Valbonesi, 2012. "Subcontracting in Public Procurement: An Empirical Investigation," "Marco Fanno" Working Papers 0154, Dipartimento di Scienze Economiche "Marco Fanno".
    11. Appio, Francesco Paolo & Leone, Daniele & Platania, Federico & Schiavone, Francesco, 2020. "Why are rewards not delivered on time in rewards-based crowdfunding campaigns? An empirical exploration," Technological Forecasting and Social Change, Elsevier, vol. 157(C).
    12. Kohei Kubota, 2017. "Intergenerational Wealth Elasticity in Japan," The Japanese Economic Review, Japanese Economic Association, vol. 68(4), pages 470-496, December.
    13. Vincenzo Verardi & Joachim Wagner, 2021. "Productivity Premia for German Manufacturing Firms Exporting to the Euro-area and Beyond: First Evidence from Robust Fixed Effects Estimations," World Scientific Book Chapters, in: Joachim Wagner (ed.), MICROECONOMETRIC STUDIES OF FIRMS’ IMPORTS AND EXPORTS Advanced Methods of Analysis and Evidence from German Enterprises, chapter 7, pages 87-109, World Scientific Publishing Co. Pte. Ltd..
    14. Victor Ginsburgh & Olivier Gergaud & F. Livat, "undated". "Succes :Talent, Intelligence or Beauty?," ULB Institutional Repository 2013/151568, ULB -- Universite Libre de Bruxelles.
    15. Jan Ámos Víšek, 2015. "Estimating the Model with Fixed and Random Effects by a Robust Method," Methodology and Computing in Applied Probability, Springer, vol. 17(4), pages 999-1014, December.
    16. Neil Foster-McGregor & Anders Isaksson & Florian Kaulich, 2014. "Importing, exporting and performance in sub-Saharan African manufacturing firms," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 150(2), pages 309-336, May.
    17. Alper Sinan & B. Barıs Alkan, 2015. "A useful approach to identify the multicollinearity in the presence of outliers," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(5), pages 986-993, May.
    18. Joachim Wagner, 2014. "Exports, foreign direct investments and productivity: are services firms different?," The Service Industries Journal, Taylor & Francis Journals, vol. 34(1), pages 24-37, January.
    19. Libman Alexander & Feld Lars P., 2013. "Strategic Tax Collection and Fiscal Decentralization: The Case of Russia," German Economic Review, De Gruyter, vol. 14(4), pages 449-482, December.
    20. Horváth, Roman & Podpiera, Anca, 2012. "Heterogeneity in bank pricing policies: The Czech evidence," Economic Systems, Elsevier, vol. 36(1), pages 87-108.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:10:y:2010:i:2:p:259-266. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.