IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v10y2010i2p259-266.html

Multivariate outlier detection in Stata

Author

Listed:
  • Vincenzo Verardi

    (University of Namur)

  • Catherine Dehon

    (Universite libre de Bruxelles)

Abstract

Before implementing any multivariate statistical analysis based on em- pirical covariance matrices, it is important to check whether outliers are present because their existence could induce significant biases. In this article, we present the minimum covariance determinant estimator, which is commonly used in ro- bust statistics to estimate location parameters and multivariate scales. These estimators can be used to robustify Mahalanobis distances and to identify outliers. Verardi and Croux (1999, Stata Journal 9: 439–453; 2010, Stata Journal 10: 313) programmed this estimator in Stata and made it available with the mcd command. The implemented algorithm is relatively fast and, as we show in the simulation example section, outperforms the methods already available in Stata, such as the Hadi method. Copyright 2010 by StataCorp LP.

Suggested Citation

  • Vincenzo Verardi & Catherine Dehon, 2010. "Multivariate outlier detection in Stata," Stata Journal, StataCorp LLC, vol. 10(2), pages 259-266, June.
  • Handle: RePEc:tsj:stataj:v:10:y:2010:i:2:p:259-266
    Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj10-2/st0192/
    as

    Download full text from publisher

    File URL: http://www.stata-journal.com/article.html?article=st0192
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hubert, Mia & Van Driessen, Katrien, 2004. "Fast and robust discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 45(2), pages 301-320, March.
    2. Vincenzo Verardi & Christophe Croux, 2009. "Robust regression in Stata," Stata Journal, StataCorp LLC, vol. 9(3), pages 439-453, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Jianzhou & Xiong, Shenghua, 2014. "A hybrid forecasting model based on outlier detection and fuzzy time series – A case study on Hainan wind farm of China," Energy, Elsevier, vol. 76(C), pages 526-541.
    2. Franz Fuerst & Pat McAllister & Karen Smith, 2010. "Eco-Labeling, Rents, Sales Prices and Occupancy Rates: Do LEED and Energy Star Labeled Offices Obtain Multiple Premiums?," Real Estate & Planning Working Papers rep-wp2010-01, Henley Business School, University of Reading.
    3. Joachim Wagner & Yama Temouri, 2021. "Do Outliers and Unobserved Heterogeneity Explain the Exporter Productivity Premium? Evidence from France, Germany and the United Kingdom," World Scientific Book Chapters, in: Joachim Wagner (ed.), MICROECONOMETRIC STUDIES OF FIRMS’ IMPORTS AND EXPORTS Advanced Methods of Analysis and Evidence from German Enterprises, chapter 13, pages 223-236, World Scientific Publishing Co. Pte. Ltd..
    4. Joachim Wagner, 2014. "Exports, foreign direct investments and productivity: are services firms different?," The Service Industries Journal, Taylor & Francis Journals, vol. 34(1), pages 24-37, January.
    5. Rui Li & Wei Liu & Yong Liu & Sang-Bing Tsai, 2018. "IPO Underpricing After the 2008 Financial Crisis: A Study of the Chinese Stock Markets," Sustainability, MDPI, vol. 10(8), pages 1-13, August.
    6. Joachim Wagner, 2016. "From Estimation Results to Stylized Facts: Twelve Recommendations for Empirical Research in International Activities of Heterogeneous Firms," World Scientific Book Chapters, in: Microeconometrics of International Trade, chapter 15, pages 479-514, World Scientific Publishing Co. Pte. Ltd..
    7. Rodolphe Desbordes & Vincenzo Verardi, 2017. "Foreign Direct Investment and Democracy: A Robust Fixed Effects Approach to a Complex Relationship," Pacific Economic Review, Wiley Blackwell, vol. 22(1), pages 43-82, February.
    8. Croux, Christophe & Joossens, Kristel, 2005. "Influence of observations on the misclassification probability in quadratic discriminant analysis," Journal of Multivariate Analysis, Elsevier, vol. 96(2), pages 384-403, October.
    9. Gustavo Canavire-Bacarreza & Luis Castro Peñarrieta & Darwin Ugarte Ontiveros, 2021. "Outliers in Semi-Parametric Estimation of Treatment Effects," Econometrics, MDPI, vol. 9(2), pages 1-32, April.
    10. Bai, Yiyi & Okullo, Samuel J., 2023. "Drivers and pass-through of the EU ETS price: Evidence from the power sector," Energy Economics, Elsevier, vol. 123(C).
    11. Tiago Sequeira & Hugo Morão, 2020. "Growth accounting and regressions: New approach and results," International Economics, CEPII research center, issue 162, pages 67-79.
    12. Nusret Sahin & Anthony A. Braga & Robert Apel & Rod K. Brunson, 2017. "The Impact of Procedurally-Just Policing on Citizen Perceptions of Police During Traffic Stops: The Adana Randomized Controlled Trial," Journal of Quantitative Criminology, Springer, vol. 33(4), pages 701-726, December.
    13. Thomas Soseco & Isnawati Hidayah & Nila Cahayati & Fajar Try Leksono, 2024. "Access to Technology to Increase Food Resilience in Rural Households in Indonesia," Economia agro-alimentare, FrancoAngeli Editore, vol. 2024(1), pages 109-135.
    14. Adam C. Sales & Ben B. Hansen, 2020. "Limitless Regression Discontinuity," Journal of Educational and Behavioral Statistics, , vol. 45(2), pages 143-174, April.
    15. Paolo Canofari & Giancarlo Marini & Pasquale Scaramozzino, 2013. "To Sleep, Perchance to Dream: Prices for Funeral Homes in US States," CEIS Research Paper 260, Tor Vergata University, CEIS, revised 11 Jan 2013.
    16. Wagner, Joachim, 2013. "The granular nature of the great export collapse in German manufacturing industries, 2008/2009," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy, vol. 7, pages 1-21.
    17. Kalin S. Kolev, 2019. "Do Investors Perceive Marking-to-Model as Marking-to-Myth? Early Evidence from FAS 157 Disclosure," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 9(02), pages 1-47, June.
    18. Song, Zisheng, 2021. "The capitalization of school quality in rents in the Beijing housing market: A propensity score method," Working Paper Series 21/7, Royal Institute of Technology, Department of Real Estate and Construction Management & Banking and Finance.
    19. Gani Aldashev & François Libois & Joaquín Morales Belpaire & Astrid Similon, 2014. "Encouraging Private Ownership of Public Goods: Theory and Evidence from Belgium," Working Papers 1408, University of Namur, Department of Economics.
    20. Pires, Ana M. & Branco, João A., 2010. "Projection-pursuit approach to robust linear discriminant analysis," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2464-2485, November.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:10:y:2010:i:2:p:259-266. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.