IDEAS home Printed from https://ideas.repec.org/a/spr/coopap/v79y2021i2d10.1007_s10589-021-00279-2.html
   My bibliography  Save this article

Matrix optimization based Euclidean embedding with outliers

Author

Listed:
  • Qian Zhang

    (Beijing University of Technology)

  • Xinyuan Zhao

    (Beijing University of Technology)

  • Chao Ding

    (Chinese Academy of Sciences)

Abstract

Euclidean embedding from noisy observations containing outlier errors is an important and challenging problem in statistics and machine learning. Many existing methods would struggle with outliers due to a lack of detection ability. In this paper, we propose a matrix optimization based embedding model that can produce reliable embeddings and identify the outliers jointly. We show that the estimators obtained by the proposed method satisfy a non-asymptotic risk bound, implying that the model provides a high accuracy estimator with high probability when the order of the sample size is roughly the degree of freedom up to a logarithmic factor. Moreover, we show that under some mild conditions, the proposed model also can identify the outliers without any prior information with high probability. Finally, numerical experiments demonstrate that the matrix optimization-based model can produce configurations of high quality and successfully identify outliers even for large networks.

Suggested Citation

  • Qian Zhang & Xinyuan Zhao & Chao Ding, 2021. "Matrix optimization based Euclidean embedding with outliers," Computational Optimization and Applications, Springer, vol. 79(2), pages 235-271, June.
  • Handle: RePEc:spr:coopap:v:79:y:2021:i:2:d:10.1007_s10589-021-00279-2
    DOI: 10.1007/s10589-021-00279-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10589-021-00279-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10589-021-00279-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gale Young & A. Householder, 1938. "Discussion of a set of points in terms of their mutual distances," Psychometrika, Springer;The Psychometric Society, vol. 3(1), pages 19-22, March.
    2. Chao Ding & Hou-Duo Qi, 2017. "Convex Euclidean distance embedding for collaborative position localization with NLOS mitigation," Computational Optimization and Applications, Springer, vol. 66(1), pages 187-218, January.
    3. Qiang Sun & Wen-Xin Zhou & Jianqing Fan, 2020. "Adaptive Huber Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 254-265, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fengzhen Zhai & Qingna Li, 2020. "A Euclidean distance matrix model for protein molecular conformation," Journal of Global Optimization, Springer, vol. 76(4), pages 709-728, April.
    2. Panpan Yu & Qingna Li, 2018. "Ordinal Distance Metric Learning with MDS for Image Ranking," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 35(01), pages 1-19, February.
    3. Si-Tong Lu & Miao Zhang & Qing-Na Li, 2020. "Feasibility and a fast algorithm for Euclidean distance matrix optimization with ordinal constraints," Computational Optimization and Applications, Springer, vol. 76(2), pages 535-569, June.
    4. Yijun Zuo, 2023. "Non-asymptotic analysis and inference for an outlyingness induced winsorized mean," Statistical Papers, Springer, vol. 64(5), pages 1465-1481, October.
    5. Han, Dongxiao & Huang, Jian & Lin, Yuanyuan & Shen, Guohao, 2022. "Robust post-selection inference of high-dimensional mean regression with heavy-tailed asymmetric or heteroskedastic errors," Journal of Econometrics, Elsevier, vol. 230(2), pages 416-431.
    6. Pei Wang & Shunjie Chen & Sijia Yang, 2022. "Recent Advances on Penalized Regression Models for Biological Data," Mathematics, MDPI, vol. 10(19), pages 1-24, October.
    7. Chao Ding & Hou-Duo Qi, 2017. "Convex Euclidean distance embedding for collaborative position localization with NLOS mitigation," Computational Optimization and Applications, Springer, vol. 66(1), pages 187-218, January.
    8. Xiao, Xuan & Xu, Xingbai & Zhong, Wei, 2023. "Huber estimation for the network autoregressive model," Statistics & Probability Letters, Elsevier, vol. 203(C).
    9. Zha, Hongyuan & Zhang, Zhenyue, 2007. "Continuum Isomap for manifold learnings," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 184-200, September.
    10. Neil Shephard, 2020. "An estimator for predictive regression: reliable inference for financial economics," Papers 2008.06130, arXiv.org.
    11. Yang, Shuquan & Ling, Nengxiang, 2023. "Robust projected principal component analysis for large-dimensional semiparametric factor modeling," Journal of Multivariate Analysis, Elsevier, vol. 195(C).
    12. J. Carroll, 1985. "Review," Psychometrika, Springer;The Psychometric Society, vol. 50(1), pages 133-140, March.
    13. Peter Bossaerts & Shijie Huang & Nitin Yadav, 2020. "Exploiting Distributional Temporal Difference Learning to Deal with Tail Risk," Risks, MDPI, vol. 8(4), pages 1-20, October.
    14. Małgorzata Poniatowska-Jaksch, 2021. "Energy Consumption in Central and Eastern Europe (CEE) Households in the Platform Economics," Energies, MDPI, vol. 14(4), pages 1-22, February.
    15. Francis Caillez & Pascale Kuntz, 1996. "A contribution to the study of the metric and Euclidean structures of dissimilarities," Psychometrika, Springer;The Psychometric Society, vol. 61(2), pages 241-253, June.
    16. Malone, Samuel W. & Tarazaga, Pablo & Trosset, Michael W., 2002. "Better initial configurations for metric multidimensional scaling," Computational Statistics & Data Analysis, Elsevier, vol. 41(1), pages 143-156, November.
    17. Luo, Jiyu & Sun, Qiang & Zhou, Wen-Xin, 2022. "Distributed adaptive Huber regression," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    18. Aurea Grané & Rosario Romera, 2018. "On Visualizing Mixed-Type Data," Sociological Methods & Research, , vol. 47(2), pages 207-239, March.
    19. Joseph Zinnes & David MacKay, 1983. "Probabilistic multidimensional scaling: Complete and incomplete data," Psychometrika, Springer;The Psychometric Society, vol. 48(1), pages 27-48, March.
    20. Samuel Messick, 1956. "An empirical evaluation of multidimensional successive intervals," Psychometrika, Springer;The Psychometric Society, vol. 21(4), pages 367-375, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:coopap:v:79:y:2021:i:2:d:10.1007_s10589-021-00279-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.