IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v74y2018i1p145-154.html
   My bibliography  Save this article

Robust mislabel logistic regression without modeling mislabel probabilities

Author

Listed:
  • Hung Hung
  • Zhi†Yu Jou
  • Su†Yun Huang

Abstract

Logistic regression is among the most widely used statistical methods for linear discriminant analysis. In many applications, we only observe possibly mislabeled responses. Fitting a conventional logistic regression can then lead to biased estimation. One common resolution is to fit a mislabel logistic regression model, which takes into consideration of mislabeled responses. Another common method is to adopt a robust M†estimation by down†weighting suspected instances. In this work, we propose a new robust mislabel logistic regression based on γ†divergence. Our proposal possesses two advantageous features: (1) It does not need to model the mislabel probabilities. (2) The minimum γ†divergence estimation leads to a weighted estimating equation without the need to include any bias correction term, that is, it is automatically bias†corrected. These features make the proposed γ†logistic regression more robust in model fitting and more intuitive for model interpretation through a simple weighting scheme. Our method is also easy to implement, and two types of algorithms are included. Simulation studies and the Pima data application are presented to demonstrate the performance of γ†logistic regression.

Suggested Citation

  • Hung Hung & Zhi†Yu Jou & Su†Yun Huang, 2018. "Robust mislabel logistic regression without modeling mislabel probabilities," Biometrics, The International Biometric Society, vol. 74(1), pages 145-154, March.
  • Handle: RePEc:bla:biomet:v:74:y:2018:i:1:p:145-154
    DOI: 10.1111/biom.12726
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12726
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.12726?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Fujisawa, Hironori & Eguchi, Shinto, 2008. "Robust parameter estimation with a small bias against heavy contamination," Journal of Multivariate Analysis, Elsevier, vol. 99(9), pages 2053-2081, October.
    2. Takafumi Kanamori & Hironori Fujisawa, 2015. "Robust estimation under heavy contamination using unnormalized models," Biometrika, Biometrika Trust, vol. 102(3), pages 559-572.
    3. Kenichi Hayashi, 2012. "A boosting method with asymmetric mislabeling probabilities which depend on covariates," Computational Statistics, Springer, vol. 27(2), pages 203-218, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Miron, Julien & Poilane, Benjamin & Cantoni, Eva, 2022. "Robust polytomous logistic regression," Computational Statistics & Data Analysis, Elsevier, vol. 176(C).
    2. Takayuki Kawashima & Hironori Fujisawa, 2023. "Robust regression against heavy heterogeneous contamination," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 86(4), pages 421-442, May.
    3. Mingyang Ren & Sanguo Zhang & Qingzhao Zhang, 2021. "Robust high-dimensional regression for data with anomalous responses," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(4), pages 703-736, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hirose, Kei & Fujisawa, Hironori & Sese, Jun, 2017. "Robust sparse Gaussian graphical modeling," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 172-190.
    2. Takayuki Kawashima & Hironori Fujisawa, 2023. "Robust regression against heavy heterogeneous contamination," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 86(4), pages 421-442, May.
    3. Mingyang Ren & Sanguo Zhang & Qingzhao Zhang, 2021. "Robust high-dimensional regression for data with anomalous responses," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(4), pages 703-736, August.
    4. Arun Kumar Kuchibhotla & Somabha Mukherjee & Ayanendranath Basu, 2019. "Statistical inference based on bridge divergences," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(3), pages 627-656, June.
    5. Gayen, Atin & Kumar, M. Ashok, 2021. "Projection theorems and estimating equations for power-law models," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    6. Yu-Chuan Chen & Hyejung Ha & Hyunjoong Kim & Hongshik Ahn, 2014. "Canonical Forest," Computational Statistics, Springer, vol. 29(3), pages 849-867, June.
    7. Liu, Yan, 2017. "Robust parameter estimation for stationary processes by an exotic disparity from prediction problem," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 120-130.
    8. A. Philip Dawid & Monica Musio & Laura Ventura, 2016. "Minimum Scoring Rule Inference," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 123-138, March.
    9. Abhijit Mandal & Beste Hamiye Beyaztas & Soutir Bandyopadhyay, 2023. "Robust density power divergence estimates for panel data models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(5), pages 773-798, October.
    10. Paola Stolfi & Mauro Bernardi & Davide Vergni, 2022. "Robust estimation of time-dependent precision matrix with application to the cryptocurrency market," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-25, December.
    11. A. Basu & A. Mandal & N. Martin & L. Pardo, 2015. "Robust tests for the equality of two normal means based on the density power divergence," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 78(5), pages 611-634, July.
    12. Avijit Maji & Abhik Ghosh & Ayanendranath Basu & Leandro Pardo, 2019. "Robust statistical inference based on the C-divergence family," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(5), pages 1289-1322, October.
    13. Catania, Leopoldo & Luati, Alessandra, 2020. "Robust estimation of a location parameter with the integrated Hogg function," Statistics & Probability Letters, Elsevier, vol. 164(C).
    14. Ting-Li Chen, 2015. "On the convergence and consistency of the blurring mean-shift process," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(1), pages 157-176, February.
    15. Shogo Kato & Shinto Eguchi, 2016. "Robust estimation of location and concentration parameters for the von Mises–Fisher distribution," Statistical Papers, Springer, vol. 57(1), pages 205-234, March.
    16. Masashi Sugiyama & Taiji Suzuki & Takafumi Kanamori, 2012. "Density-ratio matching under the Bregman divergence: a unified framework of density-ratio estimation," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 64(5), pages 1009-1044, October.
    17. Chen, Ting-Li & Fujisawa, Hironori & Huang, Su-Yun & Hwang, Chii-Ruey, 2016. "On the weak convergence and Central Limit Theorem of blurring and nonblurring processes with application to robust location estimation," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 165-184.
    18. Yuri Goegebeur & Armelle Guillou & Jing Qin, 2023. "Robust estimation of the conditional stable tail dependence function," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(2), pages 201-231, April.
    19. M. Ekström & S. M. Mirakhmedov & S. Rao Jammalamadaka, 2020. "A class of asymptotically efficient estimators based on sample spacings," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 617-636, September.
    20. Aida Toma & Samuela Leoni-Aubin, 2015. "Robust Portfolio Optimization Using Pseudodistances," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-26, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:74:y:2018:i:1:p:145-154. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.