Printed from https://ideas.repec.org/a/wsi/apjorx/v40y2023i01ns0217595923400055.html

Eigenvalue-Corrected Natural Gradient Based on a New Approximation

Author

Listed:
  • Kaixin Gao

    (School of Mathematics, Tianjin University, Tianjin 300350, P. R. China)

  • Zheng-Hai Huang

    (School of Mathematics, Tianjin University, Tianjin 300350, P. R. China)

  • Xiaolei Liu

    (School of Mathematics, Tianjin University, Tianjin 300350, P. R. China)

  • Min Wang

    (Central Software Institute, Huawei Technologies Co. Ltd, Hangzhou 310051, P. R. China)

  • Shuangling Wang

    (Central Software Institute, Huawei Technologies Co. Ltd, Hangzhou 310051, P. R. China)

  • Zidong Wang

    (Central Software Institute, Huawei Technologies Co. Ltd, Hangzhou 310051, P. R. China)

  • Dachuan Xu

    (Beijing Institute for Scientific and Engineering Computing, Beijing University of Technology, Beijing 100124, P. R. China)

  • Fan Yu

    (Central Software Institute, Huawei Technologies Co. Ltd, Hangzhou 310051, P. R. China)

Abstract

Second-order optimization methods for training deep neural networks (DNNs) have attracted considerable research interest. A recently proposed method, Eigenvalue-corrected Kronecker Factorization (EKFAC), interprets the natural gradient update as a diagonal method and corrects the inaccurate re-scaling factor in the KFAC eigenbasis. Another recent method for approximating the natural gradient, Trace-restricted Kronecker-factored Approximate Curvature (TKFAC), approximates the Fisher information matrix (FIM) as a constant multiplied by the Kronecker product of two matrices, so that the trace is the same before and after the approximation. In this work, we combine the ideas of these two methods and propose Trace-restricted Eigenvalue-corrected Kronecker Factorization (TEKFAC). The proposed method not only corrects the inexact re-scaling factor in the Kronecker-factored eigenbasis, but also adopts the new approximation and the effective damping technique of TKFAC. We also discuss the differences and relationships among the related Kronecker-factored approximations. Empirically, our method outperforms SGD with momentum, Adam, EKFAC and TKFAC on several DNNs.
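
The trace condition mentioned in the abstract can be illustrated with a small sketch. It relies on the identity tr(A ⊗ B) = tr(A) tr(B): if both Kronecker factors are scaled to unit trace and the constant is set to the trace of the matrix being approximated, the approximation has the same trace as the original. This is only one assumed way to satisfy the condition, not necessarily the construction used in the paper. The helper names (`trace`, `kron`, `trace_restricted_factors`) and the toy matrices are hypothetical.

```python
import math


def trace(m):
    """Sum of the diagonal entries of a square matrix given as a list of rows."""
    return sum(m[i][i] for i in range(len(m)))


def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [
        [a_ij * b_kl for a_ij in a_row for b_kl in b_row]
        for a_row in a
        for b_row in b
    ]


def trace_restricted_factors(fisher, a, g):
    """Return (delta, phi, psi) with tr(phi) = tr(psi) = 1 and delta = tr(fisher).

    Assumption: normalizing each factor to unit trace is an illustrative
    choice; the paper's exact definition of the factors may differ.
    """
    tr_a, tr_g = trace(a), trace(g)
    phi = [[x / tr_a for x in row] for row in a]
    psi = [[x / tr_g for x in row] for row in g]
    return trace(fisher), phi, psi


# Toy 4x4 symmetric matrix standing in for a layer's Fisher block (made up).
fisher = [
    [2.0, 0.1, 0.0, 0.2],
    [0.1, 1.0, 0.3, 0.0],
    [0.0, 0.3, 3.0, 0.1],
    [0.2, 0.0, 0.1, 2.0],
]
# Toy 2x2 Kronecker factors (made up).
a = [[2.0, 0.5], [0.5, 1.0]]
g = [[1.0, 0.2], [0.2, 3.0]]

delta, phi, psi = trace_restricted_factors(fisher, a, g)
approx = [[delta * x for x in row] for row in kron(phi, psi)]

# The scaled approximation matches the trace of the original matrix,
# while the unscaled product of the raw factors does not.
assert math.isclose(trace(approx), trace(fisher))
assert not math.isclose(trace(kron(a, g)), trace(fisher))
```

The sketch covers only the trace property. The eigenvalue correction described for EKFAC, which re-estimates the scaling in the Kronecker-factored eigenbasis, is not shown here.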

Suggested Citation

  • Kaixin Gao & Zheng-Hai Huang & Xiaolei Liu & Min Wang & Shuangling Wang & Zidong Wang & Dachuan Xu & Fan Yu, 2023. "Eigenvalue-Corrected Natural Gradient Based on a New Approximation," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 40(01), pages 1-18, February.
  • Handle: RePEc:wsi:apjorx:v:40:y:2023:i:01:n:s0217595923400055
    DOI: 10.1142/S0217595923400055

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0217595923400055
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0217595923400055?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

