IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i17p2144-d627901.html
   My bibliography  Save this article

NICE: Noise Injection and Clamping Estimation for Neural Network Quantization

Author

Listed:
  • Chaim Baskin

    (Department of Computer Science, Technion, Haifa 3200003, Israel
    These authors contributed equally to this work.)

  • Evgenii Zheltonozhkii

    (Department of Computer Science, Technion, Haifa 3200003, Israel
    These authors contributed equally to this work.)

  • Tal Rozen

    (Department of Electrical Engineering, Technion, Haifa 3200003, Israel
    These authors contributed equally to this work.)

  • Natan Liss

    (Department of Electrical Engineering, Technion, Haifa 3200003, Israel)

  • Yoav Chai

    (School of Electrical Engineering, Tel-Aviv University, Tel-Aviv 6997801, Israel)

  • Eli Schwartz

    (School of Electrical Engineering, Tel-Aviv University, Tel-Aviv 6997801, Israel)

  • Raja Giryes

    (School of Electrical Engineering, Tel-Aviv University, Tel-Aviv 6997801, Israel)

  • Alexander M. Bronstein

    (Department of Computer Science, Technion, Haifa 3200003, Israel)

  • Avi Mendelson

    (Department of Computer Science, Technion, Haifa 3200003, Israel)

Abstract

Convolutional Neural Networks (CNNs) are very popular in many fields including computer vision, speech recognition, natural language processing, etc. Though deep learning leads to groundbreaking performance in those domains, the networks used are very computationally demanding and are far from being able to perform in real-time applications even on a GPU, which is not power efficient and therefore does not suit low power systems such as mobile devices. To overcome this challenge, some solutions have been proposed for quantizing the weights and activations of these networks, which accelerate the runtime significantly. Yet, this acceleration comes at the cost of a larger error unless spatial adjustments are carried out. The method proposed in this work trains quantized neural networks by noise injection and a learned clamping, which improve accuracy. This leads to state-of-the-art results on various regression and classification tasks, e.g., ImageNet classification with architectures such as ResNet-18/34/50 with as low as 3 bit weights and activations. We implement the proposed solution on an FPGA to demonstrate its applicability for low-power real-time applications. The quantization code will become publicly available upon acceptance.

Suggested Citation

  • Chaim Baskin & Evgenii Zheltonozhkii & Tal Rozen & Natan Liss & Yoav Chai & Eli Schwartz & Raja Giryes & Alexander M. Bronstein & Avi Mendelson, 2021. "NICE: Noise Injection and Clamping Estimation for Neural Network Quantization," Mathematics, MDPI, vol. 9(17), pages 1-12, September.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:17:p:2144-:d:627901
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/17/2144/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/17/2144/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Alex Karbachevsky & Chaim Baskin & Evgenii Zheltonozhskii & Yevgeny Yermolin & Freddy Gabbay & Alex M. Bronstein & Avi Mendelson, 2021. "Early-Stage Neural Network Hardware Performance Analysis," Sustainability, MDPI, vol. 13(2), pages 1-20, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:17:p:2144-:d:627901. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.