Author
Listed:
- Weier Wan
(Stanford University
University of California San Diego)
- Rajkumar Kubendran
(University of California San Diego
University of Pittsburgh)
- Clemens Schaefer
(University of Notre Dame)
- Sukru Burc Eryilmaz
(Stanford University)
- Wenqiang Zhang
(Tsinghua University)
- Dabin Wu
(Tsinghua University)
- Stephen Deiss
(University of California San Diego)
- Priyanka Raina
(Stanford University)
- He Qian
(Tsinghua University)
- Bin Gao
(Tsinghua University)
- Siddharth Joshi
(University of California San Diego
University of Notre Dame)
- Huaqiang Wu
(Tsinghua University)
- H.-S. Philip Wong
(Stanford University)
- Gert Cauwenberghs
(University of California San Diego)
Abstract
Realizing increasingly complex artificial intelligence (AI) functionalities directly on edge devices calls for unprecedented energy efficiency of edge hardware. Compute-in-memory (CIM) based on resistive random-access memory (RRAM)1 promises to meet such demand by storing AI model weights in dense, analogue and non-volatile RRAM devices, and by performing AI computation directly within RRAM, thus eliminating power-hungry data movement between separate compute and memory2–5. Although recent studies have demonstrated in-memory matrix-vector multiplication on fully integrated RRAM-CIM hardware6–17, it remains a goal for a RRAM-CIM chip to simultaneously deliver high energy efficiency, versatility to support diverse models and software-comparable accuracy. Although efficiency, versatility and accuracy are all indispensable for broad adoption of the technology, the inter-related trade-offs among them cannot be addressed by isolated improvements on any single abstraction level of the design. Here, by co-optimizing across all hierarchies of the design from algorithms and architecture to circuits and devices, we present NeuRRAM—a RRAM-based CIM chip that simultaneously delivers versatility in reconfiguring CIM cores for diverse model architectures, energy efficiency that is two-times better than previous state-of-the-art RRAM-CIM chips across various computational bit-precisions, and inference accuracy comparable to software models quantized to four-bit weights across various AI tasks, including accuracy of 99.0 percent on MNIST18 and 85.7 percent on CIFAR-1019 image classification, 84.7-percent accuracy on Google speech command recognition20, and a 70-percent reduction in image-reconstruction error on a Bayesian image-recovery task.
Suggested Citation
Weier Wan & Rajkumar Kubendran & Clemens Schaefer & Sukru Burc Eryilmaz & Wenqiang Zhang & Dabin Wu & Stephen Deiss & Priyanka Raina & He Qian & Bin Gao & Siddharth Joshi & Huaqiang Wu & H.-S. Philip , 2022.
"A compute-in-memory chip based on resistive random-access memory,"
Nature, Nature, vol. 608(7923), pages 504-512, August.
Handle:
RePEc:nat:nature:v:608:y:2022:i:7923:d:10.1038_s41586-022-04992-8
DOI: 10.1038/s41586-022-04992-8
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:608:y:2022:i:7923:d:10.1038_s41586-022-04992-8. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.