Author
Listed:
- Win-San Khwa
(Taiwan Semiconductor Manufacturing Company Limited (TSMC))
- Tai-Hao Wen
(National Tsing Hua University (NTHU))
- Hung-Hsi Hsu
(Taiwan Semiconductor Manufacturing Company Limited (TSMC)
National Tsing Hua University (NTHU))
- Wei-Hsing Huang
(National Tsing Hua University (NTHU))
- Yu-Chen Chang
(National Tsing Hua University (NTHU))
- Ting-Chien Chiu
(National Tsing Hua University (NTHU))
- Zhao-En Ke
(National Tsing Hua University (NTHU))
- Yu-Hsiang Chin
(National Tsing Hua University (NTHU))
- Hua-Jin Wen
(National Tsing Hua University (NTHU))
- Wei-Ting Hsu
(National Tsing Hua University (NTHU))
- Chung-Chuan Lo
(National Tsing Hua University (NTHU))
- Ren-Shuo Liu
(National Tsing Hua University (NTHU))
- Chih-Cheng Hsieh
(National Tsing Hua University (NTHU))
- Kea-Tiong Tang
(National Tsing Hua University (NTHU))
- Mon-Shu Ho
(National Chung Hsing University (NCHU))
- Ashwin Sanjay Lele
(Taiwan Semiconductor Manufacturing Company Limited (TSMC))
- Shih-Hsin Teng
(Taiwan Semiconductor Manufacturing Company Limited (TSMC))
- Chung-Cheng Chou
(Taiwan Semiconductor Manufacturing Company Limited (TSMC))
- Yu-Der Chih
(Taiwan Semiconductor Manufacturing Company Limited (TSMC))
- Tsung-Yung Jonathan Chang
(Taiwan Semiconductor Manufacturing Company Limited (TSMC))
- Meng-Fan Chang
(Taiwan Semiconductor Manufacturing Company Limited (TSMC)
National Tsing Hua University (NTHU))
Abstract
Artificial intelligence (AI) edge devices1–12 demand high-precision energy-efficient computations, large on-chip model storage, rapid wakeup-to-response time and cost-effective foundry-ready solutions. Floating point (FP) computation provides precision exceeding that of integer (INT) formats at the cost of higher power and storage overhead. Multi-level-cell (MLC) memristor compute-in-memory (CIM)13–15 provides compact non-volatile storage and energy-efficient computation but is prone to accuracy loss owing to process variation. Digital static random-access memory (SRAM)-CIM16–22 enables lossless computation; however, storage is low as a result of large bit-cell area and model loading is required during inference. Thus, conventional approaches using homogeneous CIM architectures and computation formats impose a trade-off between efficiency, storage, wakeup latency and inference accuracy. Here we present a mixed-precision heterogeneous CIM AI edge processor, which supports the layer-granular/kernel-granular partitioning of network layers among on-chip CIM architectures (that is, memristor-CIM, SRAM-CIM and tiny-digital units) and computation number formats (INT and FP) based on sensitivity to error. This layer-granular/kernel-granular flexibility allows simultaneous optimization within the two-dimensional design space at the hardware level. The proposed hardware achieved high energy efficiency (40.91 TFLOPS W−1 for ResNet-20 with CIFAR-100 and 28.63 TFLOPS W−1 for MobileNet-v2 with ImageNet), low accuracy degradation (
Suggested Citation
Win-San Khwa & Tai-Hao Wen & Hung-Hsi Hsu & Wei-Hsing Huang & Yu-Chen Chang & Ting-Chien Chiu & Zhao-En Ke & Yu-Hsiang Chin & Hua-Jin Wen & Wei-Ting Hsu & Chung-Chuan Lo & Ren-Shuo Liu & Chih-Cheng Hs, 2025.
"A mixed-precision memristor and SRAM compute-in-memory AI processor,"
Nature, Nature, vol. 639(8055), pages 617-623, March.
Handle:
RePEc:nat:nature:v:639:y:2025:i:8055:d:10.1038_s41586-025-08639-2
DOI: 10.1038/s41586-025-08639-2
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:639:y:2025:i:8055:d:10.1038_s41586-025-08639-2. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.