Data-Aware Compression of Neural Networks

Bibliographic Details
Published in: IEEE Computer Architecture Letters, 2021-07, Vol. 20 (2), p. 94-97
Main Authors: Falahati, Hajar; Peyro, Masoud; Amini, Hossein; Taghian, Mehran; Sadrosadati, Mohammad; Lotfi-Kamran, Pejman; Sarbazi-Azad, Hamid
Format: Article
Language: English
Description
Summary: Deep neural networks (DNNs) are getting deeper and larger, which intensifies their data movement and compute demands. Prior work reduces data movement and computation by exploiting sparsity and similarity. However, none of that work exploits input similarity; it focuses only on sparsity and weight similarity. By jointly analysing the similarity and sparsity of inputs and weights, we show that memory accesses and computations can be reduced by 5.7× and 4.1×, respectively, beyond what exploiting sparsity alone achieves, and by 3.9× and 2.1×, respectively, beyond what exploiting weight similarity alone achieves. We propose a new data-aware compression approach, called DANA, that effectively utilizes both sparsity and similarity in inputs and weights. DANA is orthogonal to the underlying hardware and can be implemented on top of different DNN accelerators. As an example, we implement DANA on top of an Eyeriss-like architecture. Our results over four well-known DNNs show that DANA outperforms Eyeriss in average performance and energy consumption by 18× and 83×, respectively. Moreover, DANA is faster than the state-of-the-art sparsity-aware and similarity-aware techniques by 4.6× and 4.5×, respectively, and reduces average energy consumption relative to them by 3.0× and 5.8×, respectively.
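The summary does not describe how DANA works internally, so the following is only a toy illustration of the general idea it names: skipping zero operands (sparsity) and reusing a product when the same input and weight values recur (similarity). It assumes small integer (quantized) values, and the function names and example vectors are hypothetical; it is not the authors' mechanism.

```python
def dot_dense(inputs, weights):
    """Baseline: one multiplication per (input, weight) pair."""
    total = sum(x * w for x, w in zip(inputs, weights))
    return total, len(inputs)


def dot_skip_zeros(inputs, weights):
    """Sparsity only: skip pairs where the input or the weight is zero."""
    total, mults = 0, 0
    for x, w in zip(inputs, weights):
        if x != 0 and w != 0:
            total += x * w
            mults += 1
    return total, mults


def dot_skip_and_reuse(inputs, weights):
    """Sparsity plus similarity: multiply each distinct nonzero
    (input, weight) value pair once and reuse the cached product."""
    cache = {}
    total, mults = 0, 0
    for x, w in zip(inputs, weights):
        if x == 0 or w == 0:
            continue
        if (x, w) not in cache:
            cache[(x, w)] = x * w  # the only place a multiplication happens
            mults += 1
        total += cache[(x, w)]
    return total, mults


# Hypothetical quantized activations and weights with zeros and repeats.
inputs = [0, 3, 3, 0, 5, 3, 5, 0]
weights = [2, 4, 4, 7, 0, 4, 2, 1]

for fn in (dot_dense, dot_skip_zeros, dot_skip_and_reuse):
    result, mults = fn(inputs, weights)
    print(f"{fn.__name__}: result={result}, multiplications={mults}")
```

All three functions return the same dot product; only the multiplication count differs. How much is saved depends entirely on how many zeros and repeated value pairs the data contains, which is why the figures in the summary are specific to the networks the authors evaluated.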
ISSN: 1556-6056, 1556-6064
DOI: 10.1109/LCA.2021.3096191