Normalization and dropout for stochastic computing-based deep convolutional neural networks

Bibliographic Details
Published in: Integration (Amsterdam) 2019-03, Vol. 65, p. 395-403
Main Authors: Li, Ji; Yuan, Zihao; Li, Zhe; Ren, Ao; Ding, Caiwen; Draper, Jeffrey; Nazarian, Shahin; Qiu, Qinru; Yuan, Bo; Wang, Yanzhi
Format: Article
Language: English
Description
Summary: Deep Convolutional Neural Networks (DCNNs) have recently been recognized as the most effective models for pattern recognition and classification tasks. With the rapid growth of the Internet of Things (IoT) and wearable devices, implementing DCNNs in embedded and portable systems has become attractive. However, DCNNs have high power consumption and complex topologies, so novel computing paradigms are urgently required to deploy them in systems with limited area and power supply. Recent works have demonstrated that Stochastic Computing (SC) can radically simplify the hardware implementation of arithmetic units and has the potential to bring the success of DCNNs to embedded systems. This paper introduces normalization and dropout, which are essential techniques for state-of-the-art DCNNs, to existing SC-based DCNN frameworks. In this work, the feature extraction block of DCNNs is implemented using an approximate parallel counter, a near-max pooling block, and an SC-based rectified linear activation unit. A novel SC-based normalization design is proposed, which includes a square and summation unit, an activation unit, and a division unit. The dropout technique is integrated into the training phase, and the learned weights are adjusted during the hardware implementation. Experimental results on AlexNet with the ImageNet dataset show that the SC-based DCNN with the proposed normalization and dropout techniques achieves a 3.26% top-1 accuracy improvement and a 3.05% top-5 accuracy improvement compared with the SC-based DCNN without these two essential techniques, confirming the effectiveness of our normalization and dropout designs.
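As a rough illustration of why SC simplifies arithmetic hardware, the sketch below multiplies two numbers using one XNOR per bit pair. It assumes the common bipolar encoding, in which a value x in [-1, 1] is represented by a bit-stream with P(bit = 1) = (x + 1) / 2; the summary does not state which encoding the paper uses, and all function names here are hypothetical.

```python
import random

def to_bipolar_stream(x, length, rng):
    """Encode x in [-1, 1] as a bit-stream with P(bit = 1) = (x + 1) / 2."""
    p_one = (x + 1.0) / 2.0
    return [1 if rng.random() < p_one else 0 for _ in range(length)]

def from_bipolar_stream(bits):
    """Decode a bipolar bit-stream back to a value in [-1, 1]."""
    return 2.0 * sum(bits) / len(bits) - 1.0

def sc_multiply(stream_a, stream_b):
    """Bipolar SC multiplication: a bitwise XNOR of two independent streams."""
    return [1 - (a ^ b) for a, b in zip(stream_a, stream_b)]

# Assumed example values; longer streams reduce the sampling noise.
rng = random.Random(0)
a = to_bipolar_stream(0.5, 4096, rng)
b = to_bipolar_stream(-0.4, 4096, rng)
product = from_bipolar_stream(sc_multiply(a, b))
print(product)  # an estimate of 0.5 * -0.4 = -0.2, subject to sampling noise
```

The result is only an estimate: accuracy depends on the stream length and on the two input streams being statistically independent.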
• A novel and efficient stochastic computing-based normalization design is proposed, comprising summation and square, activation, and division units.
• The dropout technique is integrated into stochastic computing-based deep convolutional neural network frameworks.
• Experimental results on AlexNet with the ImageNet dataset show that the proposed normalization and dropout techniques achieve a 3.26% top-1 accuracy improvement and a 3.05% top-5 accuracy improvement, confirming the effectiveness of our normalization and dropout designs.
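For orientation, the sketch below is a floating-point reference for the local response normalization used in the original AlexNet, which is assumed here to be the normalization the record refers to. It is not the paper's SC hardware design. The comments mark the square-and-summation and division steps; how the paper's activation unit fits in is not stated in this record, and the function name and default parameters are illustrative.

```python
def local_response_norm(acts, n=5, k=2.0, alpha=1e-4, beta=0.75):
    """Normalize each channel's activation by its neighbors across channels.

    acts holds the activations of all channels at one spatial position.
    """
    num_channels = len(acts)
    half = n // 2
    out = []
    for i, a in enumerate(acts):
        lo = max(0, i - half)
        hi = min(num_channels - 1, i + half)
        # Square-and-summation step over the neighboring channels.
        sq_sum = sum(acts[j] ** 2 for j in range(lo, hi + 1))
        # Scaling term raised to the power beta.
        denom = (k + alpha * sq_sum) ** beta
        # Division step.
        out.append(a / denom)
    return out

# Assumed toy input: four channel activations at one position.
normalized = local_response_norm([1.0, 2.0, 3.0, 4.0])
print(normalized)
```

A floating-point reference like this is mainly useful as a baseline against which an approximate hardware implementation can be compared.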
ISSN: 0167-9260, 1872-7522
DOI: 10.1016/j.vlsi.2017.11.002