Loading…

Separable Convolutions for Optimizing 3D Stereo Networks

Deep learning based 3D stereo networks give superior performance compared to 2D networks and conventional stereo methods. However, this improvement in the performance comes at the cost of increased computational complexity, thus making these networks non-practical for the real-world applications. Sp...

Full description

Saved in:

Bibliographic Details
Main Authors:	Rahim, Rafia, Shamsafar, Faranak, Zell, Andreas
Format:	Conference Proceeding
Language:	English
Subjects:	CNNs Computational complexity Computational Efficiency Conferences Deep learning Disparity Estimation Image processing Separable Convolutions Stereo Matching Three-dimensional displays
Online Access:	Request full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Deep learning based 3D stereo networks give superior performance compared to 2D networks and conventional stereo methods. However, this improvement in the performance comes at the cost of increased computational complexity, thus making these networks non-practical for the real-world applications. Specifically, these networks use 3D convolutions as a major work horse to refine and regress disparities. In this work first, we show that these 3D convolutions in stereo networks consume up to 94% of overall network operations and act as a major bottleneck. Next, we propose a set of "plug-&-run" separable convolutions to reduce the number of parameters and operations. When integrated with the existing state of the art stereo networks, these convolutions lead up to 7\times reduction in number of operations and up to 3.5\times reduction in parameters without compromising their performance. In fact these convolutions lead to improvement in their performance in the majority of cases 1 1 This work is part of the project DeepStereoVision (FRE: 01IS18024B) sponsored by the German Ministry of Education & Research (BMBF).
ISSN:	2381-8549
DOI:	10.1109/ICIP42928.2021.9506330