Deep Convolutional Network for Stereo Depth Mapping in Binocular Endoscopy
Published in: IEEE Access, 2020, Vol. 8, pp. 73241-73249
Format: Article
Language: English
Summary: Depth mapping from binocular endoscopy images plays an important role in stereoscopic surgical treatment. Owing to the development of deep convolutional neural networks (CNNs), binocular depth estimation models have achieved exciting results in autonomous driving and machine vision. However, applying these methods to endoscopic imaging is greatly limited by the fact that binocular endoscopic images are not only scarce but also poorly suited to standard pipelines: they lack texture and ground truth, and suffer from poor contrast and strong specular highlights. To address these problems, we built a precise gastrointestinal environment in the open-source software Blender to simulate abundant binocular endoscopy data, and we propose a 23-layer deep CNN that generates stereo depth maps in real time. An efficient scale-invariant loss function is introduced to accommodate the characteristics of endoscopic images, improving the accuracy of the resulting depth maps. Unlike typical CNNs, which need considerable training data, our method requires only a few images; the trained network processes 960×720 images at 45 frames per second on an NVIDIA GTX 1080 GPU, producing depth mapping information in real time with satisfactory accuracy. The effectiveness of the method is validated by comparison with state-of-the-art methods on the same datasets, demonstrating faster and more accurate performance than competing model frameworks.
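The summary names a scale-invariant loss as the key adaptation for endoscopic images but does not reproduce the formula. For reference, the widely used scale-invariant log-depth loss of Eigen et al. (2014) is sketched below; this is a minimal illustration of that standard formulation, not necessarily the exact variant used in this paper, and the function name, `lam` value, and `eps` guard are illustrative assumptions.

```python
import numpy as np

def scale_invariant_loss(pred_depth, true_depth, lam=0.5, eps=1e-8):
    """Scale-invariant log-depth loss in the style of Eigen et al. (2014).

    A global rescaling of the prediction shifts every log-difference d_i
    by the same constant; with lam = 1 the second term cancels that shift,
    so the loss penalizes only relative (scale-free) depth errors.

    pred_depth, true_depth: arrays of positive depths, same shape.
    lam: 0 gives a plain L2 loss in log space; 1 is fully scale-invariant.
    """
    d = np.log(pred_depth + eps) - np.log(true_depth + eps)
    n = d.size
    return (d ** 2).sum() / n - lam * (d.sum() ** 2) / n ** 2

# Example: a prediction that is correct up to a global scale factor.
true = np.random.uniform(1.0, 10.0, size=(720, 960))
pred = 2.0 * true                                     # off by a constant scale
print(scale_invariant_loss(pred, true, lam=1.0))      # ~0: scale error ignored
print(scale_invariant_loss(pred, true, lam=0.0))      # >0: plain log-L2 penalizes it
```

Such a loss suits endoscopy, where absolute scale is hard to recover from texture-poor images: the network is rewarded for getting the relative depth structure right rather than the absolute magnitude.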
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2020.2987767