Loading…

Human-Perception-Oriented Pseudo Analog Video Transmissions With Deep Learning

Recently, pseudo analog transmission has gained increasing attentions due to its ability to alleviate the cliff effect in video multicast scenarios. The existing pseudo analog systems are optimized under the minimum mean squared error criterion. However, their power allocation strategies do not take...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on vehicular technology 2020-09, Vol.69 (9), p.9896-9909
Main Authors: Tang, Xiao-Wei, Huang, Xin-Lin, Hu, Fei, Shi, Qingjiang
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Recently, pseudo analog transmission has gained increasing attentions due to its ability to alleviate the cliff effect in video multicast scenarios. The existing pseudo analog systems are optimized under the minimum mean squared error criterion. However, their power allocation strategies do not take the perceptual video quality into consideration. In this article, we propose a human-perception-based pseudo analog video transmission system named ROIC-Cast, which aims to intelligently enhance the transmission quality of the region-of-interest (ROI) parts. Firstly, the classic deep learning based saliency detection algorithm is adopted to decompose the continuous video sequences into ROI and non-ROI blocks. Secondly, an effective compression method is used to reduce the data amount of side information generated by the ROI extraction module. Then, the power allocation scheme is formulated as a convex problem, and the optimal transmission power for both ROI and non-ROI blocks is derived in a closed form. Finally, the simulations are conducted to validate the proposed system by comparing with a few of existing systems, e.g., KMV-Cast, SoftCast, and DAC-RAN. The proposed ROIC-Cast can achieve over 4.1 dB peak signal-to-noise ratio gains of ROI compared with other systems, given the channel signal-to-noise ratio as −5 dB, 0 dB, 5 dB, and 10 dB, respectively. This significant performance improvement is due to the automatic ROI extraction, high-efficiency data compression as well as adaptive power allocation.
ISSN:0018-9545
1939-9359
DOI:10.1109/TVT.2020.3003478