Distributed Inference Acceleration with Adaptive DNN Partitioning and Offloading

Deep neural networks (DNNs) are the de facto solution behind many of today's intelligent applications, ranging from machine translation to autonomous driving. DNNs are accurate but resource-intensive, especially for embedded devices such as mobile phones and smart objects in the Internet of Things. To...

Bibliographic Details
Main Authors: Mohammed, Thaha; Joe-Wong, Carlee; Babbar, Rohit; Francesco, Mario Di
Format: Conference Proceeding
Language: English