Abstract
In cloud-edge environments, running all Deep Neural Network (DNN) models on the cloud causes significant network congestion and high latency, whereas the exclusive use of the edge device for execution limits the size and structure of the DNN, impacting accuracy. This paper introduces a novel partitioning approach for DNN inference between the edge and the cloud. This is the first work to consider simultaneous optimization of both the memory usage at the edge and the size of the data to be transferred over the wireless link. The experiments were performed on two different network architectures, MobileNetV1 and VGG16. The proposed approach makes it possible to execute part of the network on very constrained devices (e.g., microcontrollers), and under poor network conditions (e.g., LoRa) whilst retaining reasonable accuracies. Moreover, the results show that the choice of the optimal layer to split the network depends on the bandwidth and memory constraints, whereas prior work suggests that the best choice is always to split the network at higher layers. We demonstrate superior performance compared to existing techniques.
Original language | English |
---|---|
Title of host publication | 2021 18th IEEE International Conference on Sensing, Communication and Networking, SECON 2021 |
Editors | Luca Chiaraviglio, Chenyang Lu, Kay Romer |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 289-297 |
Number of pages | 9 |
ISBN (Electronic) | 9781665441087 |
ISBN (Print) | 9781665431118 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Event | IEEE International Conference on Sensing, Communication and Networking 2021 - Online, United States of America Duration: 6 Jul 2021 → 9 Jul 2021 Conference number: 18th https://ieeexplore-ieee-org.ezproxy.lib.monash.edu.au/xpl/conhome/9491576/proceeding (Proceedings) |
Publication series
Name | Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks workshops |
---|---|
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Volume | 2021-July |
ISSN (Print) | 2155-5486 |
ISSN (Electronic) | 2155-5494 |
Conference
Conference | IEEE International Conference on Sensing, Communication and Networking 2021 |
---|---|
Abbreviated title | SECON 2021 |
Country/Territory | United States of America |
Period | 6/07/21 → 9/07/21 |
Internet address |
Keywords
- CNN
- deep learning
- Edge computing
- tasks partitioning