Abstract
Recently, on-device inference using deep learning (DL) models for mobile and edge devices has attracted significant attention in ubiquitous computing due to its lower latency, better performance and increased data privacy. Adopting pre-trained DL models as the backbone for downstream tasks has become the consensus of the Artificial Intelligence (AI) community since it can remarkably accelerate the DL deployment process. However, most of the pre-trained DL models are not suitable for resource-constraint platforms. Further, there is a scarcity of platforms providing a unified way to store, query, share and reuse pre-trained DL models, especially for mobile applications. To address these limitations, this paper proposes an ontology-based platform (MobileDLSearch) that offers end-users greater flexibility to store, query, share and reuse pre-trained DL models for various mobile applications. The proposed Mo-bileDLSearch uses a standardised ontology to represent various DL models with different backends (e.g., TensorFlow, Keras and PyTorch), and provides an intuitive and interactive user interface to support search and retrieval of DL models. It also implements an automatic model converter to optimise desktop/laboratory-oriented pre-trained DL models for mobile platforms, and has an on-device real-time model integration module to benchmark the model's performance on mobile devices. The evaluation results demonstrate the usability of the proposed MobileDLSearch to help end-users quickly search, deploy and benchmark DL models for various on-device inference tasks.
Original language | English |
---|---|
Title of host publication | Proceedings - IEEE Congress on Cybermatics |
Subtitle of host publication | 2021 IEEE International Conferences on Internet of Things, iThings 2021, IEEE Green Computing and Communications, GreenCom 2021, IEEE Cyber, Physical and Social Computing, CPSCom 2021 and IEEE Smart Data, SmartData 2021 |
Editors | James Zheng, Xiao Liu, Tom Hao Luan, Prem Prakash Jayaraman, Haipeng Dai, Karan Mitra, Kai Qin, Rajiv Ranjan, Sheng Wen |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 51-58 |
Number of pages | 8 |
ISBN (Electronic) | 9781665417624 |
ISBN (Print) | 9781665417631 |
DOIs | |
Publication status | Published - 2021 |
Event | IEEE/ACM International Conference on Green Computing and Communications (GreenCom) / IEEE International Conference on Internet of Things (iThings) / IEEE International Conference on Cyber, Physica 2021 - Online, Melbourne, Australia Duration: 6 Dec 2021 → 8 Dec 2021 https://ieeexplore.ieee.org/xpl/conhome/9693994/proceeding (Proceedings) |
Conference
Conference | IEEE/ACM International Conference on Green Computing and Communications (GreenCom) / IEEE International Conference on Internet of Things (iThings) / IEEE International Conference on Cyber, Physica 2021 |
---|---|
Country/Territory | Australia |
City | Melbourne |
Period | 6/12/21 → 8/12/21 |
Internet address |
Keywords
- Deep Learning
- Edge Device
- Mobile Platform
- Ontology
- Semantic Search