TY - JOUR
T1 - Load balancing for heterogeneous serverless edge computing
T2 - A performance-driven and empirical approach
AU - Aslanpour, Mohammad Sadegh
AU - Toosi, Adel N.
AU - Cheema, Muhammad Aamir
AU - Chhetri, Mohan Baruwal
AU - Salehi, Mohsen Amini
N1 - Funding Information:
Dr. Adel N. Toosi is a Senior Lecturer and the Director of the DisNet Lab at the Department of Software Systems and Cybersecurity, Monash University, Australia. He completed his Ph.D. at the University of Melbourne in 2015 and has a portfolio of over 70 peer-reviewed publications in esteemed venues such as IEEE Transactions on Cloud Computing, IEEE Transactions on Services Computing, and IEEE Transactions on Sustainable Computing. His publications have received over 4,000 citations, contributing to a current h-index of 30. Beyond his citations and publications, Dr. Toosi has significantly advanced the foundations of cloud computing and played a vital role in creating tools and technologies, such as CloudSim, InterCloud, SipaaS, Clouds-Pi, Con-Pi, WattEdge, and AutoScaleSim. Throughout his career, he has been honored with numerous prestigious awards, including the AusPDC’21 Best Paper Award and Best Paper Candidate in ICSOC’21, as well as the recipient of the Australian Research Council Discovery project in 2022 and Linkage 2022. Dr. Toosi also served on the editorial board of Future Generation Computer Systems (FGCS) and has been a guest editor for several Special Issues. Furthermore, he has organized several workshops in his field, including Starless in PerCom 2022 and 2023. His research interests include Distributed Systems, Cloud/Fog/Edge Computing, Software-defined Networking, Serverless Computing, and Sustainable IT.
Publisher Copyright:
© 2024 The Author(s)
PY - 2024/5
Y1 - 2024/5
N2 - Serverless edge systems simplify the deployment of real-time AI-based Internet of Things (IoT) applications at the edge. However, the heterogeneity of edge computing nodes – in terms of both hardware and software – makes load balancing challenging in these systems. In this paper, we propose a performance-driven, empirical weight-tuning approach to achieve effective load balancing based on the characteristics and capabilities of the nodes. By extensively profiling the nodes, we gather knowledge on performance metrics such as throughput, energy efficiency, response time, AI accuracy, and cost. Using this acquired knowledge, we introduce a weighted round-robin strategy to optimize the performance metrics according to their observed significance. To address multiple objectives, we introduce a multi-objective method that aims to strike a balance between any arbitrary set of performance objectives simultaneously. Additionally, we explore a coordinated distributed approach to overcome the limitations of centralized load balancing. Next, we introduce Hedgi, a heterogeneous serverless edge architecture designed to efficiently configure and utilize the derived load balancing policies, validated empirically. To demonstrate the practicality of Hedgi, we containerize and serverlessize a real-time object detection application. Extensive empirical studies are conducted using Hedgi to evaluate the performance of the proposed load balancing approach. The results provide valuable insights into the design trade-offs of various load balancing policies and system designs in the heterogeneous serverless edge.
AB - Serverless edge systems simplify the deployment of real-time AI-based Internet of Things (IoT) applications at the edge. However, the heterogeneity of edge computing nodes – in terms of both hardware and software – makes load balancing challenging in these systems. In this paper, we propose a performance-driven, empirical weight-tuning approach to achieve effective load balancing based on the characteristics and capabilities of the nodes. By extensively profiling the nodes, we gather knowledge on performance metrics such as throughput, energy efficiency, response time, AI accuracy, and cost. Using this acquired knowledge, we introduce a weighted round-robin strategy to optimize the performance metrics according to their observed significance. To address multiple objectives, we introduce a multi-objective method that aims to strike a balance between any arbitrary set of performance objectives simultaneously. Additionally, we explore a coordinated distributed approach to overcome the limitations of centralized load balancing. Next, we introduce Hedgi, a heterogeneous serverless edge architecture designed to efficiently configure and utilize the derived load balancing policies, validated empirically. To demonstrate the practicality of Hedgi, we containerize and serverlessize a real-time object detection application. Extensive empirical studies are conducted using Hedgi to evaluate the performance of the proposed load balancing approach. The results provide valuable insights into the design trade-offs of various load balancing policies and system designs in the heterogeneous serverless edge.
KW - Edge computing
KW - Function-as-a-service
KW - Heterogeneity
KW - Load balancing
KW - Performance
KW - Serverless
UR - https://www.scopus.com/pages/publications/85184753148
U2 - 10.1016/j.future.2024.01.020
DO - 10.1016/j.future.2024.01.020
M3 - Article
AN - SCOPUS:85184753148
SN - 0167-739X
VL - 154
SP - 266
EP - 280
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -