TY - JOUR
T1 - Terabit encryption in a second
T2 - Performance evaluation of block ciphers in GPU with Kepler, Maxwell, and Pascal architectures
AU - Lee, Wai Kong
AU - Goi, Bok Min
AU - Phan, Raphael C.W.
N1 - Funding Information:
Universiti Tunku Abdul Rahman Research Fund (UTARRF), Grant/Award Number: IPSR/RMC/UTARRF/2016-C2/L04; Ministry of Science, Technology and Innovation (MOSTI), Malaysia, Grant/Award Number: 01-02-11-SF0202
Funding Information:
This work was supported partially by the Universiti Tunku Abdul Rahman Research Fund (UTARRF) under grant IPSR/RMC/UTARRF/2016-C2/L04 and by the Ministry of Science, Technology and Innovation (MOSTI), Malaysia under grant 01-02-11-SF0202. We would like to thank all the anonymous reviewers for their valuable comments. We would also like to thank the LEA design team for sharing their design resources with us.
Publisher Copyright:
© 2018 John Wiley & Sons, Ltd.
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2019/6/10
Y1 - 2019/6/10
N2 - With the emergence of IoT and cloud computing technologies, massive data are generated from various applications everyday and communicated through the Internet. Secure communication is essential to protect these data from malicious attacks. Block ciphers are one mechanism to offer such protection but unfortunately involve intensive computations that can be performance bottlenecks to the servers, especially when the data center needs to handle thousands of concurrent transactions. In this paper, we investigate the feasibility of the GPU as an accelerator to perform high-speed encryption in server environments. We present optimized implementations of a conventional block cipher (AES) and new lightweight block ciphers (LEA, Chaskey, SIMON, SPECK, and SIMECK) across three new GPU architectures (Kepler, Maxwell, and Pascal). For AES, we improve the fine-grain implementation by utilizing the warp shuffle instruction available in these three new GPU architectures, which yield a 6%-16% improvement over the previous implementations. For LEA, Chaskey, SIMON, SPECK, and SIMECK, we first analyze why they cannot have efficient fine-grain implementations in the GPU and then present our optimization techniques, which are able to achieve impressive encryption speeds of 1.912, 637, 1.485, 2.291, and 1.478 Tb/s, respectively, in GTX1080.
AB - With the emergence of IoT and cloud computing technologies, massive data are generated from various applications everyday and communicated through the Internet. Secure communication is essential to protect these data from malicious attacks. Block ciphers are one mechanism to offer such protection but unfortunately involve intensive computations that can be performance bottlenecks to the servers, especially when the data center needs to handle thousands of concurrent transactions. In this paper, we investigate the feasibility of the GPU as an accelerator to perform high-speed encryption in server environments. We present optimized implementations of a conventional block cipher (AES) and new lightweight block ciphers (LEA, Chaskey, SIMON, SPECK, and SIMECK) across three new GPU architectures (Kepler, Maxwell, and Pascal). For AES, we improve the fine-grain implementation by utilizing the warp shuffle instruction available in these three new GPU architectures, which yield a 6%-16% improvement over the previous implementations. For LEA, Chaskey, SIMON, SPECK, and SIMECK, we first analyze why they cannot have efficient fine-grain implementations in the GPU and then present our optimization techniques, which are able to achieve impressive encryption speeds of 1.912, 637, 1.485, 2.291, and 1.478 Tb/s, respectively, in GTX1080.
KW - block ciphers
KW - GPU
KW - lightweight block ciphers
KW - secure communication
UR - http://www.scopus.com/inward/record.url?scp=85055713037&partnerID=8YFLogxK
U2 - 10.1002/cpe.5048
DO - 10.1002/cpe.5048
M3 - Article
AN - SCOPUS:85055713037
VL - 31
JO - Concurrency and Computation: Practice and Experience
JF - Concurrency and Computation: Practice and Experience
SN - 1532-0626
IS - 11
M1 - e5048
ER -