Highly Efficient, Linear-Scaling Seminumerical Exact-Exchange Method for Graphic Processing Units

Henryk Laqua; Travis H. Thompson; Jörg Kussmann; Christian Ochsenfeld

doi:10.1021/acs.jctc.9b00860

Highly Efficient, Linear-Scaling Seminumerical Exact-Exchange Method for Graphic Processing Units

Journal of Chemical Theory and Computation ◽

10.1021/acs.jctc.9b00860 ◽

2020 ◽

Vol 16 (3) ◽

pp. 1456-1468 ◽

Cited By ~ 4

Author(s):

Henryk Laqua ◽

Travis H. Thompson ◽

Jörg Kussmann ◽

Christian Ochsenfeld

Keyword(s):

Linear Scaling ◽

Graphic Processing Units ◽

Exchange Method ◽

Highly Efficient ◽

Exact Exchange ◽

Graphic Processing

Download Full-text

Highly Efficient Implementation of Block Ciphers on Graphic Processing Units for Massively Large Data

Applied Sciences ◽

10.3390/app10113711 ◽

2020 ◽

Vol 10 (11) ◽

pp. 3711 ◽

Cited By ~ 1

Author(s):

SangWoo An ◽

Seog Chung Seo

Keyword(s):

Cloud Computing ◽

Block Cipher ◽

Personal Information ◽

Large Data ◽

Block Ciphers ◽

Optimization Techniques ◽

Graphic Processing Units ◽

Highly Efficient ◽

Cloud Computing Service ◽

Graphic Processing

With the advent of IoT and Cloud computing service technology, the size of user data to be managed and file data to be transmitted has been significantly increased. To protect users’ personal information, it is necessary to encrypt it in secure and efficient way. Since servers handling a number of clients or IoT devices have to encrypt a large amount of data without compromising service capabilities in real-time, Graphic Processing Units (GPUs) have been considered as a proper candidate for a crypto accelerator for processing a huge amount of data in this situation. In this paper, we present highly efficient implementations of block ciphers on NVIDIA GPUs (especially, Maxwell, Pascal, and Turing architectures) for environments using massively large data in IoT and Cloud computing applications. As block cipher algorithms, we choose AES, a representative standard block cipher algorithm; LEA, which was recently added in ISO/IEC 29192-2:2019 standard; and CHAM, a recently developed lightweight block cipher algorithm. To maximize the parallelism in the encryption process, we utilize Counter (CTR) mode of operation and customize it by using GPU’s characteristics. We applied several optimization techniques with respect to the characteristics of GPU architecture such as kernel parallelism, memory optimization, and CUDA stream. Furthermore, we optimized each target cipher by considering the algorithmic characteristics of each cipher by implementing the core part of each cipher with handcrafted inline PTX (Parallel Thread eXecution) codes, which are virtual assembly codes in CUDA platforms. With the application of our optimization techniques, in our implementation on RTX 2070 GPU, AES and LEA show up to 310 Gbps and 2.47 Tbps of throughput, respectively, which are 10.7% and 67% improved compared with the 279.86 Gbps and 1.47 Tbps of the previous best result. In the case of CHAM, this is the first optimized implementation on GPUs and it achieves 3.03 Tbps of throughput on RTX 2070 GPU.

Download Full-text

Accelerating Extended Hamming Code Decoders on Graphic Processing Units for High Speed Communication

IEICE Transactions on Communications ◽

10.1587/transcom.e97.b.1050 ◽

2014 ◽

Vol E97.B (5) ◽

pp. 1050-1058 ◽

Cited By ~ 2

Author(s):

Md Shohidul ISLAM ◽

Jong-Myon KIM

Keyword(s):

High Speed ◽

Hamming Code ◽

Graphic Processing Units ◽

Graphic Processing

Download Full-text

A highly efficient Fe-Ni-S/NF hybrid electrode for promoting oxygen evolution performance

Chemical Communications ◽

10.1039/d1cc00569c ◽

2021 ◽

Author(s):

YuYun Chen ◽

Yang Xu ◽

Shuai Niu ◽

Jun Yan ◽

Ye-Yu Wu ◽

...

Keyword(s):

Ion Exchange ◽

Oxygen Evolution ◽

Hierarchical Structure ◽

Alkaline Solution ◽

Exchange Method ◽

Highly Efficient ◽

Ion Exchange Method

In this study, a Fe-Ni-S/NF hybrid electrode with hierarchical structure was fabricated via a simple hydrothermal and ion exchange method, which exhibits remarkable OER performance in alkaline solution with an...

Download Full-text

Preparation of BiOCl/Bi2S3 composites by simple ion exchange method for highly efficient photocatalytic reduction of Cr6+

Applied Surface Science ◽

10.1016/j.apsusc.2019.145000 ◽

2020 ◽

Vol 506 ◽

pp. 145000 ◽

Cited By ~ 5

Author(s):

Yun Lu ◽

Jimei Song ◽

Wenfang Li ◽

Yali Pan ◽

Huiyao Fang ◽

...

Keyword(s):

Ion Exchange ◽

Photocatalytic Reduction ◽

Exchange Method ◽

Highly Efficient ◽

Ion Exchange Method

Download Full-text

Parallel genome-wide analysis with central and graphic processing units

2015 IEEE International Conference on Computer and Communications (ICCC) ◽

10.1109/compcomm.2015.7387579 ◽

2015 ◽

Cited By ~ 1

Author(s):

Muhamad Fitra Kacamarga ◽

James W. Baurley ◽

Bens Pardamean

Keyword(s):

Graphic Processing Units ◽

Genome Wide Analysis ◽

Genome Wide ◽

Graphic Processing

Download Full-text

Lattice-boltzmann Navier-stokes Simulation on Graphic Processing Units

Asian Journal of Applied Sciences ◽

10.3923/ajaps.2011.762.770 ◽

2011 ◽

Vol 4 (8) ◽

pp. 762-770 ◽

Cited By ~ 1

Author(s):

Pablo Rafael Rinaldi ◽

Enzo Alberto Dari ◽

Marcelo Javier Venere ◽

Alejandro Clausse

Keyword(s):

Lattice Boltzmann ◽

Navier Stokes ◽

Graphic Processing Units ◽

Graphic Processing

Download Full-text

PAC-k: A Parallel Aho-Corasick String Matching Approach on Graphic Processing Units Using Non-Overlapped Threads

IEICE Transactions on Communications ◽

10.1587/transcom.2015ebp3411 ◽

2016 ◽

Vol E99.B (7) ◽

pp. 1523-1531 ◽

Cited By ~ 4

Author(s):

ThienLuan HO ◽

Seung-Rohk OH ◽

HyunJin KIM

Keyword(s):

String Matching ◽

Graphic Processing Units ◽

Graphic Processing

Download Full-text

Planning Mobile Cloud Infrastructures Using Stochastic Petri Nets and Graphic Processing Units

2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom) ◽

10.1109/cloudcom.2015.46 ◽

2015 ◽

Cited By ~ 4

Author(s):

Francisco Airton Silva ◽

Matheus Rodrigues ◽

Paulo Maciel ◽

Sokol Kosta ◽

Alessandro Mei

Keyword(s):

Petri Nets ◽

Stochastic Petri Nets ◽

Mobile Cloud ◽

Graphic Processing Units ◽

Cloud Infrastructures ◽

Graphic Processing

Download Full-text

Analysis of random noise generated by graphic processing units

International Journal of Services Technology and Management ◽

10.1504/ijstm.2017.081880 ◽

2017 ◽

Vol 23 (1/2) ◽

pp. 3

Author(s):

Yongjin Yeom ◽

Taeill Yoo

Keyword(s):

Random Noise ◽

Graphic Processing Units ◽

Graphic Processing

Download Full-text

Parallelized two-dimensional particle-in-cell simulation for capacitively coupled plasmas using graphic processing units

2012 Abstracts IEEE International Conference on Plasma Science ◽

10.1109/plasma.2012.6384063 ◽

2012 ◽

Author(s):

I. C. Song ◽

H. W. Bae ◽

S. W. Hwang ◽

H. Lee ◽

H. J. Lee

Keyword(s):

Two Dimensional ◽

Graphic Processing Units ◽

Capacitively Coupled ◽

Particle In Cell ◽

Cell Simulation ◽

Coupled Plasmas ◽

Graphic Processing

Download Full-text