Graphics processing unit (GPU) implementation of image processing algorithms to improve system performance of the control acquisition, processing, and image display system (CAPIDS) of the micro-angiographic fluoroscope (MAF)

Development and performance evaluation of virtual auditory display system to synthesize sound from multiple sound sources using graphics processing unit

The Journal of the Acoustical Society of America ◽

10.1121/1.4805735 ◽

2013 ◽

Vol 133 (5) ◽

pp. 3361-3361

Author(s):

Kanji Watanabe ◽

Yusuke Oikawa ◽

Sojun Sato ◽

Shouichi Takane ◽

Koji Abe

Keyword(s):

Performance Evaluation ◽

Graphics Processing Unit ◽

Processing Unit ◽

Auditory Display ◽

Display System ◽

Sound Sources ◽

And Performance ◽

Graphics Processing

Download Full-text

Graphics processing unit implementation of the F-statistic for continuous gravitational wave searches

Classical and Quantum Gravity ◽

10.1088/1361-6382/ac4616 ◽

2021 ◽

Author(s):

Liam Dunn ◽

Patrick Clearwater ◽

Andrew Melatos ◽

Karl Wette

Keyword(s):

Gravitational Wave ◽

Graphics Processing Units ◽

Graphics Processing Unit ◽

Computational Cost ◽

Processing Unit ◽

Central Processing ◽

Long Baseline ◽

Using Data ◽

Graphics Processing ◽

Gpu Implementation

Abstract The F-statistic is a detection statistic used widely in searches for continuous gravitational waves with terrestrial, long-baseline interferometers. A new implementation of the F-statistic is presented which accelerates the existing "resampling" algorithm using graphics processing units (GPUs). The new implementation runs between 10 and 100 times faster than the existing implementation on central processing units without sacrificing numerical accuracy. The utility of the GPU implementation is demonstrated on a pilot narrowband search for four newly discovered millisecond pulsars in the globular cluster Omega Centauri using data from the second Laser Interferometer Gravitational-Wave Observatory observing run. The computational cost is 17:2 GPU-hours using the new implementation, compared to 1092 core-hours with the existing implementation.

Download Full-text

A REVIEW ON IMAGE SEGMENTATION USING GPU

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v15i10.4502 ◽

2016 ◽

Vol 15 (10) ◽

pp. 7160-7163

Author(s):

Gurpreet Kaur ◽

Sonika Jindal

Keyword(s):

Image Processing ◽

Computer Vision ◽

Image Segmentation ◽

Graphics Processing Unit ◽

Processing Unit ◽

System Resources ◽

Cuda Architecture ◽

Graphics Processing

Image Segmentations play a heavy role in areas such as computer vision and image processing due to its broad usage and immense applications. Because of the large importance of image segmentation a number of algorithms have been proposed and different approaches have been adopted. Segmentation divides an image into distinct regions containing each pixel with similar attributes. The objective of apportioning is to simplify and/or alter the representation of an image into something that is more meaningful and more comfortable to break down. This paper discusses the various techniques implemented for image segmentation and discusses the various Computations that can be performed on the graphics processing unit (GPU) by means of the CUDA architecture in order to achieve fast performance and increase the utilization of available system resources.

Download Full-text

Design of graphics processing unit for image processing

2014 First International Conference on Computational Systems and Communications (ICCSC) ◽

10.1109/compsc.2014.7032666 ◽

2014 ◽

Cited By ~ 1

Author(s):

J. George Cherian Panappally ◽

M. S. Dhanesh

Keyword(s):

Image Processing ◽

Graphics Processing Unit ◽

Processing Unit ◽

Graphics Processing

Download Full-text

A GPU-accelerated continuous and discontinuous Galerkin non-hydrostatic atmospheric model

The International Journal of High Performance Computing Applications ◽

10.1177/1094342017694427 ◽

2017 ◽

Vol 33 (1) ◽

pp. 81-109 ◽

Cited By ~ 7

Author(s):

Daniel S Abdi ◽

Lucas C Wilcox ◽

Timothy C Warburton ◽

Francis X Giraldo

Keyword(s):

Discontinuous Galerkin ◽

Graphics Processing Unit ◽

Three Dimensional ◽

Atmospheric Model ◽

Benchmark Problems ◽

Processing Unit ◽

Multiple Thread ◽

And Performance ◽

Graphics Processing ◽

Gpu Implementation

We present a Graphics Processing Unit (GPU)-accelerated nodal discontinuous Galerkin method for the solution of the three-dimensional Euler equations that govern the motion and thermodynamic state of the atmosphere. Acceleration of the dynamical core of atmospheric models plays an important practical role in not only getting daily forecasts faster, but also in obtaining more accurate (high resolution) results within a given simulation time limit. We use algorithms suitable for the single instruction multiple thread architecture of GPUs to accelerate our model by two orders of magnitude relative to one core of a CPU. Tests on one node of the Titan supercomputer show a speedup of up to 15 times using the K20X GPU as compared to that on the 16-core AMD Opteron CPU. The scalability of the multi-GPU implementation is tested using 16,384 GPUs, which resulted in a weak scaling efficiency of about 90%. Finally, the accuracy and performance of our GPU implementation is verified using several benchmark problems representative of different scales of atmospheric dynamics.

Download Full-text

Implementation of Membrane Algorithms on GPU

Journal of Applied Mathematics ◽

10.1155/2014/307617 ◽

2014 ◽

Vol 2014 ◽

pp. 1-7 ◽

Cited By ~ 3

Author(s):

Xingyi Zhang ◽

Bangju Wang ◽

Zhuanlian Ding ◽

Jin Tang ◽

Juanjuan He

Keyword(s):

Graphics Processing Unit ◽

Processing Unit ◽

Matching Problem ◽

Computing Device ◽

Central Processing ◽

New Class ◽

Intractable Problems ◽

Point Set ◽

Graphics Processing ◽

Gpu Implementation

Membrane algorithms are a new class of parallel algorithms, which attempt to incorporate some components of membrane computing models for designing efficient optimization algorithms, such as the structure of the models and the way of communication between cells. Although the importance of the parallelism of such algorithms has been well recognized, membrane algorithms were usually implemented on the serial computing device central processing unit (CPU), which makes the algorithms unable to work in an efficient way. In this work, we consider the implementation of membrane algorithms on the parallel computing device graphics processing unit (GPU). In such implementation, all cells of membrane algorithms can work simultaneously. Experimental results on two classical intractable problems, the point set matching problem and TSP, show that the GPU implementation of membrane algorithms is much more efficient than CPU implementation in terms of runtime, especially for solving problems with a high complexity.

Download Full-text

Development and performance evaluation of virtual auditory display system to synthesize sound from multiple sound sources using graphics processing unit

10.1121/1.4799309 ◽

2013 ◽

Cited By ~ 1

Author(s):

Kanji Watanabe ◽

Yusuke Oikawa ◽

Sojun Sato ◽

Shouichi Takane ◽

Koji Abe

Keyword(s):

Performance Evaluation ◽

Graphics Processing Unit ◽

Processing Unit ◽

Auditory Display ◽

Display System ◽

Sound Sources ◽

And Performance ◽

Graphics Processing

Download Full-text

Performance Tradeoff Considerations in a Graphics Processing Unit (GPU) Implementation of a Low Detectable Aircraft Sensor System

51st AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition ◽

10.2514/6.2013-374 ◽

2013 ◽

Author(s):

Christopher Scannell ◽

Kevin Cox ◽

Joseph Collins ◽

William Smith ◽

Carlos Maraviglia

Keyword(s):

Graphics Processing Unit ◽

Sensor System ◽

Processing Unit ◽

Graphics Processing ◽

Gpu Implementation ◽

Performance Tradeoff

Download Full-text

Faster computation of elemental image generation for real-time integral imaging 3D display system using graphics processing unit and multi-directional projection scheme

Advances in Display Technologies IX ◽

10.1117/12.2509716 ◽

2019 ◽

Author(s):

Md. Ashraful Alam ◽

Mahfuze Subhani Protik ◽

Md. Sifatul Islam ◽

Mohd. Zishan Tareque ◽

M. Rashidur Rahman Rafi ◽

...

Keyword(s):

Graphics Processing Unit ◽

Processing Unit ◽

Image Generation ◽

Display System ◽

3D Display ◽

Integral Imaging ◽

Projection Scheme ◽

Time Integral ◽

Elemental Image ◽

Graphics Processing

Download Full-text

Efficient Prefix Scan for the GPU-Based Implementation of Random Forest

Advances in Social Networking and Online Communities - Handbook of Research on Interactive Information Quality in Expanding Social Network Communications ◽

10.4018/978-1-4666-7377-9.ch009 ◽

2015 ◽

pp. 140-151

Author(s):

Bojan Novak

Keyword(s):

Random Forest ◽

Graphics Processing Unit ◽

Processing Unit ◽

Random Forest Algorithm ◽

Central Processing ◽

Split Point ◽

Parallel Scan ◽

Graphics Processing ◽

Gpu Architecture ◽

Gpu Implementation

The random forest ensemble learning with the Graphics Processing Unit (GPU) version of prefix scan method is presented. The efficiency of the implementation of the random forest algorithm depends critically on the scan (prefix sum) algorithm. The prefix scan is used in the depth-first implementation of optimal split point computation. Described are different implementations of the prefix scan algorithms. The speeds of the algorithms depend on three factors: the algorithm itself, which could be improved, the programming skills, and the compiler. In parallel environments, things are even more complicated and depend on the programmer´s knowledge of the Central Processing Unit (CPU) or the GPU architecture. An efficient parallel scan algorithm that avoids bank conflicts is crucial for the prefix scan implementation. In our tests, multicore CPU and GPU implementation based on NVIDIA´s CUDA is compared.

Download Full-text