Parallel computing with graphics processing units for high-speed Monte Carlo simulation of photon migration

Erik Alerstam; Tomas Svensson; Stefan Andersson-Engels

doi:10.1117/1.3041496

Parallel computing with graphics processing units for high-speed Monte Carlo simulation of photon migration

Journal of Biomedical Optics ◽

10.1117/1.3041496 ◽

2008 ◽

Vol 13 (6) ◽

pp. 060504 ◽

Cited By ~ 232

Author(s):

Erik Alerstam ◽

Tomas Svensson ◽

Stefan Andersson-Engels

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Parallel Computing ◽

Graphics Processing Units ◽

High Speed ◽

Photon Migration ◽

Graphics Processing


Monte Carlo Simulation of Photon Migration in 3D Turbid Media Accelerated by Graphics Processing Units

Optics Express ◽

10.1364/oe.17.020178 ◽

2009 ◽

Vol 17 (22) ◽

pp. 20178 ◽

Cited By ~ 413

Author(s):

Qianqian Fang ◽

David A. Boas

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Graphics Processing Units ◽

Turbid Media ◽

Photon Migration ◽

Graphics Processing


Using graphics processing units to accelerate perturbation Monte Carlo simulation in a turbid medium

Journal of Biomedical Optics ◽

10.1117/1.jbo.17.4.040502 ◽

2012 ◽

Vol 17 (4) ◽

pp. 040502 ◽

Cited By ~ 12

Author(s):

Fuhong Cai

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Graphics Processing Units ◽

Turbid Medium ◽

Graphics Processing


Pricing derivatives on graphics processing units using Monte Carlo simulation

Concurrency and Computation: Practice and Experience ◽

10.1002/cpe.2862 ◽

2012 ◽

Vol 26 (9) ◽

pp. 1679-1697 ◽

Cited By ~ 12

Author(s):

L.A. Abbas-Turki ◽

S. Vialle ◽

B. Lapeyre ◽

P. Mercier

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Graphics Processing Units ◽

Graphics Processing


Integrated photonic FFT for photonic tensor operations towards efficient and high-speed neural networks

Nanophotonics ◽

10.1515/nanoph-2020-0055 ◽

2020 ◽

Vol 9 (13) ◽

pp. 4097-4108 ◽

Cited By ~ 1

Author(s):

Moustafa Ahmed ◽

Yas Al-Hadeethi ◽

Ahmed Bakry ◽

Hamed Dalir ◽

Volker J. Sorger

Keyword(s):

Neural Networks ◽

Graphics Processing Units ◽

High Speed ◽

Fourier Transforms ◽

Optoelectronic Device ◽

Small Sample ◽

Sample Number ◽

Chip Area ◽

Domain Specific ◽

Graphics Processing

Abstract: The technologically relevant task of feature extraction from data in deep-learning systems is routinely accomplished as repeated fast Fourier transforms (FFTs) performed electronically in prevalent domain-specific architectures such as graphics processing units (GPUs). However, electronic systems are limited with respect to power dissipation and delay, due to wire-charging challenges related to interconnect capacitance. Here we present a silicon photonics-based architecture for convolutional neural networks that harnesses the phase property of light to perform FFTs efficiently by executing the convolution as a multiplication in the Fourier domain. The algorithmic execution time is determined by the time of flight of the signal through this photonic reconfigurable passive FFT ‘filter’ circuit and is on the order of tens of picoseconds. A sensitivity analysis shows that this optical processor must be thermally phase-stabilized corresponding to a few degrees. Furthermore, we find that for a small sample number, the obtainable number of convolutions per (time, power, and chip area) outperforms GPUs by about two orders of magnitude. Lastly, we show that, conceptually, the optical FFT and convolution-processing performance is directly linked to the optoelectronic device level, and improvements in plasmonics, metamaterials, or nanophotonics are fueling next-generation densely interconnected intelligent photonic circuits with relevance for edge-computing 5G networks by processing tensor operations optically.
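The abstract's central idea, executing a convolution as a multiplication in the Fourier domain, can be illustrated numerically. The following is a minimal sketch, not the authors' photonic method: it assumes circular (periodic) convolution, uses a naive O(n^2) discrete Fourier transform as a stand-in for an FFT, and all function names and sample values are hypothetical.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (a stand-in for an FFT)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
        # The inverse transform carries the 1/n normalization.
        out.append(acc / n if inverse else acc)
    return out


def circular_convolve_direct(a, b):
    """Circular convolution computed straight from its definition."""
    n = len(a)
    return [sum(a[j] * b[(i - j) % n] for j in range(n)) for i in range(n)]


def circular_convolve_fourier(a, b):
    """Same result via the convolution theorem: transform, multiply, invert."""
    fa, fb = dft(a), dft(b)
    product = [x * y for x, y in zip(fa, fb)]
    return [v.real for v in dft(product, inverse=True)]


# Hypothetical sample data, chosen only for illustration.
signal = [1.0, 2.0, 3.0, 4.0]
kernel = [0.5, 0.25, 0.0, 0.0]

direct = circular_convolve_direct(signal, kernel)
fourier = circular_convolve_fourier(signal, kernel)
print(direct)
print(fourier)
```

Both routes give the same values up to floating-point rounding. The Fourier route replaces the nested convolution sum with one element-wise product, which is the step the abstract describes performing optically.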


Acceleration of Monte Carlo simulation of photon migration in complex heterogeneous media using Intel many-integrated core architecture

Journal of Biomedical Optics ◽

10.1117/1.jbo.20.8.085002 ◽

2015 ◽

Vol 20 (8) ◽

pp. 085002 ◽

Cited By ~ 6

Author(s):

Anton V. Gorshkov ◽

Mikhail Yu. Kirillin

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Heterogeneous Media ◽

Photon Migration ◽

Many Integrated Core


Investigation of Neutral Particles Using High Speed Camera and Monte-Carlo Simulation in the GAMMA 10 Central-Cell

Fusion Science & Technology ◽

10.13182/fst07-a1320 ◽

2007 ◽

Vol 51 (2T) ◽

pp. 82-85 ◽

Cited By ~ 4

Author(s):

Y. Nakashima ◽

Y. Higashizono ◽

N. Nishino ◽

H. Kawano ◽

M.K. Islam ◽

...

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

High Speed ◽

Central Cell ◽

High Speed Camera ◽

Neutral Particles


Efficient smart Monte Carlo based SSTA on graphics processing units with improved resource utilization

Proceedings of the 47th Design Automation Conference on - DAC '10 ◽

10.1145/1837274.1837474 ◽

2010 ◽

Cited By ~ 8

Author(s):

Vineeth Veetil ◽

Yung-Hsu Chang ◽

Dennis Sylvester ◽

David Blaauw

Keyword(s):

Monte Carlo ◽

Resource Utilization ◽

Graphics Processing Units ◽

Graphics Processing


Monte Carlo simulation of X-ray imaging using a graphics processing unit

2009 IEEE Nuclear Science Symposium Conference Record (NSS/MIC) ◽

10.1109/nssmic.2009.5402382 ◽

2009 ◽

Cited By ~ 10

Author(s):

A. Badal ◽

A. Badano

Keyword(s):

Monte Carlo Simulation ◽

Monte Carlo ◽

Graphics Processing Unit ◽

Processing Unit ◽

X Ray ◽

X Ray Imaging ◽

Graphics Processing


TESLA GPUs versus MPI with OpenMP for the Forward Modeling of Gravity and Gravity Gradient of Large Prisms Ensemble

Journal of Applied Mathematics ◽

10.1155/2013/437357 ◽

2013 ◽

Vol 2013 ◽

pp. 1-15 ◽

Cited By ~ 4

Author(s):

Carlos Couder-Castañeda ◽

Carlos Ortiz-Alemán ◽

Mauricio Gabriel Orozco-del-Castillo ◽

Mauricio Nava-Flores

Keyword(s):

Parallel Computing ◽

Graphics Processing Units ◽

Forward Modeling ◽

Gravity Gradient ◽

Constant Density ◽

Gravitational Fields ◽

Design And Implementation ◽

Cuda Technology ◽

Performance Results ◽

Graphics Processing

An implementation using CUDA technology on a single and on several graphics processing units (GPUs) is presented for the forward modeling of gravitational fields from a three-dimensional volumetric ensemble composed of unitary prisms of constant density. We compared the performance results obtained with the GPUs against a previous version coded in OpenMP with MPI, and we analyzed the results on both platforms. Today, the use of GPUs represents a breakthrough in parallel computing, which has led to the development of applications in a variety of fields. Nevertheless, in some applications the decomposition of the tasks is not trivial, as can be appreciated in this paper. Unlike a trivial decomposition of the domain, we propose to decompose the problem by sets of prisms and to use different memory spaces per CUDA processing core, avoiding the performance decay that would result from the constant calls to kernel functions needed in a parallelization by observation points. The design and implementation are the main contributions of this work, because the parallelization scheme implemented is not trivial. The performance results obtained are comparable to those of a small processing cluster.
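The decomposition described in this abstract (split the work by sets of prisms, give each worker its own memory space, then combine) can be sketched in a few lines. This is an illustration under loud assumptions, not the authors' CUDA code: each prism is approximated as a point mass at its center rather than evaluated with a closed-form prism formula, a serial loop stands in for parallel CUDA cores, and every name and sample value is hypothetical.

```python
G = 6.674e-11  # gravitational constant, m^3 kg^-1 s^-2


def gz_point_mass(obs, src, mass):
    """Vertical attraction at obs from a point mass at src (z positive down).

    Assumption: a point mass replaces the true prism response.
    """
    dx, dy, dz = src[0] - obs[0], src[1] - obs[1], src[2] - obs[2]
    r2 = dx * dx + dy * dy + dz * dz
    return G * mass * dz / (r2 ** 1.5)


def forward_by_prism_sets(observations, prisms, n_workers):
    """Each worker handles one set of prisms and writes to a private buffer."""
    buffers = [[0.0] * len(observations) for _ in range(n_workers)]
    for w in range(n_workers):  # serial stand-in for parallel workers
        for center, mass in prisms[w::n_workers]:
            for i, obs in enumerate(observations):
                buffers[w][i] += gz_point_mass(obs, center, mass)
    # Final reduction: sum the private buffers into one result per point.
    return [sum(buf[i] for buf in buffers) for i in range(len(observations))]


def forward_by_observation(observations, prisms):
    """Reference scheme: loop over observation points, summing all prisms."""
    return [sum(gz_point_mass(obs, c, m) for c, m in prisms)
            for obs in observations]


# Hypothetical data: 3 surface stations and a 4 x 4 layer of buried cells.
observations = [(0.0, 0.0, 0.0), (50.0, 0.0, 0.0), (100.0, 50.0, 0.0)]
prisms = [((10.0 * ix, 10.0 * iy, 100.0), 2.7e6)
          for ix in range(4) for iy in range(4)]

by_sets = forward_by_prism_sets(observations, prisms, n_workers=4)
by_obs = forward_by_observation(observations, prisms)
print(by_sets)
print(by_obs)
```

Because every worker accumulates into its own buffer, no two workers write to the same location, and the only shared step is the final reduction. The two schemes agree up to floating-point summation order.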
