Large-Scale On-Chip Dynamic Programming Network Inferences Using Moderated Inter-core Communication

Optimization for Large-Scale Multi-Mission Space Campaign Design by Approximate Dynamic Programming

2018 AIAA SPACE and Astronautics Forum and Exposition ◽

10.2514/6.2018-5287 ◽

2018 ◽

Cited By ~ 1

Author(s):

Hao Chen ◽

Arthur Lapin ◽

Takaya Ukai ◽

Chao Lei ◽

Koki Ho

Keyword(s):

Dynamic Programming ◽

Large Scale ◽

Approximate Dynamic Programming

Download Full-text

Large-scale dynamic system optimization using dual decomposition method with approximate dynamic programming

Systems & Control Letters ◽

10.1016/j.sysconle.2021.104894 ◽

2021 ◽

Vol 150 ◽

pp. 104894

Author(s):

Pegah Rokhforoz ◽

Hamed Kebriaei ◽

Majid Nili Ahmadabadi

Keyword(s):

Dynamic Programming ◽

Dynamic System ◽

Decomposition Method ◽

Large Scale ◽

Approximate Dynamic Programming ◽

System Optimization ◽

Dual Decomposition

Download Full-text

Fast and Cycle-Accurate Emulation of Large-Scale Networks-on-Chip Using a Single FPGA

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3151758 ◽

2017 ◽

Vol 10 (4) ◽

pp. 1-27 ◽

Cited By ~ 4

Author(s):

Thiem Van Chu ◽

Shimpei Sato ◽

Kenji Kise

Keyword(s):

Large Scale ◽

Networks On Chip ◽

On Chip ◽

Large Scale Networks

Download Full-text

Combined Aggregated Sampling Stochastic Dynamic Programming and Simulation-Optimization to Derive Operation Rules for Large-Scale Hydropower System

Energies ◽

10.3390/en14030625 ◽

2021 ◽

Vol 14 (3) ◽

pp. 625

Author(s):

Xinyu Wu ◽

Rui Guo ◽

Xilong Cheng ◽

Chuntian Cheng

Keyword(s):

Dynamic Programming ◽

Stochastic Dynamic Programming ◽

Large Scale ◽

Simulation Optimization ◽

Optimal Operation ◽

Stochastic Dynamic ◽

Operation Model ◽

Two Stage ◽

Operation Rules ◽

Reservoir Systems

Simulation-optimization methods are often used to derive operation rules for large-scale hydropower reservoir systems. The solution of the simulation-optimization models is complex and time-consuming, for many interconnected variables need to be optimized, and the objective functions need to be computed through simulation in many periods. Since global solutions are seldom obtained, the initial solutions are important to the solution quality. In this paper, a two-stage method is proposed to derive operation rules for large-scale hydropower systems. In the first stage, the optimal operation model is simplified and solved using sampling stochastic dynamic programming (SSDP). In the second stage, the optimal operation model is solved by using a genetic algorithm, taking the SSDP solution as an individual in the initial population. The proposed method is applied to a hydropower system in Southwest China, composed of cascaded reservoir systems of Hongshui River, Lancang River, and Wu River. The numerical result shows that the two-stage method can significantly improve the solution in an acceptable solution time.

Download Full-text

Spark-based parallel dynamic programming and particle swarm optimization via cloud computing for a large-scale reservoir system

Journal of Hydrology ◽

10.1016/j.jhydrol.2021.126444 ◽

2021 ◽

pp. 126444

Author(s):

Yufei Ma ◽

Ping-an Zhong ◽

Bin Xu ◽

Feilin Zhu ◽

Qingwen Lu ◽

...

Keyword(s):

Cloud Computing ◽

Dynamic Programming ◽

Particle Swarm Optimization ◽

Large Scale ◽

Particle Swarm ◽

Swarm Optimization ◽

Reservoir System

Download Full-text

Nanolaser-based emulators of spin Hamiltonians

Nanophotonics ◽

10.1515/nanoph-2020-0230 ◽

2020 ◽

Vol 9 (13) ◽

pp. 4193-4198 ◽

Cited By ~ 2

Author(s):

Midya Parto ◽

William E. Hayenga ◽

Alireza Marandi ◽

Demetrios N. Christodoulides ◽

Mercedeh Khajavikhan

Keyword(s):

Large Scale ◽

Optimization Problems ◽

Exchange Interactions ◽

Np Hard ◽

Spin Models ◽

Geometric Frustration ◽

Spin Hamiltonians ◽

Hard Problems ◽

On Chip ◽

Classical Spin Models

AbstractFinding the solution to a large category of optimization problems, known as the NP-hard class, requires an exponentially increasing solution time using conventional computers. Lately, there has been intense efforts to develop alternative computational methods capable of addressing such tasks. In this regard, spin Hamiltonians, which originally arose in describing exchange interactions in magnetic materials, have recently been pursued as a powerful computational tool. Along these lines, it has been shown that solving NP-hard problems can be effectively mapped into finding the ground state of certain types of classical spin models. Here, we show that arrays of metallic nanolasers provide an ultra-compact, on-chip platform capable of implementing spin models, including the classical Ising and XY Hamiltonians. Various regimes of behavior including ferromagnetic, antiferromagnetic, as well as geometric frustration are observed in these structures. Our work paves the way towards nanoscale spin-emulators that enable efficient modeling of large-scale complex networks.

Download Full-text

Simba

Communications of the ACM ◽

10.1145/3460227 ◽

2021 ◽

Vol 64 (6) ◽

pp. 107-116

Author(s):

Yakun Sophia Shao ◽

Jason Cemons ◽

Rangharajan Venkatesan ◽

Brian Zimmer ◽

Matthew Fojtik ◽

...

Keyword(s):

Deep Learning ◽

Large Scale ◽

Data Locality ◽

Coarse Grained ◽

Batch Size ◽

Peak Performance ◽

Large Scale Systems ◽

High Area ◽

On Chip ◽

And Storage

Package-level integration using multi-chip-modules (MCMs) is a promising approach for building large-scale systems. Compared to a large monolithic die, an MCM combines many smaller chiplets into a larger system, substantially reducing fabrication and design costs. Current MCMs typically only contain a handful of coarse-grained large chiplets due to the high area, performance, and energy overheads associated with inter-chiplet communication. This work investigates and quantifies the costs and benefits of using MCMs with finegrained chiplets for deep learning inference, an application domain with large compute and on-chip storage requirements. To evaluate the approach, we architected, implemented, fabricated, and tested Simba, a 36-chiplet prototype MCM system for deep-learning inference. Each chiplet achieves 4 TOPS peak performance, and the 36-chiplet MCM package achieves up to 128 TOPS and up to 6.1 TOPS/W. The MCM is configurable to support a flexible mapping of DNN layers to the distributed compute and storage units. To mitigate inter-chiplet communication overheads, we introduce three tiling optimizations that improve data locality. These optimizations achieve up to 16% speedup compared to the baseline layer mapping. Our evaluation shows that Simba can process 1988 images/s running ResNet-50 with a batch size of one, delivering an inference latency of 0.50 ms.

Download Full-text

A linear-time eigenvalue solver for finite-element-based analysis of large-scale wave propagation problems in on-chip interconnect structures

2008 IEEE Antennas and Propagation Society International Symposium ◽

10.1109/aps.2008.4619427 ◽

2008 ◽

Author(s):

Jongwon Lee ◽

V. Balakrishnan ◽

Cheng-Kok Koh ◽

Dan Jiao

Keyword(s):

Finite Element ◽

Wave Propagation ◽

Large Scale ◽

Linear Time ◽

On Chip ◽

Eigenvalue Solver

Download Full-text

A unified finite-element solution from zero frequency to microwave frequencies for full-wave modeling of large-scale three-dimensional on-chip interconnect structures

2008 IEEE Antennas and Propagation Society International Symposium ◽

10.1109/aps.2008.4618921 ◽

2008 ◽

Author(s):

Jianfang Zhu ◽

Dan Jiao

Keyword(s):

Finite Element ◽

Large Scale ◽

Three Dimensional ◽

Finite Element Solution ◽

Element Solution ◽

Wave Modeling ◽

Microwave Frequencies ◽

Full Wave ◽

On Chip ◽

Zero Frequency

Download Full-text

Comparison of particle swarm optimization and dynamic programming for large scale hydro unit load dispatch

Energy Conversion and Management ◽

10.1016/j.enconman.2009.07.020 ◽

2009 ◽

Vol 50 (12) ◽

pp. 3007-3014 ◽

Cited By ~ 64

Author(s):

Chun-tian Cheng ◽

Sheng-li Liao ◽

Zi-Tian Tang ◽

Ming-yan Zhao

Keyword(s):

Dynamic Programming ◽

Particle Swarm Optimization ◽

Large Scale ◽

Particle Swarm ◽

Unit Load ◽

Swarm Optimization

Download Full-text