Multi-Physics Driven Co-Design of 3D Multicore Architectures

Author(s):  
He Xiao ◽  
Wen Yueh ◽  
Saibal Mukhopadhyay ◽  
Sudhakar Yalamanchili

The high heat flux and strong thermal coupling in the 3D ICs has limited the performance gains that would otherwise be feasible in 3D structures. The common practice of adopting worst-case design margins is in part responsible for this limitation since average-case performance would be limited by worst-case thermal design margins. The coupling between temperature and leakage power exacerbates this effect. However, worst-case thermal conditions are not the common state across the package at runtime. We argue for the co-design of the package, architecture, and power management based on the multi-physics interactions between temperature, power consumption and system performance. This approach suggests an adaptive architecture that accommodates the thermal coupling between layers and leads to increased energy efficiency over a wider operating voltage range and therefore higher performance. In this paper, we target at a 3D multicore architecture where the cores reside on one die and the last level cache (LLC) resides on the other. The DRAM stack may be stacked on top of the package (e.g., 3D) or in the same package (e.g., 2.5D). We propose a novel adaptive cache structure — the constant performance model (CPM) cache — based on voltage adaptations to temperature variations. We construct a HSPICE model for the SRAM to explore the relationship between temperature, supply voltage, and the circuit delay in the context of the LLC. This model is used to investigate, characterize, and analyze the effect of the temperature-delay dependence of the SRAM LLC configuration on the system-level performance and energy efficiency. This analysis gives rise to an intelligent scheme for dynamic voltage regulation in the LLC cache that is sensitive to the temperature of the individual cache banks. Each cache bank is thermally coupled to the associated cores and thus is sensitive to the local core-level power management. We show that this local adaptation to the temperature-delay dependence leads to a significant power reduction in the LLC cache, and improvement of system energy efficiency computed as energy per instruction (EPI). We evaluate our approach using a cycle-level, full system simulation model of a 16-core x86 homogenous microarchitecture in 16nm technology that boots a full Linux operating system and executes application binaries. The advantages of the proposed adaptive LLC structure illustrate the potential of the co-design of the package, architecture, and power management in future 3D multicore architectures.

2018 ◽  
Vol 30 (2) ◽  
pp. 213-227 ◽  
Author(s):  
Wen Cai ◽  
Ryan L Harne

In recent years, great advances in understanding the opportunities for nonlinear vibration energy harvesting systems have been achieved giving attention to either the structural or electrical subsystems. Yet, a notable disconnect appears in the knowledge on optimal means to integrate nonlinear energy harvesting structures with effective nonlinear rectifying and power management circuits for practical applications. Motivated to fill this knowledge gap, this research employs impedance principles to investigate power optimization strategies for a nonlinear vibration energy harvester interfaced with a bridge rectifier and a buck-boost converter. The frequency and amplitude dependence of the internal impedance of the harvester structure challenges the conventional impedance matching concepts. Instead, a system-level optimization strategy is established and validated through simulations and experiments. Through careful studies, the means to optimize the electrical power with partial information of the electrical load is revealed and verified in comparison to the full analysis. These results suggest that future study and implementation of optimal nonlinear energy harvesting systems may find effective guidance through power flow concepts built on linear theories despite the presence of nonlinearities in structures and circuits.


Energies ◽  
2018 ◽  
Vol 11 (11) ◽  
pp. 3143 ◽  
Author(s):  
Ignacio Acosta ◽  
Miguel Ángel Campano ◽  
Samuel Domínguez-Amarillo ◽  
Carmen Muñoz

Daylight performance metrics provide a promising approach for the design and optimization of lighting strategies in buildings and their management. Smart controls for electric lighting can reduce power consumption and promote visual comfort using different control strategies, based on affordable technologies and low building impact. The aim of this research is to assess the energy efficiency of these smart controls by means of dynamic daylight performance metrics, to determine suitable solutions based on the geometry of the architecture and the weather conditions. The analysis considers different room dimensions, with variable window size and two mean surface reflectance values. DaySim 3.1 lighting software provides the simulations for the study, determining the necessary quantification of dynamic metrics to evaluate the usefulness of the proposed smart controls and their impact on energy efficiency. The validation of dynamic metrics is carried out by monitoring a mesh of illuminance-meters in test cells throughout one year. The results showed that, for most rooms more than 3.00 m deep, smart controls achieve worthwhile energy savings and a low payback period, regardless of weather conditions and for worst-case situations. It is also concluded that dimming systems provide a higher net present value and allow the use of smaller window size than other control solutions.


2012 ◽  
Vol 198-199 ◽  
pp. 523-527
Author(s):  
Fang Yuan Chen ◽  
Dong Song Zhang ◽  
Zhi Ying Wang

Worst-Case Execution Time (WCET) is crucial in real-time systems and is very challenging in multicore processors due to the possible runtime inter-thread interferences caused by shared resources. This paper proposes a novel approach to analyze runtime inter-core interferences for consecutive or inconsecutive concurrent programs. Our approach can reasonably estimate runtime inter-core interferences in shared cache by introducing lifetime and instruction fetching timing relations analysis into address mapping method. Compared with the method based on lifetime alone, our proposed approach efficiently improves the tightness of WCET estimation.


Author(s):  
Enrico Macii ◽  
Vivek Tiwari ◽  
Massimo Poncino ◽  
Naehyuck Chang

2015 ◽  
Vol 138 (1) ◽  
Author(s):  
Jesse Austin-Breneman ◽  
Bo Yang Yu ◽  
Maria C. Yang

During the early stage design of large-scale engineering systems, design teams are challenged to balance a complex set of considerations. The established structured approaches for optimizing complex system designs offer strategies for achieving optimal solutions, but in practice suboptimal system-level results are often reached due to factors such as satisficing, ill-defined problems, or other project constraints. Twelve subsystem and system-level practitioners at a large aerospace organization were interviewed to understand the ways in which they integrate subsystems in their own work. Responses showed subsystem team members often presented conservative, worst-case scenarios to other subsystems when negotiating a tradeoff as a way of hedging against their own future needs. This practice of biased information passing, referred to informally by the practitioners as adding “margins,” is modeled in this paper with a series of optimization simulations. Three “bias” conditions were tested: no bias, a constant bias, and a bias which decreases with time. Results from the simulations show that biased information passing negatively affects both the number of iterations needed and the Pareto optimality of system-level solutions. Results are also compared to the interview responses and highlight several themes with respect to complex system design practice.


Author(s):  
Ali Akbar Merrikh ◽  
Sridhar Sundaram ◽  
David Walshak ◽  
Yizhang Yang ◽  
Tom Dolbear

We present a methodology for optimizing footprint, metal mass and thermal performance of an aluminum extruded heatsink for cooling chipset microprocessors in server form-factor. The analysis is based on predefined volume flow rate of air at a constant temperature assumed to be available upstream of the package. The front-to-back cooling assumption covers the worst case ambient conditions, typical of chipset boundary condition in servers. We present studies covering a range of heatsink footprints in order to compare and minimize the heatsink footprint, at the same time satisfying thermal specification of the chipset microprocessor. The study also focuses on the system-level assessment of the optimum 60×40 mm2 footprint and corner cases by studying the effect of motherboard thermal conductivity as well as blockages on the heatsink case-to-ambient thermal resistance.


Sign in / Sign up

Export Citation Format

Share Document