Stacked 3d package with improved bandwidth and power efficiency

2013 ◽  
Vol 2013 (DPC) ◽  
pp. 000347-000376
Author(s):  
Dev Gupta

Stacked die packages such as PoP are used in space - constrained portable systems e,g. SmartPhones. The processor ( SoC ) and Mobile DRAM chips are stacked vertically to fit the combination within the restricted board space. The use of wire bonds in the memory chips as well as package level interconnects introduces delays into the data transmitted between the chips and limits the max. clock rate ( at present about 533 MHz for LP DDR 2 memory ) and hence the data transfer rate / bandwidth to less than 6.4 GB per sec. For Smart Phones to become capable of PC quality Games and high speed video would require a doubling of the bandwidth to over 12.8 GB per sec. To achieve that target PoP type packages were to be replaced by 3-d stacks using TSVs and wide I/Os. However recently JEDEC has postponed the introduction of Wide I/O Mobile memory stacks ( using TSVs ) to 2015. So the bandwidth of PoP packages must be improved. A physical way to increase the Bandwidth between the SoC and memory chips in a PoP is by increasing the max. no. of vertical I/Os as this allows more channels / higher parallelism in data transfer. This can be accomplished by shrinking the pitch ( from current 0.4 mm to 0.3 mm ) and doubling the number of rows of vertical I/Os ( from current 2 to 4 ) in PoP. However this requires major effort for the Packaging supply chain and would still fall well short of the future Bandwidth and interconnect Power efficiency targets. In this paper we will describe modifications and additions to the baseline PoP package so that it can be operated at clock rates up to 800 MHz and deliver bandwidth of 12.8 GB per sec or more and match the Interconnect Power efficiency of 3-d stacks with TSVs. Compensation features are introduced for ea. interconnect line between the SoC and Memory in the PoP package. The features are integrated on chips that are inserted between the two layers of the package. The vertical interconnection between the two levels of the PoP are replaced and routed through this additional chip. Only minor changes are required in the baseline PoP package thus will not require long delay to shrink current PoP interconnect pitch etc. The additional chip is built with available technologies and would increase cost by only a fraction of that needed for stacks with TSV-based wide I/O. Simulation results will be presented and compared with test results.

2004 ◽  
Vol 43 (7B) ◽  
pp. 4811-4815 ◽  
Author(s):  
Daiichi Koide ◽  
Hitoshi Yanagisawa ◽  
Haruki Tokumaru ◽  
Shoichi Nakamura ◽  
Kiyoshi Ohishi ◽  
...  

Author(s):  
Junze Huang ◽  
Yueming Wang

Since bulk transfer bandwidth of the host is unstable, the universal serial bus (USB) 3.0 hyperspectral data transfer system can only achieve a data transfer rate of about 30 MBps which is less than one-fifteenth of USB 3.0 theoretical transfer rate of 5 Gbps. For aerial hyperspectral imager, data transfer system is required to meet different frame rates of detector for different speed-to-height ratios. In this paper, we propose a high-speed and adjustable synchronous transfer system. The USB 3.0 peripheral controller uses synchronous first in first out (FIFO) and automatic direct memory access (DMA) to achieve the highest data transfer bandwidth. The USB acquisition software collects a data block in every fixed time interval. The size in bytes of every data block must be an integer multiple of the maximum data packet payload size, which is a necessary condition for using automatic DMA and bulk transfers. The data transfer rate of the system could be adjusted by directly changing the data block size and acquisition time interval. The experimental results show that the synchronous transfer mechanism could facilitate the 100-MBps error-free and high data transfer bandwidth application on a hyperspectral data processing system.


Author(s):  
Pham Khoi Dong ◽  
Hung K Nguyen ◽  
Fawnizu Azmadi Hussin ◽  
Xuan Tu Tran

Security issues in high-speed data transfer between devices are always a big challenge. On the other hand, new data transfer standards such as IEEE P802.3bs 2017 stipulate the maximum data rate up to 400 Gbps. So, security encryptions need high throughput to meet data transfer rates and low latency to ensure the quality of services. In this paper, we propose a multi-core AES encryption hardware architecture to achieve ultra-high-throughput encryption. To reduce area cost and power consumption, these cores share the same KeyExpansion blocks. Fully parallel, outer round pipeline technique is also applied to the proposed architecture to achieve low latency encryption. The design has been modelled at RTL (Register-Transfer-Level) in VHDL and then synthesized with a CMOS 45nm technology using Synopsys Design Compiler. With 10-cores fully parallel and outer round pipeline, the implementation results show that our architecture achieves a throughput of 1 Tbps at the maximum operating frequency of 800 MHz. These results meet the speed requirements of future communication standards. In addition, our design also achieves a high power-efficiency of 2377 Gbps/W and area-efficiency of 833 Gbps/mm2, that is 2.6x and 4.5x higher than those of the other highest throughput of single-core AES, respectively.


2021 ◽  
Vol 10 (1) ◽  
Author(s):  
Sicong Wang ◽  
Chen Wei ◽  
Yuanhua Feng ◽  
Hongkun Cao ◽  
Wenzhe Li ◽  
...  

AbstractAlthough photonics presents the fastest and most energy-efficient method of data transfer, magnetism still offers the cheapest and most natural way to store data. The ultrafast and energy-efficient optical control of magnetism is presently a missing technological link that prevents us from reaching the next evolution in information processing. The discovery of all-optical magnetization reversal in GdFeCo with the help of 100 fs laser pulses has further aroused intense interest in this compelling problem. Although the applicability of this approach to high-speed data processing depends vitally on the maximum repetition rate of the switching, the latter remains virtually unknown. Here we experimentally unveil the ultimate frequency of repetitive all-optical magnetization reversal through time-resolved studies of the dual-shot magnetization dynamics in Gd27Fe63.87Co9.13. Varying the intensities of the shots and the shot-to-shot separation, we reveal the conditions for ultrafast writing and the fastest possible restoration of magnetic bits. It is shown that although magnetic writing launched by the first shot is completed after 100 ps, a reliable rewriting of the bit by the second shot requires separating the shots by at least 300 ps. Using two shots partially overlapping in space and minimally separated by 300 ps, we demonstrate an approach for GHz magnetic writing that can be scaled down to sizes below the diffraction limit.


2011 ◽  
Vol 52-54 ◽  
pp. 2021-2026
Author(s):  
Gui Ling Deng ◽  
Can Zhou

Thermal deformation is an important factor to affect the accuracy of the motorized spindle, the core component of high-speed machine tool. To understand the spindle system transient thermal characteristics of the high-speed turning center CH7516GS, some high-precision sensors and high-frequency data acquisition system is used to establish the temperature and displacement measuring system. The thermal deformation compensation model is established on the basis of the experimental test results.


2005 ◽  
Vol 50 (12) ◽  
pp. 2065-2069 ◽  
Author(s):  
R. Marquez ◽  
E. Altman ◽  
S. Sole-Alvarez

2007 ◽  
Vol 1054 ◽  
Author(s):  
Ruth Houbertz ◽  
Herbert Wolter ◽  
Volker Schmidt ◽  
Ladislav Kuna ◽  
Valentin Satzinger ◽  
...  

ABSTRACTThe integration of optical interconnects in printed circuit boards (PCB) is a rapidly growing field worldwide due to a continuously increasing need for high-speed data transfer. There are any concepts discussed, among which are the integration of optical fibers or the generation of waveguides by UV lithography, embossing, or direct laser writing. The devices presented so far require many different materials and process steps, but particularly also highly-sophisticated assembly steps in order to couple the optoelectronic elements to the generated waveguides. In order to overcome these restrictions, an innovative approach is presented which allows the embedding of optoelectronic components and the generation of optical waveguides in only one optical material. This material is an inorganic-organic hybrid polymer, in which the waveguides are processed by two-photon absorption (TPA) processes, initiated by ultra-short laser pulses. In particular, due to this integration and the possibility ofin situpositioning the optical waveguides with respect to the optoelectronic components by the TPA process, no complex packaging or assembly is necessary. Thus, the number of necessary processing steps is significantly reduced, which also contributes to the saving of resources such as energy or solvents. The material properties and the underlying processes will be discussed with respect to optical data transfer in PCBs.


2002 ◽  
Vol 41 (Part 1, No. 3B) ◽  
pp. 1804-1807 ◽  
Author(s):  
Gakuji Hashimoto ◽  
Hiroki Shima ◽  
Kenji Yamamoto ◽  
Tsutomu Maruyama ◽  
Takashi Nakao ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document