A Cost and Performance Analytical Model for Large-Scale On-Chip Interconnection Networks

A comprehensive analytical model of interconnection networks in large-scale cluster systems

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.1222 ◽

2007 ◽

Vol 20 (1) ◽

pp. 75-97 ◽

Cited By ~ 10

Author(s):

Bahman Javadi ◽

Jemal H. Abawajy ◽

Mohammad K. Akbari

Keyword(s):

Analytical Model ◽

Interconnection Networks ◽

Large Scale ◽

Cluster Systems


Adaptive Channel Buffers in On-Chip Interconnection Networks— A Power and Performance Analysis

IEEE Transactions on Computers ◽

10.1109/tc.2008.77 ◽

2008 ◽

Vol 57 (9) ◽

pp. 1169-1181 ◽

Cited By ~ 20

Author(s):

Avinash Karanth Kodi ◽

Ashwini Sarathy ◽

Ahmed Louri

Keyword(s):

Performance Analysis ◽

Interconnection Networks ◽

And Performance ◽

On Chip


An analysis of on-chip interconnection networks for large-scale chip multiprocessors

ACM Transactions on Architecture and Code Optimization ◽

10.1145/1736065.1736069 ◽

2010 ◽

Vol 7 (1) ◽

pp. 1-28 ◽

Cited By ~ 60

Author(s):

Daniel Sanchez ◽

George Michelogiannakis ◽

Christos Kozyrakis

Keyword(s):

Interconnection Networks ◽

Large Scale ◽

Chip Multiprocessors ◽

On Chip


Microstructures of Al(Pd, Nb) and Al(Ti) lines for VLSI

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100131577 ◽

1992 ◽

Vol 50 (2) ◽

pp. 1388-1389

Author(s):

C. Stanis ◽

D. Smith ◽

P. Blauner ◽

M. Small

Keyword(s):

Large Scale ◽

Thermal Stresses ◽

Feature Size ◽

Silicon Devices ◽

Large Scale Integration ◽

Thermal Expansivities ◽

Current Densities ◽

And Performance ◽

On Chip ◽

Scale Integration

Very Large Scale Integration has necessitated an ongoing and rapid decrease in the minimum feature size that must be made on silicon devices, with the aims of improving productivity and performance. Conductor lines are commonly made from Al(Cu). Widths of 1.5 μm for conductor lines are common today, submicron lines are in late stages of development, and 0.25 μm lines will be needed. These dimensions present new issues, since the feature size is of the same order as the grain size of the Al and other metal alloys presently used for chip wiring. To make on-chip wiring reliable at these dimensions, it is necessary to optimise its resistance to the stresses placed on it: electromigration due to increasing current densities, and thermal stresses due to differences in thermal expansivities. The kinetics of both processes are dominated by interface transport. The resistance of the metal to both stresses can be modified by alloying.


Evaluation and Performance Comparison of TriBA with existing On-Chip Interconnection Networks

2007 International Conference on Emerging Technologies ◽

10.1109/icet.2007.4516360 ◽

2007 ◽

Author(s):

Haroon-Ur-Rashid ◽

Shi Feng ◽

Muhammad Kamran ◽

Ji Weixing

Keyword(s):

Interconnection Networks ◽

Performance Comparison ◽

And Performance ◽

On Chip


An Evaluation of an Integrated On-Chip/Off-Chip Network for High-Performance Reconfigurable Computing

International Journal of Reconfigurable Computing ◽

10.1155/2012/564704 ◽

2012 ◽

Vol 2012 ◽

pp. 1-15 ◽

Cited By ~ 6

Author(s):

Andrew G. Schmidt ◽

William V. Kritikos ◽

Shanyuan Gao ◽

Ron Sass

Keyword(s):

Integrated Circuit ◽

Reconfigurable Computing ◽

High Performance ◽

Large Scale ◽

The Body ◽

And Performance ◽

On Chip ◽

Point To Point ◽

Computing Machines ◽

Performance Computing

As the number of cores per discrete integrated circuit (IC) device grows, the importance of the network on chip (NoC) increases. However, the body of research in this area has focused on discrete IC devices alone, which may or may not serve the high-performance computing community, which needs to assemble many of these devices into very large scale parallel computing machines. This paper describes an integrated on-chip/off-chip network that has been implemented on an all-FPGA computing cluster. The system supports MPI-style point-to-point messages, collectives, and other novel communication. Results include the resource utilization and performance (in latency and bandwidth).


Two-Level FIFO Buffer Design for Routers in On-Chip Interconnection Networks

IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences ◽

10.1587/transfun.e94.a.2412 ◽

2011 ◽

Vol E94-A (11) ◽

pp. 2412-2424 ◽

Cited By ~ 1

Author(s):

Po-Tsang HUANG ◽

Wei HWANG

Keyword(s):

Interconnection Networks ◽

Buffer Design ◽

On Chip


Efficient Instruction and Data Caching for High Performance Embedded Processors

Jornada de Jóvenes Investigadores del I3A ◽

10.26754/jji-i3a.201201788 ◽

1970 ◽

pp. 9

Author(s):

A. Ferrerón Labari ◽

D. Suárez Gracia ◽

V. Viñals Yúfera

Keyword(s):

Embedded Systems ◽

Power Consumption ◽

Low Power ◽

Interconnection Networks ◽

High Performance ◽

Critical Issue ◽

Content Management ◽

Structure Design ◽

Portable Devices ◽

On Chip

In recent years, embedded systems have evolved to offer capabilities that were previously found only in high-performance systems. Portable devices already have on-chip multiprocessors (such as PowerPC 476FP or ARM Cortex A9 MP), usually multi-threaded, and a powerful on-chip multi-level cache memory hierarchy. As most of these systems are battery-powered, power consumption becomes a critical issue. Achieving both high performance and low power consumption is a highly complex challenge, for which some proposals have already been made. Suarez et al. proposed a new on-chip cache hierarchy, the LP-NUCA (Low Power NUCA), which reduces access latency by taking advantage of NUCA (Non-Uniform Cache Architecture) properties. The key points are decoupling the functionality and using three specialized on-chip networks. This structure has been shown to be efficient for data hierarchies, achieving good performance and reducing energy consumption. Instruction caches, on the other hand, have different requirements and characteristics from data caches, and these conflict with the requirements of low-power embedded systems, especially in SMT (simultaneous multi-threading) environments. We want to study the benefits of using small tiled caches for the instruction hierarchy, so we propose a new design, ID-LP-NUCAs. This requires a complete re-evaluation of our previous design in terms of structure design, interconnection networks (including topologies, flow control and routing), content management (with special interest in hardware/software content allocation policies), and structure sharing. In CMP (chip multiprocessor) environments with parallel workloads, coherence plays an important role and must be taken into consideration.
