Parallel Responsive Task on Dependable Responsive Multithreaded Processor II

Author(s):  
Hiroyuki Chishiro ◽  
Yusuke Hatori ◽  
Kohei Osawa ◽  
Keigo Mizotani ◽  
Nobuyuki Yamasaki
1999 ◽  
Vol 7 (1) ◽  
pp. 21-37
Author(s):  
Balaram Sinharoy

Over the last decade processor speed has increased dramatically, whereas the speed of the memory subsystem improved at a modest rate. Due to the increase in the cache miss latency (in terms of the processor cycle), processors stall on cache misses for a significant portion of its execution time. Multithreaded processors has been proposed in the literature to reduce the processor stall time due to cache misses. Although multithreading improves processor utilization, it may also increase cache miss rates, because in a multithreaded processor multiple threads share the same cache, which effectively reduces the cache size available to each individual thread. Increased processor utilization and the increase in the cache miss rate demands higher memory bandwidth. A novel compiler optimization method has been presented in this paper that improves data locality for each of the threads and enhances data sharing among the threads. The method is based on loop transformation theory and optimizes both spatial and temporal data locality. The created threads exhibit high level of intra‐thread and inter‐thread data locality which effectively reduces both the data cache miss rates and the total execution time of numerically intensive computation running on a multithreaded processor.


1994 ◽  
Vol 22 (4) ◽  
pp. 55-59
Author(s):  
Ghulam Chaudhry ◽  
Xuechang Li

2005 ◽  
Vol 17 (4) ◽  
pp. 456-462 ◽  
Author(s):  
Tsutomu Itou ◽  
◽  
Nobuyuki Yamasaki ◽  

<I>Responsive Multithreaded (RMT) Processor</I> is designed for distributed real-time systems. This paper focuses on the multimedia processing architecture of <I>RMT Processor</I>. Multimedia processing requires high-throughput calculation for bulky data processing. <I>RMT Processor</I> architecture is based on eight-way prioritized simultaneous multithreading, which executes each thread in order of priority. Since the priority of hard real-time threads is higher than that of multimedia processing threads, instruction issue slots used by the multimedia processing threads are few in <I>RMT Processor</I> when hard real-time threads are executed simultaneously. Therefore multimedia processing threads need to utilize instruction issue slots effectively to achieve high performance. We have designed a novel vector operation mechanism to process multimedia data efficiently in parallel. Because the same instructions are iterated in multimedia processing, the compound operation mechanism is designed to calculate more data per instruction in multimedia processing.


IEEE Micro ◽  
2004 ◽  
Vol 24 (2) ◽  
pp. 40-47 ◽  
Author(s):  
R. Kalla ◽  
B. Sinharoy ◽  
J.M. Tendler

Sign in / Sign up

Export Citation Format

Share Document