processor caches Latest Research Papers

Processor caches have fixed line size. A processor cache defined by tuple (C, k, L) where C is the capacity, k associativity and L line size has fixed values for the parameters. Algorithms to have variable processor cache line size are proposed in literature. This paper proposes algorithm to have variable cache line size based on the miss count for any application. The line size is varied by increasing or decreasing line size based on the miss count for any time interval. The algorithm can be used in running any application. The SPEC2000 benchmarks are used for simulating the proposed algorithm for cache with one level. The average memory access time is chosen as performance parameter. A performance improvement of 12% is observed with energy saving of 18% for chosen parameters.

Download Full-text

THE PROBLEM OF PROVIDING CACHE COHERENCE IN MULTIPROCESSOR SYSTEMS WITH MANY PROCESSORS

Issues of radio electronics ◽

10.21778/2218-5453-2018-5-47-53 ◽

2018 ◽

pp. 47-53

Author(s):

B. Z. Shmeylin ◽

E. A. Alekseeva

Keyword(s):

Cache Coherence ◽

Bloom Filters ◽

Multiprocessor Systems ◽

Cache Line ◽

Maintenance Systems ◽

Processor Caches ◽

Conventional Systems ◽

Additional Hardware

In this paper the tasks of managing the directory in coherence maintenance systems in multiprocessor systems with a large number of processors are solved. In microprocessor systems with a large number of processors (MSLP) the problem of maintaining the coherence of processor caches is significantly complicated. This is due to increased traffic on the memory buses and increased complexity of interprocessor communications. This problem is solved in various ways. In this paper, we propose the use of Bloom filters used to accelerate the determination of an element’s belonging to a certain array. In this article, such filters are used to establish the fact that the processor belongs to some subset of the processors and determine if the processor has a cache line in the set. In the paper, the processes of writing and reading information in the data shared between processors are discussed in detail, as well as the process of data replacement from private caches. The article also shows how the addresses of cache lines and processor numbers are removed from the Bloom filters. The system proposed in this paper allows significantly speeding up the implementation of operations to maintain cache coherence in the MSLP as compared to conventional systems. In terms of performance and additional hardware and software costs, the proposed system is not inferior to the most efficient of similar systems, but on some applications and significantly exceeds them.

Download Full-text

A Survey of Recent Prefetching Techniques for Processor Caches

ACM Computing Surveys ◽

10.1145/2907071 ◽

2016 ◽

Vol 49 (2) ◽

pp. 1-35 ◽

Cited By ~ 23

Author(s):

Sparsh Mittal

Keyword(s):

Processor Caches

Download Full-text

Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors

Scientific Programming ◽

10.1155/2015/981759 ◽

2015 ◽

Vol 2015 ◽

pp. 1-16 ◽

Cited By ~ 7

Author(s):

Ananya Muddukrishna ◽

Peter A. Jonsson ◽

Mats Brorsson

Keyword(s):

Task Scheduling ◽

Data Distribution ◽

Data Access ◽

Improve Performance ◽

Manycore Processors ◽

Cache Access ◽

On Chip ◽

Processor Caches ◽

Architectural Knowledge ◽

The Impact

Performance degradation due to nonuniform data access latencies has worsened on NUMA systems and can now be felt on-chip in manycore processors. Distributing data across NUMA nodes and manycore processor caches is necessary to reduce the impact of nonuniform latencies. However, techniques for distributing data are error-prone and fragile and require low-level architectural knowledge. Existing task scheduling policies favor quick load-balancing at the expense of locality and ignore NUMA node/manycore cache access latencies while scheduling. Locality-aware scheduling, in conjunction with or as a replacement for existing scheduling, is necessary to minimize NUMA effects and sustain performance. We present a data distribution and locality-aware scheduling technique for task-based OpenMP programs executing on NUMA systems and manycore processors. Our technique relieves the programmer from thinking of NUMA system/manycore processor architecture details by delegating data distribution to the runtime system and uses task data dependence information to guide the scheduling of OpenMP tasks to reduce data stall times. We demonstrate our technique on a four-socket AMD Opteron machine with eight NUMA nodes and on the TILEPro64 processor and identify that data distribution and locality-aware task scheduling improve performance up to 69% for scientific benchmarks compared to default policies and yet provide an architecture-oblivious approach for programmers.

Download Full-text

A Software-Based Self-Test methodology for on-line testing of processor caches

2011 IEEE International Test Conference ◽

10.1109/test.2011.6139154 ◽

2011 ◽

Cited By ~ 7

Author(s):

G. Theodorou ◽

N. Kranitis ◽

A. Paschalis ◽

D. Gizopoulos

Keyword(s):

Test Methodology ◽

On Line ◽

Self Test ◽

Processor Caches

Download Full-text

Processor caches built using multi-level spin-transfer torque RAM cells

IEEE/ACM International Symposium on Low Power Electronics and Design ◽

10.1109/islped.2011.5993610 ◽

2011 ◽

Cited By ~ 25

Author(s):

Yiran Chen ◽

Weng-Fai Wong ◽

Hai Li ◽

Cheng-Kok Koh

Keyword(s):

Spin Transfer Torque ◽

Spin Transfer ◽

Multi Level ◽

Processor Caches

Download Full-text

SCIPS: An emulation methodology for fault injection in processor caches

2011 Aerospace Conference ◽

10.1109/aero.2011.5747450 ◽

2011 ◽

Cited By ~ 4

Author(s):

Nicholas Wulf ◽

Grzegorz Cieslewski ◽

Ann Gordon-Ross ◽

Alan D. George

Keyword(s):

Fault Injection ◽

Processor Caches

Download Full-text

Cache vulnerability equations for protecting data in embedded processor caches from soft errors

ACM SIGPLAN Notices ◽

10.1145/1755951.1755910 ◽

2010 ◽

Vol 45 (4) ◽

pp. 143-152 ◽

Cited By ~ 5

Author(s):

Aviral Shrivastava ◽

Jongeun Lee ◽

Reiley Jeyapaul

Keyword(s):

Soft Errors ◽

Embedded Processor ◽

Processor Caches

Download Full-text

processor caches
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Understanding Insecurity of Processor Caches Due to Cache Timing-Based Vulnerabilities

The Effects of Wide Vector Operations on Processor Caches

A Variable Processor Cache Line Size Architecture

THE PROBLEM OF PROVIDING CACHE COHERENCE IN MULTIPROCESSOR SYSTEMS WITH MANY PROCESSORS

A Survey of Recent Prefetching Techniques for Processor Caches

Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors

A Software-Based Self-Test methodology for on-line testing of processor caches

Processor caches built using multi-level spin-transfer torque RAM cells

SCIPS: An emulation methodology for fault injection in processor caches

Cache vulnerability equations for protecting data in embedded processor caches from soft errors

Export Citation Format

processor cachesRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Understanding Insecurity of Processor Caches Due to Cache Timing-Based Vulnerabilities

The Effects of Wide Vector Operations on Processor Caches

A Variable Processor Cache Line Size Architecture

THE PROBLEM OF PROVIDING CACHE COHERENCE IN MULTIPROCESSOR SYSTEMS WITH MANY PROCESSORS

A Survey of Recent Prefetching Techniques for Processor Caches

Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors

A Software-Based Self-Test methodology for on-line testing of processor caches

Processor caches built using multi-level spin-transfer torque RAM cells

SCIPS: An emulation methodology for fault injection in processor caches

Cache vulnerability equations for protecting data in embedded processor caches from soft errors

processor caches
Recently Published Documents